Qwen3.6 35B hits 564/41 on 5070 Ti

// 2h agoBENCHMARK RESULT

Qwen3.6 35B hits 564/41 on 5070 Ti

This Reddit post shows Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive running in GGUF Q4_K_M on an RTX 5070 Ti with a 262K context setup. The poster reports 564/41 token speed and shares the llama.cpp flags needed to keep the model usable with 16GB VRAM plus heavy RAM spillover.

// ANALYSIS

The setup leans on llama.cpp flags such as n-cpu-moe, cache-type-k q4_0, and cache-type-v q4_0 to make a 35B MoE model fit. The memory profile is the constraint: 10.8/16GB VRAM plus shared RAM and normal RAM pressure, so this is viable only on a relatively loaded but carefully managed workstation. A 262K context window is impressive, but it makes performance claims highly configuration-dependent rather than broadly transferable. The TurboQuants miss is a useful warning sign that local LLM tuning still has rough edges even when the base model runs well.

// TAGS

qwen3.6-35b-a3b-uncensored-hauhucs-aggressivellmopen-weightsquantizationmoegpubenchmark

DISCOVERED

2h ago

2026-05-11

PUBLISHED

3h ago

2026-05-11

RELEVANCE

9/ 10

AUTHOR

KptEmreU

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

NEWS2h ago

Claude case study shows solo operator power

An X post highlights a Hong Kong marketer who quit his job, went deep on AI, and reportedly used Claude to generate $360,000 in about a year. It reads more like social proof for Claude than a product announcement.

OPEN SOURCE2h ago

Hermes Agent powers agentic OS pitch

The video frames Hermes Agent as the persistent engine behind an agentic-OS-style setup: long-lived memory, reusable skills, multi-agent coordination, and hands-off workflows across chat and terminal surfaces. That pitch matches Hermes Agent v0.13.0, which shipped May 7 with durable Kanban-style task orchestration and stronger session persistence.

OPEN SOURCE2h ago

AionUi turns Hermes into agentic OS

AionUi is a local, open-source desktop app for coordinating AI agents across an entire machine. It positions itself as the UI and control plane for everyday agent work, with built-in assistants, skills, MCP support, remote access, scheduled automation, and native office-file workflows for slides, spreadsheets, documents, and PDFs.