Qwen3.5 397B probes pooled VRAM, RAM speeds

// 128d agoBENCHMARK RESULT

Qwen3.5 397B probes pooled VRAM, RAM speeds

Redditors are trying to pin down real tok/s for Qwen3.5-397B-A17B in hybrid VRAM+system RAM setups, because Unsloth’s 25+ tok/s claim depends heavily on CPU, channel count, and memory speed. The thread is less about theory than a practical buying guide for anyone considering this MoE model locally.

// ANALYSIS

The interesting question isn’t whether the model runs; it’s whether it stays fast once most of the weight shard spills out of VRAM and onto host memory.

–Qwen3.5-397B-A17B is a 397B-total, 17B-active MoE model, so hybrid offloading shifts the bottleneck from GPU compute to memory bandwidth.
–Unsloth’s headline number looks plausible only on very fast memory systems; throughput should swing hard between mainstream dual-channel desktops and high-bandwidth workstation or unified-memory setups.
–Community reports around similar Qwen3.5 local runs already vary widely, from roughly 10 tok/s in RAM-heavy multi-GPU rigs to the mid-30s on fast unified-memory hardware, which makes the ask for exact configs completely reasonable.
–For buyers, RAM topology matters almost as much as the GPU itself; if you want this model to feel responsive, memory bandwidth is the spec to watch.

// TAGS

qwen3.5-397b-a17bllminferencegpuopen-weightsbenchmark

DISCOVERED

128d ago

2026-03-19

PUBLISHED

128d ago

2026-03-19

RELEVANCE

8/ 10

AUTHOR

Leading-Month5590

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE1h ago

Softr adds visual co-building and vibe coding

Softr has introduced visual co-building alongside customizable vibe-coded blocks, pairing prompt-based AI generation with direct visual editing. The platform allows users to rapidly generate, adjust, and deploy custom business portals, CRMs, and internal tools, bridging the gap between natural language prompt creation and precise interface design.

UPDATE2h ago

Bribes.fyi unveils "Know Before You Go" bribe benchmarks

Bribes.fyi, an anonymous crowdsourced corruption transparency platform in India, has launched a new "Know Before You Go" feature. The tool aggregates user-reported bribery data into city breakdowns, department rankings, and service-level averages, enabling citizens to look up expected bribe amounts prior to visiting public offices while offering automated complaint letter generation for anti-corruption authorities.

OPEN SOURCE4h ago

Cli-Proxy-API Management Center launches WebUI configuration dashboard

Cli-Proxy-API Management Center is an open-source web interface designed to simplify the administration of CLI-Proxy-API instances. It replaces manual YAML configuration file editing with an intuitive visual dashboard for adjusting settings, monitoring runtime status, viewing live logs, and managing token authentication.