OPEN_SOURCE
REDDIT // BENCHMARK RESULT
Qwen3.5-397B-A17B tops local LLM benchmark tests
Developer u/awl130's "AI Analytical Intelligence Test" series crowns Qwen3.5-397B-A17B as the premier local LLM for high-spec workstations. Leveraging the 512GB of unified memory in the Mac Studio M3 Ultra, the model delivers frontier-level reasoning with a Mixture-of-Experts architecture that activates only 17B parameters per token.
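The "nearly 400GB" figure cited below for the Q8_0 quant can be sanity-checked with a back-of-envelope calculation. This sketch assumes GGUF's Q8_0 costs roughly 8.5 bits per weight (8-bit quantized values plus a per-block fp16 scale), and that all 397B parameters must be resident since a MoE model keeps every expert in memory even though only 17B are active per token:

```python
# Rough Q8_0 memory estimate for a 397B-parameter MoE model.
# Assumption: Q8_0 stores ~8.5 bits per weight (8-bit values plus
# a per-32-block fp16 scale, as in llama.cpp's GGUF format).

PARAMS = 397e9          # total parameters (all experts stay resident)
BITS_PER_WEIGHT = 8.5   # effective Q8_0 cost per weight

bytes_total = PARAMS * BITS_PER_WEIGHT / 8
gib = bytes_total / 2**30
print(f"~{gib:.0f} GiB")  # ≈ 393 GiB, consistent with "nearly 400GB"
```

That leaves only ~100GB of headroom on a 512GB machine for the KV cache, activations, and the OS, which is why the 512GB configuration is the practical floor for this quant.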
// ANALYSIS
Massive MoE models like Qwen 3.5 397B are redrawing the boundaries for local AI, proving that frontier-class intelligence is no longer restricted to multi-GPU data centers.
- High efficiency: 17B active parameters deliver intelligence comparable to top-tier proprietary models while maintaining a manageable compute footprint.
- Hardware threshold: Q8_0 quantization requires nearly 400GB of RAM, making the 512GB Mac Studio the only consumer device capable of hosting the model at high precision.
- Optimization breakthroughs: Jangq.ai's "mixed-precision" quantization prevents the coherence failures seen in standard 2-bit quants for large MoE architectures.
- Performance bottleneck: While the model fits in unified memory, the 800GB/s throughput of the M3 Ultra limits tokens-per-second, favoring deep reasoning over real-time chat.
- Ecosystem growth: The success of vMLX and MLX Studio suggests a maturing software stack for high-end local LLM inference on macOS.
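The bandwidth bottleneck noted above can be made concrete. Autoregressive decoding must stream the active weights for every generated token, so memory bandwidth sets a hard ceiling on tokens per second. This is a simplified estimate assuming Q8_0 at ~8.5 bits per weight, that only the 17B active parameters are read per token, and that weight streaming dominates (ignoring KV cache, activations, and router overhead):

```python
# Decode-speed ceiling from memory bandwidth: each generated token
# requires reading the active expert weights from unified memory.
# Assumptions: Q8_0 (~8.5 bits/weight), 17B active params per token,
# weight streaming dominates all other memory traffic.

ACTIVE_PARAMS = 17e9
BITS_PER_WEIGHT = 8.5
BANDWIDTH = 800e9  # M3 Ultra unified-memory bandwidth, bytes/s

bytes_per_token = ACTIVE_PARAMS * BITS_PER_WEIGHT / 8
ceiling_tps = BANDWIDTH / bytes_per_token
print(f"~{ceiling_tps:.0f} tokens/s upper bound")
```

Real-world throughput lands well below this ceiling, but even the theoretical bound shows why the setup suits long-form reasoning more than latency-sensitive chat.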
// TAGS
qwen3-5-397b-a17b · llm · open-weights · mac-studio · moe · benchmark · ai-coding · apple-silicon
DISCOVERED
2026-03-26
PUBLISHED
2026-03-26
RELEVANCE
8/10
AUTHOR
awl130