OPEN_SOURCE ↗
REDDIT · REDDIT// 4h agoBENCHMARK RESULT
Qwen3.6-35B-A3B crushes 27B UD on M5 Pro
A developer benchmarked Qwen 3.6 35B A3B against the 27B UD model on a 64GB M5 Pro Mac. The 35B MoE model ran 8x faster and scored higher on coding intelligence tasks than its dense counterpart.
// ANALYSIS
This benchmark highlights the stark performance and quality differences between MoE and dense architectures on local Apple Silicon hardware.
- –The 35B A3B MLX 4bit model (MoE) achieved ~72 tokens/s, while the 27B UD MLX 6bit model (dense) crawled at ~9 tokens/s.
- –Despite having fewer active parameters per token, the 35B MoE scored 9.8/10 versus 8.75/10 for the dense model on four coding tasks.
- –The 27B dense model failed to show its expected reasoning depth advantage, making the 35B MoE the clear choice for local development on high-end Macs.
// TAGS
qwen3-6-35b-a3bllmai-codingbenchmarkinference
DISCOVERED
4h ago
2026-04-25
PUBLISHED
4h ago
2026-04-25
RELEVANCE
8/ 10
AUTHOR
skyyyy007