Unsloth refreshes Qwen3.5 GGUF lineup
OPEN_SOURCE
REDDIT // 32d ago // PRODUCT UPDATE


Unsloth says its final Qwen3.5 GGUF refresh adds an improved quantization algorithm, new imatrix calibration data, and tool-calling fixes across key Qwen3.5 variants including 27B, 35B-A3B, 122B-A10B, and 397B-A17B. The update is positioned as a real quality pass for local inference, with refreshed benchmarks and a recommendation to re-download the affected models.

// ANALYSIS

This is the kind of update that matters more than a flashy model drop: better quants, fewer template bugs, and clearer performance tradeoffs for people actually running large models locally.

  • Unsloth is optimizing for real workloads like chat, coding, long context, and tool calling, not just headline compression ratios
  • The new imatrix data and quantization changes suggest the team is tuning for practical quality retention instead of blindly minimizing model size
  • Retiring MXFP4 from several GGUF variants is notable because it amounts to admitting that some popular quant choices were hurting quality more than they helped
  • Publishing detailed KL divergence benchmarks and large research artifacts makes this more credible than the usual opaque “improved weights” announcement
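The KL-divergence benchmarks mentioned above measure how far a quantized model's next-token distribution drifts from the full-precision reference: lower divergence means the quant preserves the original model's behavior more faithfully. A minimal sketch of that metric (NumPy; the logit arrays here are toy placeholders, not real model outputs):

```python
import numpy as np

def kl_divergence(p_logits: np.ndarray, q_logits: np.ndarray) -> np.ndarray:
    """Per-token KL(P || Q) between two sets of logits.

    P is the full-precision reference distribution, Q the quantized
    model's distribution. Shape: (tokens, vocab) -> (tokens,).
    """
    def softmax(x):
        # Numerically stable softmax: subtract the row max before exp
        e = np.exp(x - x.max(axis=-1, keepdims=True))
        return e / e.sum(axis=-1, keepdims=True)

    p = softmax(p_logits)
    q = softmax(q_logits)
    # KL(P || Q) = sum_i p_i * (log p_i - log q_i)
    return np.sum(p * (np.log(p) - np.log(q)), axis=-1)

# Identical logits give zero divergence; any perturbation raises it
base = np.array([[2.0, 1.0, 0.1]])
print(kl_divergence(base, base))  # ~0.0
```

In practice the divergence is averaged over a large evaluation corpus, which is why a benchmark like this is a more honest quality signal than file size or perplexity alone.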
// TAGS
unsloth · qwen3.5 · llm · inference · benchmark · open-source

DISCOVERED: 2026-03-10 (32d ago)

PUBLISHED: 2026-03-06 (36d ago)

RELEVANCE: 8/10

AUTHOR: jferments