Qwen3.5-9B lands Opus 4.6 GGUF reasoning boost
This is a local-first Qwen3.5-9B finetune/export pipeline that leans on nohurry/Opus-4.6-Reasoning-3000x-filtered, mixes in function-calling and assistant data, and ships clean GGUF quants with llama.cpp. The author's first GSM8K pass on Q4_K_M lands around 0.84 exact match, and the RTX 4090 throughput numbers make the release feel practical rather than purely experimental.
This looks like a credible local-model release rather than a vanity quant drop: the recipe is focused, the early GSM8K number is strong, and the speed tradeoff data is actually useful. The real test is whether the reasoning gains hold up on messy instruction-following and structured outputs, which is where most small finetunes separate themselves.
- The blend of Opus 4.6 reasoning data with `Salesforce/xlam-function-calling-60k` and `OpenAssistant/oasst2` is the right kind of mix if the goal is a small assistant that can reason and format outputs, not just ace math.
- `Q4_K_M` looks like the day-to-day winner; it should be the first quant most people try, while `Q8_0` is the safer pick if you want to squeeze out a bit more fidelity.
- The benchmark story is still early because only `Q4_K_M` has a task eval so far; `Q8_0` is speed-tested, but not yet quality-compared head-to-head.
- The explicit naming (`opus46`, `mix`, `i1`) is a nice touch for reproducibility and future comparisons.
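The `Q4_K_M` vs `Q8_0` tradeoff is mostly a memory-for-fidelity trade, and the footprint side is simple arithmetic. A back-of-envelope sketch for a ~9B-parameter model; the bits-per-weight figures are approximate llama.cpp averages (`Q8_0` is 8.5 bpw by construction, `Q4_K_M` is roughly 4.8 bpw in practice), and KV cache and context overhead are ignored:

```python
# Rough GGUF file-size arithmetic for a ~9B-parameter model.
# Assumed average bits-per-weight; real quants vary slightly per
# tensor, and this ignores KV cache and runtime overhead.
PARAMS = 9e9
BPW = {"Q8_0": 8.5, "Q4_K_M": 4.8}

for name, bpw in BPW.items():
    gib = PARAMS * bpw / 8 / 2**30  # bits -> bytes -> GiB
    print(f"{name}: ~{gib:.1f} GiB")
```

On this estimate `Q4_K_M` lands near 5 GiB versus roughly 9 GiB for `Q8_0`, which is why the 4-bit quant tends to be the default on a single consumer GPU once context memory is accounted for.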
DISCOVERED
2026-03-23
PUBLISHED
2026-03-23
AUTHOR
RiverRatt