OPEN_SOURCE
REDDIT // 23d ago
// NEWS
Qwen3.5 Users Trade Sampler Presets by Task
A r/LocalLLaMA thread is crowdsourcing the best local inference settings for Qwen3.5, with the poster sharing an Unsloth-based llama.cpp preset on a Q4_K_M GGUF and asking for better ways to keep the model from overthinking. The discussion focuses on quants, inference engines, and task-specific sampling knobs for chat versus coding.
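The thread's exact preset isn't reproduced here, but an Unsloth-style llama.cpp run on a Q4_K_M quant has roughly this shape. The model path and every sampler value below are illustrative assumptions, not the poster's actual settings; only the flag names are real llama.cpp options.

```shell
# Hypothetical llama.cpp chat run on a Q4_K_M GGUF quant.
# All values are illustrative assumptions, not the thread's preset:
#   --temp / --top-p / --top-k / --min-p shape sampling sharpness;
#   --presence-penalty nudges the model away from repetitive,
#   overly deliberate "thinking" loops.
llama-cli \
  -m ./Qwen3.5-Q4_K_M.gguf \
  --ctx-size 8192 \
  --temp 0.7 \
  --top-p 0.8 \
  --top-k 20 \
  --min-p 0.05 \
  --presence-penalty 1.0
```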
// ANALYSIS
The real story here is that Qwen3.5 is strong enough to create a new tuning problem: people are now optimizing behavior, not just benchmark quality.
- The posted preset is already fairly constrained, but the long reasoning budget and high presence penalty still leave the model feeling overly deliberate for casual chat
- Commenters are converging on separate presets by task, with lower-temp setups for coding and different sampler mixes for creative chat or general reasoning
- Qwen’s own recommendations are becoming the baseline, but local users are quickly diverging based on quant, engine, and workload
- llama.cpp plus GGUF remains the practical local stack, which makes sampler tuning almost as important as the model weights themselves
- This is a healthy sign for open weights: the debate has moved from “does it work?” to “how do we make it behave?”
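The per-task split commenters describe can be sketched as a small preset table rendered into llama.cpp CLI flags. Every numeric value here is a hypothetical placeholder, not a recommendation from the thread or from Qwen; only the flag names correspond to real llama.cpp options.

```python
# Hypothetical per-task sampler presets for a local Qwen3.5 run.
# The numbers are illustrative assumptions only -- tune them against
# your own quant, engine, and workload.
PRESETS = {
    # low temperature keeps code generation close to deterministic
    "coding":    {"temp": 0.3, "top_p": 0.9,  "top_k": 40, "min_p": 0.05, "presence_penalty": 0.0},
    # higher temperature and a presence penalty keep chat varied and terse
    "chat":      {"temp": 0.7, "top_p": 0.8,  "top_k": 20, "min_p": 0.05, "presence_penalty": 1.0},
    # middle ground for general reasoning tasks
    "reasoning": {"temp": 0.6, "top_p": 0.95, "top_k": 20, "min_p": 0.0,  "presence_penalty": 0.5},
}

def sampler_args(task: str) -> list[str]:
    """Render one preset as a flag list for llama-cli / llama-server."""
    p = PRESETS[task]
    return [
        "--temp", str(p["temp"]),
        "--top-p", str(p["top_p"]),
        "--top-k", str(p["top_k"]),
        "--min-p", str(p["min_p"]),
        "--presence-penalty", str(p["presence_penalty"]),
    ]
```

Keeping presets as data rather than separate shell scripts makes the coding-vs-chat divergence the thread describes a one-word switch at launch time.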
// TAGS
qwen-3.5 · llm · inference · reasoning · open-weights · self-hosted · prompt-engineering
DISCOVERED
2026-03-19
PUBLISHED
2026-03-19
RELEVANCE
8/10
AUTHOR
rm-rf-rm