Qwen3.5 models spit gibberish on long prompts
OPEN_SOURCE · REDDIT // 19d ago · MODEL RELEASE


Reddit users say Qwen3.5 4B/9B/27B/122B models in LM Studio start turning 50K+ token prompts into non-grammatical word salad, and the breakdown shows up even in the model's thinking trace. The same input reportedly stays coherent on GPT-OSS-120B, which makes this look more like a long-context serving or cache-handling issue than a pure prompt problem.

// ANALYSIS

Hot take: this smells like a runtime/config bug first and a model-quality problem second. Qwen3.5's own docs advertise a 262K-token default context and recommend keeping at least 128K for thinking, so collapsing around 50K is well below the published envelope.
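The arithmetic alone supports that read. A minimal sketch (the helper and constant names are illustrative, not part of any Qwen or LM Studio tooling) of checking a prompt against the published envelope:

```python
# Hypothetical sanity check: does a prompt fit the model's published
# context window while leaving the recommended thinking budget free?
QWEN35_DEFAULT_CTX = 262_144           # 262K-token default context (per Qwen3.5 docs)
RECOMMENDED_THINKING_BUDGET = 131_072  # docs recommend keeping at least 128K for thinking

def fits_context(prompt_tokens: int,
                 ctx: int = QWEN35_DEFAULT_CTX,
                 thinking_budget: int = RECOMMENDED_THINKING_BUDGET) -> bool:
    """True if the prompt plus the reserved thinking budget fits in the window."""
    return prompt_tokens + thinking_budget <= ctx

# A 50K-token prompt is comfortably inside the advertised envelope...
print(fits_context(50_000))              # True
# ...but would not be inside a runtime silently configured for a 64K window,
# a common local-serving default that could explain breakdown near 50K.
print(fits_context(50_000, ctx=65_536))  # False
```

If the local runtime caps the window below what the docs advertise, overflow or truncation lands exactly in this gap, which is consistent with coherent output on short prompts and collapse on long ones.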

  • Local stacks like LM Studio, GGUF builds, and llama.cpp derivatives can diverge from upstream serving recipes on chat templates, reasoning parsers, and context management.
  • The fact that the gibberish starts in the thinking trace points toward cache/state corruption or context-window handling, not just a bad sampling preset.
  • GPT-OSS-120B handling the same prompt cleanly suggests the input itself is not inherently pathological.
  • If others can reproduce this on official Qwen presets, it deserves a backend bug report with exact quantization, context length, and template settings.
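A report with the exact serving settings attached is far easier to act on than "it outputs gibberish." A minimal sketch of the fields worth capturing (every field name and value here is an assumption for illustration, not an LM Studio or llama.cpp schema):

```python
import json

# Illustrative repro report for a backend bug filing; field names and
# values are hypothetical, not drawn from any real tool's config format.
report = {
    "model": "Qwen3.5-27B",
    "quantization": "Q4_K_M",       # exact GGUF quant used
    "context_length": 131_072,      # window the runtime was actually configured with
    "prompt_tokens": 52_000,        # approximate point where gibberish begins
    "chat_template": "built-in",    # built-in vs custom template
    "sampling": {"temperature": 0.7, "top_p": 0.8, "top_k": 20},
    "kv_cache_quant": None,         # e.g. "q8_0" if KV-cache quantization is enabled
    "symptom": "gibberish appears in the thinking trace first",
}
print(json.dumps(report, indent=2))
```

Capturing the runtime-configured context length and any KV-cache quantization matters most here, since both are prime suspects for long-context corruption that the model's own weights would never show.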
// TAGS
qwen3-5 · llm · reasoning · open-weights · inference · self-hosted

DISCOVERED

19d ago

2026-03-24

PUBLISHED

19d ago

2026-03-24

RELEVANCE

9/10

AUTHOR

custodiam99