Qwen3.5-9B hits thinking loops in Ollama

// 116d agoMODEL RELEASE

Qwen3.5-9B hits thinking loops in Ollama

Users are reporting that Alibaba’s newly released Qwen3.5-9B model enters infinite internal reasoning cycles when deployed via Ollama and OpenWebUI. This "thinking loop" behavior often manifests as repetitive plan-checking monologues, preventing the model from delivering a final answer despite its high reasoning benchmarks.

// ANALYSIS

Qwen 3.5 9B’s hyper-efficient reasoning architecture is a double-edged sword: it punches way above its weight class but oscillates wildly without strict constraint parameters.

–The issue is often tied to high temperatures (1.0+) in small quantized versions; lowering temperature to 0.7-0.8 typically stabilizes the internal monologue.
–Ollama's native `--think=false` flag or the `/set nothink` command can force-disable the reasoning path to bypass the loop entirely.
–System prompts that explicitly limit reasoning steps to a fixed number (e.g., "Analyze in max 3 steps") have proven effective at forcing termination.
–With a 256K native context and GPQA scores topping GPT-4o, the model is clearly optimized for "thinking" which UI wrappers aren't yet perfectly tuned to handle.

// TAGS

qwen3-5-9bllmollamareasoningopen-sourceself-hosted

DISCOVERED

116d ago

2026-03-16

PUBLISHED

116d ago

2026-03-16

RELEVANCE

9/ 10

AUTHOR

Xyhelia

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

OPEN SOURCE2h ago

Terminal Control is an open-source tool that enables AI coding agents to control, test, and capture real terminal applications through pseudo-terminals.

Terminal Control provides a Rust-based command-line interface and a TypeScript client library that allow external drivers, such as AI agents and automated testing suites, to interact directly with Terminal User Interfaces (TUIs). By offering a real pseudo-terminal environment, it overcomes the limitations of parsing plain text output, enabling precise keystroke injection, screen capture, timeline recording, and extraction of structured visual states like SVG and JSON.

NEWS2h ago

Greptile supports OSS with free accounts

The creator of the open-source repository claude-code-templates shared positive feedback on using Greptile for automated pull request reviews. Supported by a free open-source software (OSS) account from the Greptile team, the maintainer integrated the tool into incoming PRs, where it successfully generated diagrams of the code changes and left detailed reviews that caught real issues.

MODEL3h ago

LingBot-VA 2.0 launches robot control model

Developed by Robbyant under Ant Group, LingBot-VA 2.0 is a video-action foundation model built from scratch for native robot control. It employs a causal Mixture-of-Experts architecture and consistency distillation to reduce control loop latency to 142 ms.

Qwen3.5-9B hits thinking loops in Ollama