Qwen3.6-35B-A3B stalls inside agent wrappers

// 90d agoMODEL RELEASE

Qwen3.6-35B-A3B stalls inside agent wrappers

Qwen3.6-35B-A3B runs fine in Ollama’s CLI, but this Reddit thread reports it hanging inside OpenCode and Claude Code. The debate is whether the issue is the new model, a too-small context window, or missing agent/tool-call config.

// ANALYSIS

My read: this is more likely a wrapper mismatch than a broken model. Qwen3.6 is explicitly aimed at agentic coding, but its official docs assume specific reasoning and tool-call parsers that local agent shells may not be wired for by default.

–Qwen’s docs show Qwen3.6 defaults to thinking mode and recommend `reasoning-parser qwen3` plus `tool-call-parser qwen3_coder` for tool use, which means a generic agent client can stall even when plain chat works.
–Context is worth checking, but it is probably not the root cause: the model is natively 262K tokens, and Qwen says 128K+ helps preserve thinking behavior. A 4K default would be bad for agents, but not the only plausible failure mode.
–Qwen also documents a non-thinking path via `chat_template_kwargs.enable_thinking: false`, so some agent workflows may need thinking disabled or preserved explicitly rather than left to defaults.
–The practical takeaway is that “works in `ollama run`” does not prove agent compatibility; tool protocols, chat templates, and wrapper support matter more than raw generation speed here.

// TAGS

qwen3.6-35b-a3bqwenllmagentai-codingcliollamaopencodeclaude-code

DISCOVERED

90d ago

2026-04-18

PUBLISHED

90d ago

2026-04-18

RELEVANCE

9/ 10

AUTHOR

vuncentV7

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE13m ago

Vercel discounts GLM 5.2 on AI Gateway

Vercel is offering a 35% discount for developers running Z.ai's open-weight GLM 5.2 model via Novita on the Vercel AI Gateway until July 24. Supported in the Vercel AI SDK, the integration allows developers to target Novita's serverless endpoints using gateway provider configuration options.

MODEL38m ago

Shanghai AI Lab releases Intern-S2-Preview-397B

Shanghai AI Lab has released Intern-S2-Preview-397B, an Apache-2.0 licensed, open-weight scientific multimodal Mixture-of-Experts model built on Qwen3.5-MoE. The model features 397 billion parameters (activating approximately 17 billion per token) and is designed for advanced scientific reasoning and long-horizon agent tasks.

NEWS1h ago

Kimi K3 succeeds where Claude Code struggles

Developer levelsio reported that Moonshot AI's Kimi K3 model successfully powered through their Windows XP Simulator to-do list, a task that Claude Code failed to complete over a two-week period. The developer blamed Claude Code's aggressive safety guardrails, which repeatedly downgraded their access from Claude 3 Opus to Claude 3.5 Sonnet, causing constant disruption and wasted time.