OPEN_SOURCE ↗
REDDIT // 14h ago · MODEL RELEASE
Qwen3.6 keeps CoT context intact
Qwen3.6-Plus now preserves prior reasoning inside the conversation when `preserve_thinking` is enabled, so agent loops can reuse chain-of-thought instead of re-deriving it every turn. The practical win is better decision consistency in multi-step workflows, especially when the model commits to choices mid-reasoning.
// ANALYSIS
Small flag, big effect: this turns hidden reasoning from disposable scratchpad into stateful context for agentic work.
- `--chat-template-kwargs '{"preserve_thinking": true}'` is the difference between “works in theory” and “actually behaves consistently” in local setups
- Keeping thinking tokens around should reduce redundant re-reasoning across turns, which matters for long tool-using sessions
- The tradeoff is obvious: more context retained means more tokens on the wire, but less thrash in repeated deliberation
- This is a wrapper/inference-contract issue as much as a model issue; the model can support it, but your serving stack has to pass the flag through correctly
- For agent builders, this is the kind of feature that quietly improves reliability more than benchmark numbers do
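As a minimal sketch of the pass-through point: OpenAI-compatible servers that expose chat-template kwargs (the release notes name the `--chat-template-kwargs` CLI flag; some stacks also accept a per-request `chat_template_kwargs` field in the request body) need the kwarg forwarded on every turn of the agent loop. The model name and the per-request field below are assumptions for illustration, not a confirmed API of any particular server:

```python
import json

def build_chat_request(messages, preserve_thinking=True):
    """Build an OpenAI-compatible /v1/chat/completions payload.

    `chat_template_kwargs` is assumed to be forwarded to the server's
    chat template, where `preserve_thinking` keeps prior chain-of-thought
    in context instead of stripping it each turn.
    """
    return {
        "model": "qwen3.6-plus",  # hypothetical model identifier
        "messages": messages,
        "chat_template_kwargs": {"preserve_thinking": preserve_thinking},
    }

# Each iteration of an agent loop would rebuild the payload from the
# full message history, so the flag applies to every turn.
payload = build_chat_request(
    [{"role": "user", "content": "Plan the next tool call."}]
)
print(json.dumps(payload, indent=2))
```

The point of building the payload in one place is that forgetting the kwarg on a single turn silently reverts to the old strip-the-thinking behavior, which is exactly the inconsistency the flag is meant to fix.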
// TAGS
qwen3-6-plus · llm · reasoning · agent · api
DISCOVERED
14h ago
2026-04-17
PUBLISHED
15h ago
2026-04-17
RELEVANCE
9/10
AUTHOR
Big_Mix_4044