Open WebUI Breaks Qwen 3.6 Thinking
Open WebUI users report that Qwen 3.6 served via llama.cpp loses its `preserve_thinking` behavior, even though the same model works in llama.cpp's own web UI. Open WebUI's docs say it only preserves reasoning that the model actually returns, and a current GitHub discussion points to `reasoning_content` being mishandled on reinjection.
This looks more like a client-side compatibility bug than a model issue: the backend can emit reasoning, but Open WebUI may be serializing or replaying it in the wrong shape, with `reasoning_content` stripped or moved into the wrong field on the next turn. That would break agentic workflows, which makes the llama.cpp native UI a better reference implementation for now because it passes the chat-template kwargs through more directly. If you need the feature today, the likely fix is a pipe/filter or a targeted Open WebUI issue or PR rather than a hidden toggle.
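To make the suspected failure mode concrete, here is a minimal sketch of the two replay behaviors. It assumes an OpenAI-compatible chat payload where assistant messages can carry a `reasoning_content` field (the field named in the GitHub discussion); the helper functions are illustrative, not actual Open WebUI code.

```python
def replay_preserving_reasoning(history):
    """Build next-turn messages, passing reasoning_content through intact.

    This is what the backend's preserve_thinking path needs to see:
    prior assistant turns with their reasoning field still attached.
    """
    return [dict(m) for m in history]  # copy every field unchanged


def replay_stripping_reasoning(history):
    """The suspected bug: reasoning_content dropped before reinjection."""
    out = []
    for m in history:
        m = dict(m)
        m.pop("reasoning_content", None)  # reasoning lost on the next turn
        out.append(m)
    return out


history = [
    {"role": "user", "content": "Plan the next tool call."},
    {
        "role": "assistant",
        "content": "Calling the search tool.",
        "reasoning_content": "I should search before answering.",
    },
]

kept = replay_preserving_reasoning(history)
stripped = replay_stripping_reasoning(history)
```

A pipe/filter workaround would essentially do what `replay_preserving_reasoning` does: intercept the outgoing request and re-attach the reasoning field before the history reaches the backend.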
DISCOVERED: 2026-05-11
PUBLISHED: 2026-05-11
AUTHOR: sterby92