Tool Calling Leaks Into Chat Output
REDDIT · 21d ago · TUTORIAL


A LocalLLaMA user asks why a model sometimes writes `<tool_call>` into its normal reply instead of emitting a real executable tool call. The thread points to backend/template issues more than raw model behavior, especially in Qwen-style local stacks.
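When the tags leak, they are still machine-recognizable. A minimal sketch of a fallback parser for Qwen-style `<tool_call>` tags that escaped into the text channel (the regex and function name are illustrative, not from the thread):

```python
import json
import re

# Qwen-family templates wrap tool calls in <tool_call> tags containing JSON.
# If the runtime's parser misses them, they appear verbatim in the reply text.
TOOL_CALL_RE = re.compile(r"<tool_call>\s*(\{.*?\})\s*</tool_call>", re.DOTALL)

def extract_leaked_tool_calls(text: str):
    """Split assistant text into plain narration and any leaked tool calls."""
    calls = []
    for match in TOOL_CALL_RE.finditer(text):
        try:
            calls.append(json.loads(match.group(1)))
        except json.JSONDecodeError:
            pass  # skip malformed payloads
    narration = TOOL_CALL_RE.sub("", text).strip()
    return narration, calls

reply = ('Let me check the weather. <tool_call>\n'
         '{"name": "get_weather", "arguments": {"city": "Paris"}}\n'
         '</tool_call>')
text, calls = extract_leaked_tool_calls(reply)
# text  -> "Let me check the weather."
# calls -> [{"name": "get_weather", "arguments": {"city": "Paris"}}]
```

A parser like this is a band-aid; the durable fix is making the runtime's template and tool-call parser agree, as the replies below suggest.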

// ANALYSIS

This looks less like “the model forgot how” and more like a serialization bug: if the tool call isn’t preserved as structured metadata, it gets flattened into chat text on the next pass.
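A minimal sketch of that flattening, using OpenAI-style message dicts (the field names follow the common chat-completions convention; the values are made up):

```python
# Correct: the tool call lives in structured metadata ("tool_calls"),
# separate from the assistant's text in "content".
structured_turn = {
    "role": "assistant",
    "content": "Let me look that up.",
    "tool_calls": [{
        "id": "call_1",  # hypothetical id
        "type": "function",
        "function": {"name": "search", "arguments": '{"query": "qwen"}'},
    }],
}

# Broken: a serialization pass flattened the call into plain text.
# On the next turn the model sees this as prose and imitates it literally,
# writing <tool_call> tags into its reply instead of calling the tool.
flattened_turn = {
    "role": "assistant",
    "content": 'Let me look that up. '
               '<tool_call>{"name": "search", "arguments": {}}</tool_call>',
}
```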

  • Community replies point to chat template and parser mismatches, especially when using Qwen-family models with local runtimes
  • A correct tool-call flow needs separate assistant text and tool-call objects, plus a matching tool-result turn
  • If tool-call/result pairs are not replayed exactly, later generations can degrade into plain prose or raw XML tags
  • Fixes usually live in the orchestration layer: template alignment, history replay, and strict separation between narration and action
  • Once a bad turn poisons the loop, resetting conversation state often "fixes" it temporarily, which is a strong hint the bug lives in state handling rather than in the model
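The replay requirement above can be checked mechanically. A hedged sketch of an orchestration-layer invariant (the function and field names assume OpenAI-style histories, not any specific runtime):

```python
def validate_history(messages):
    """Check that every assistant tool call has a matching tool-result turn
    before the conversation moves on to the next user message."""
    pending = set()
    for msg in messages:
        if msg["role"] == "assistant":
            for call in msg.get("tool_calls", []):
                pending.add(call["id"])
        elif msg["role"] == "tool":
            pending.discard(msg["tool_call_id"])
        elif msg["role"] == "user" and pending:
            raise ValueError(f"tool calls {pending} have no result turn")
    return not pending  # True only if every call was answered

good = [
    {"role": "user", "content": "weather?"},
    {"role": "assistant", "content": "", "tool_calls": [{"id": "c1"}]},
    {"role": "tool", "tool_call_id": "c1", "content": '{"temp": 18}'},
    {"role": "assistant", "content": "It's 18 degrees."},
]
validate_history(good)  # -> True
```

Running a check like this before each generation catches the broken replay early, instead of letting the model degrade into raw XML tags several turns later.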
// TAGS
llm · agent · prompt-engineering · automation · open-source · tool-calling

DISCOVERED

21d ago

2026-03-21

PUBLISHED

21d ago

2026-03-21

RELEVANCE

8/10

AUTHOR

greendude120