Qwen 2.5 sparks context bleed debate
A Reddit post in r/LocalLLaMA shows Qwen 2.5:7B, served through Ollama, answering a zero-context eval as if it were already mid-conversation. The thread treats it less as true retained memory and more as a revealing hallucination triggered by a prompt that assumes prior dialogue.
This looks more like prompt-induced confabulation than spooky hidden memory, but it is still a useful warning for anyone building context-sensitive agents.
- –The prompt says “Based upon the conversation so far,” which nudges the model to invent a prior exchange instead of first checking whether any context exists.
- –The model’s reference to “me being Qwen from Alibaba Cloud” reads like a learned default persona leaking into an underspecified eval, not evidence of persistent session state.
- –For local LLM developers, the real lesson is eval hygiene: explicitly declare when no prior context exists, and test router prompts that do not presuppose a chat history.
DISCOVERED
91d ago
2026-03-11
PUBLISHED
91d ago
2026-03-10
RELEVANCE
AUTHOR
KindnessBiasedBoar