OPEN_SOURCE
REDDIT // 25d ago · MODEL RELEASE
Nemotron 3 Super hits reasoning loop
Nemotron 3 Super appears to get trapped in a self-referential reasoning loop when served through llama-server with Aider, with the model seemingly reading its own chain-of-thought back as user input. The poster says the same model behaves normally in OpenRouter/SillyTavern, which points more toward a serving-stack or template mismatch than a universal model bug.
// ANALYSIS
This smells like prompt plumbing gone wrong, not the base weights suddenly losing it. The fact that the same model is fine in some frontends but loops in others is the biggest clue.
- The poster says context was not overflowing, so this does not look like a simple window-limit failure.
- Other commenters reproduce similar behavior in different stacks, while one notes it works in LM Studio, which strongly suggests backend-specific formatting differences.
- Nemotron 3 Super is tuned for agentic reasoning, so if a runtime leaks reasoning traces back into the next turn, the model can self-amplify into a loop.
- A bad quant or GGUF could worsen the problem, but cross-backend variation makes chat-template and stop-token handling the first thing to debug.
- For local deployments, this is a reminder that reasoning models can be unusually sensitive to how “thinking” text is captured and reinserted.
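The leak hypothesis above can be sketched in a few lines. This is a minimal illustration, not the actual llama-server or Aider code: it assumes the model wraps its chain-of-thought in `<think>...</think>` markers (a common convention for reasoning models, but an assumption here) and shows how a frontend would sanitize assistant turns before rebuilding the next prompt, so the model never reads its own reasoning back as input.

```python
import re

# Assumed reasoning-trace delimiter; the real tag depends on the
# model's chat template. DOTALL lets the span cover multiple lines.
THINK_RE = re.compile(r"<think>.*?</think>\s*", re.DOTALL)

def strip_reasoning(text: str) -> str:
    """Remove chain-of-thought spans from an assistant reply."""
    return THINK_RE.sub("", text)

def rebuild_history(messages: list[dict]) -> list[dict]:
    """Sanitize assistant turns before sending the next request.

    If a serving stack skips this step, the model's own reasoning
    text re-enters the context and can self-amplify into a loop.
    """
    return [
        {**m, "content": strip_reasoning(m["content"])}
        if m["role"] == "assistant" else m
        for m in messages
    ]

history = [
    {"role": "user", "content": "Summarize the bug."},
    {"role": "assistant",
     "content": "<think>Re-reading my own thoughts...</think>It loops."},
]
print(rebuild_history(history)[1]["content"])  # → "It loops."
```

A frontend that does this (as LM Studio apparently does, per the thread) stays stable; one that replays raw output would match the looping behavior the poster describes.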
// TAGS
nemotron-3-super · llm · reasoning · inference · agent · aider
DISCOVERED
2026-03-18
PUBLISHED
2026-03-18
RELEVANCE
8/10
AUTHOR
Real_Ebb_7417