OPEN_SOURCE
REDDIT // 8d ago // MODEL RELEASE
Gemma 4 loops in LM Studio
A Reddit user reports Gemma 4-26B-A4B collapsing into recursive junk output in LM Studio on dual MI50s with Vulkan, Q4_K_M, and Q8_0 KV cache. The repeated `</think>` and `<|im_end|>` tokens suggest a template or backend mismatch more than a simple “bad model” complaint.
// ANALYSIS
This looks like an integration bug disguised as a model failure. Gemma 4 is meant to run locally, but if the runtime is feeding it the wrong chat format or stop tokens, the model can spiral into exactly this kind of self-referential loop.
- The output tokens shown here come from non-Gemma chat schemas, which points to a prompt/template mismatch or incorrect stop-sequence handling.
- Vulkan plus a quantized KV cache plus a MoE model is a brittle stack; any backend edge case can turn into repeated garbage generation.
- Google positions Gemma 4 as a local-first, agentic open model family, so a failure like this is a support-gap issue that matters for real-world adoption.
- The first things to try are disabling KV-cache quantization, verifying the Gemma 4 chat template, and testing a different backend or build.
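The template-mismatch hypothesis can be illustrated in isolation. Gemma-family models delimit turns with `<start_of_turn>`/`<end_of_turn>`, while `<|im_end|>` belongs to the ChatML family; a runtime watching for the wrong stop string will never see a turn end and keeps sampling. The sketch below is a hypothetical illustration, not LM Studio's actual code:

```python
# Stop strings for two different chat-template families.
GEMMA_STOPS = ["<end_of_turn>"]   # Gemma-style turn delimiter
CHATML_STOPS = ["<|im_end|>"]     # ChatML delimiter (wrong family for Gemma)

def gemma_prompt(messages):
    """Render messages with a Gemma-style chat template (sketch)."""
    parts = []
    for m in messages:
        role = "model" if m["role"] == "assistant" else "user"
        parts.append(f"<start_of_turn>{role}\n{m['content']}<end_of_turn>\n")
    parts.append("<start_of_turn>model\n")  # cue the model's reply
    return "".join(parts)

def hits_stop(generated, stops):
    """Would the runtime terminate generation on this output?"""
    return any(s in generated for s in stops)

reply = "Hello!<end_of_turn>"
print(hits_stop(reply, GEMMA_STOPS))   # True: generation terminates cleanly
print(hits_stop(reply, CHATML_STOPS))  # False: runtime keeps sampling, loops
```

With mismatched stops the model's own end-of-turn marker is ignored, which is consistent with the runaway `</think>`/`<|im_end|>` spam in the report.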
// TAGS
gemma-4 · llm · inference · gpu · open-weights · reasoning · multimodal
DISCOVERED
2026-04-04
PUBLISHED
2026-04-04
RELEVANCE
9/10
AUTHOR
Savantskie1