Gemma 4 Heretic Q2_K Spits Gibberish
This Reddit post flags the Q2_K GGUF build of the Gemma 4 26B A4B Heretic model as producing gibberish, and suggests the issue may extend to other quants in the repo. The underlying Hugging Face card shows this is a community GGUF release based on `coder3101/gemma-4-26B-A4B-it-heretic`, itself built on Google’s Gemma 4 26B A4B IT model, with Q2_K listed as the smallest option and higher-bit quants like Q4_K_M and Q6_K positioned as better-quality choices.
Hot take: this reads more like “2-bit MoE compression hit the floor” than a fundamentally broken repo. The model is probably not the problem so much as the quantization level, unless there’s also a tokenizer or chat-template mismatch in the runner.
- The repo is a community GGUF packaging of a fine-tuned Gemma 4 26B A4B model, not the official upstream release.
- The Hugging Face card itself implies the lower-bit end is risky: it labels some quants as lower quality and points users toward Q4_K_M or Q6_K for better results.
- The Reddit report is specifically about `Q2_K`, which is the most plausible failure point for gibberish on a model this large and MoE-shaped.
- Inference: if other quants are also broken, the more likely causes are prompt/template wiring or a bad conversion path, not the entire model family (the sketch below is one quick way to tell the two apart).
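One way to probe the "quant floor vs. broken repo" split is to run the same prompt through the Q2_K file and a higher-bit quant, letting the chat template embedded in the GGUF do the formatting so runner-side template wiring is out of the picture. A minimal sketch with llama-cpp-python, assuming local copies of the files; the filenames and prompt are hypothetical, not from the repo:

```python
# Minimal A/B check: same prompt, two quants of the same model.
# Assumes llama-cpp-python is installed and the GGUF files are local.
from llama_cpp import Llama

PROMPT = "Explain, in two sentences, what GGUF quantization does."

def sample(model_path: str) -> str:
    # verbose=False keeps llama.cpp loader logs out of the comparison output
    llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)
    # create_chat_completion uses the chat template shipped in the GGUF
    # metadata (when present), so a runner-side template mismatch is excluded
    out = llm.create_chat_completion(
        messages=[{"role": "user", "content": PROMPT}],
        max_tokens=128,
        temperature=0.2,
    )
    return out["choices"][0]["message"]["content"]

if __name__ == "__main__":
    # Hypothetical filenames; substitute whatever the repo actually ships.
    for path in ("gemma-4-26B-A4B-it-heretic.Q2_K.gguf",
                 "gemma-4-26B-A4B-it-heretic.Q4_K_M.gguf"):
        print(f"--- {path} ---")
        print(sample(path))
```

If Q4_K_M answers coherently while Q2_K still babbles, that points at the 2-bit floor; if both are garbled even with the embedded template, a bad conversion or template metadata is the better suspect.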
DISCOVERED: 2026-04-20
PUBLISHED: 2026-04-19
AUTHOR: Academic-Map268