llama.cpp adds Gemma 4 support
OPEN_SOURCE
REDDIT // PRODUCT UPDATE


Google's Gemma 4 family lands in llama.cpp with critical parsing fixes for its novel, reasoning-focused prompt format. The update enables seamless local execution of the 26B and 31B variants, including their new multi-channel token support.

// ANALYSIS

Gemma 4's integration marks a pivot toward complex reasoning architectures in the open-weights ecosystem, moving beyond simple chat completion.

  • New specialized tokens like <|channel|> and <|turn|> suggest a shift toward native multi-agent and multi-modal handling
  • Native support for "reasoning traces" brings Google's open models into direct competition with specialized reasoning powerhouses like DeepSeek-R1
  • The 26B-A4B variant's architecture hints at a hybrid attention mechanism optimized for long-context reasoning tasks
  • Rapid day-zero support from llama.cpp reinforces its status as the industry-standard gateway for local AI deployment
  • Parity with vLLM means developers can move between cloud-scale inference and local dev environments without re-engineering prompts
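
The multi-channel layout described above can be sketched roughly as follows. This is a hypothetical illustration only: the <|channel|> and <|turn|> tokens come from the post, but the channel names, ordering, and surrounding layout are assumptions, not the documented Gemma 4 template.

```python
# Hypothetical sketch of a multi-channel prompt builder. Only the
# <|turn|> and <|channel|> token names come from the post; the layout
# (role-then-channel header, newline-separated turns) is an assumption.

def build_prompt(turns):
    """Render (role, channel, text) triples into one prompt string."""
    parts = []
    for role, channel, text in turns:
        # Each turn opens with <|turn|>role, then names its channel.
        parts.append(f"<|turn|>{role}<|channel|>{channel}\n{text}")
    return "\n".join(parts)

# Separating a reasoning-trace channel from the user-facing channel is
# the kind of structure the new tokens appear designed to carry.
prompt = build_prompt([
    ("user", "final", "What is 2 + 2?"),
    ("assistant", "analysis", "The user asks for simple arithmetic."),
    ("assistant", "final", "4"),
])
print(prompt)
```

A template along these lines would let a runtime strip the "analysis" channel before display while still conditioning generation on it.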
// TAGS
llama-cpp · gemma-4 · llm · open-source · reasoning · open-weights

DISCOVERED

2026-04-14

PUBLISHED

2026-04-13

RELEVANCE

9/10

AUTHOR

jacek2023