OPEN_SOURCE
REDDIT // BENCHMARK RESULT
Gemma 4 coding: Harness choice swings performance
Google's Gemma 4 31B Dense and 26B MoE models emerge as open-weight coding powerhouses, with community benchmarks revealing significant performance variance across different agentic frameworks.
// ANALYSIS
Gemma 4 marks the end of "vibe coding" benchmarks as the agentic harness becomes just as important as the model weights.
- The 31B Dense model hits 80% on LiveCodeBench v6, rivaling proprietary frontier models in a local-first package.
- Frameworks like Kilo Code and Roo Code extract more performance through highly structured system prompts and autonomous tool execution.
- Score variation across harnesses (Claude Code vs. Kilo Code) suggests that "raw" model evals are increasingly decoupled from real-world agentic utility.
- The 26B MoE variant is the sweet spot for developers, offering 97% of Dense performance at a fraction of the inference cost.
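The harness-dependence described above comes down to the loop the framework runs around the model: a structured system prompt, parsing tool calls out of model output, executing them, and feeding results back. A minimal sketch in Python, where `fake_model`, the JSON tool protocol, and the `read_file` tool are all hypothetical stand-ins (a real harness would call a Gemma 4 endpoint and sandbox its tools):

```python
import json

# Hypothetical system prompt: real harnesses like Kilo Code use far more
# structured instructions; this is the minimal shape of the idea.
SYSTEM_PROMPT = (
    "You are a coding agent. Reply with JSON: "
    '{"tool": "read_file", "path": ...} or {"answer": ...}'
)

def fake_model(messages):
    """Stub standing in for a model API: asks for a file once, then answers."""
    if not any(m["role"] == "tool" for m in messages):
        return json.dumps({"tool": "read_file", "path": "config.txt"})
    return json.dumps({"answer": "config.txt has 1 line"})

# Toy tool registry; a real harness would expose sandboxed file/shell tools.
TOOLS = {"read_file": lambda path: "debug=true"}

def run_agent(model, task, max_steps=5):
    messages = [{"role": "system", "content": SYSTEM_PROMPT},
                {"role": "user", "content": task}]
    for _ in range(max_steps):
        reply = json.loads(model(messages))
        if "answer" in reply:
            return reply["answer"]
        # Autonomous tool execution: run the call, append the result,
        # and loop back to the model with the new context.
        result = TOOLS[reply["tool"]](reply["path"])
        messages.append({"role": "tool", "content": result})
    return None

print(run_agent(fake_model, "Summarize config.txt"))
```

How well the prompt scaffolding and tool loop match a given model's training is exactly what makes benchmark scores swing between harnesses.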
// TAGS
gemma-4 · ai-coding · llm · open-weights · benchmark · google · agent
DISCOVERED
4h ago
2026-04-18
PUBLISHED
6h ago
2026-04-17
RELEVANCE
10 / 10
AUTHOR
jazir55