OpenCode benchmark favors Qwen, Gemma
OPEN_SOURCE
REDDIT // 5d ago · BENCHMARK RESULT

This post compares self-hosted LLMs inside OpenCode across an easy CLI build task and a harder site-migration mapping task. On the author’s RTX 4080 setup, Qwen 3.5 27B and Gemma 4 26B came out strongest overall, with the other models trailing on quality, speed, or both.

// ANALYSIS

Useful, practical signal for anyone trying to run an agentic coding stack locally: the best model on paper is not always the best model in OpenCode, where tool use, consistency, and latency all matter.

  • Qwen 3.5 looks like the safest all-around pick for 16GB VRAM hardware, especially when you want decent quality without brutal slowdown
  • Gemma 4 26B is the surprise contender here; it appears competitive enough that it deserves a longer local-coding trial
  • GLM-4.7 Flash and Nemotron 3 seem to struggle more on the harder, structured task, which is usually where agent workflows expose weak reasoning or instruction-following
  • The 25k-50k context range is a reminder that real agent use is not a toy benchmark; model behavior can change a lot once prompts and repo context get large
  • The speed table matters as much as the task results, because local coding agents become frustrating fast once throughput drops below interactive
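The 16GB VRAM constraint above can be sanity-checked with a rough back-of-envelope estimate. This sketch assumes ~4-bit quantization and lumps KV cache and runtime buffers into a flat overhead term (`est_vram_gb` is a hypothetical helper, not anything from the post; real usage varies by runtime, quant format, and context length):

```python
def est_vram_gb(params_b: float, bits_per_weight: float = 4.0,
                overhead_gb: float = 1.5) -> float:
    """Very rough VRAM estimate for a quantized model.

    params_b: parameter count in billions.
    bits_per_weight: quantization level (assumed ~4-bit here).
    overhead_gb: lumped allowance for KV cache and runtime buffers.
    """
    weights_gb = params_b * bits_per_weight / 8  # bits -> bytes
    return weights_gb + overhead_gb

# A 27B model at ~4 bits needs roughly 13.5 GB for weights alone,
# so with cache/overhead it sits right at the edge of a 16GB card.
print(round(est_vram_gb(27), 1))  # -> 15.0
```

This is only an illustration of why 26-27B models are about the ceiling for single-card 16GB setups, and why longer contexts (the 25k-50k range mentioned above) push memory pressure even higher.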
// TAGS
opencode · benchmark · ai-coding · llm · self-hosted · cli · testing

DISCOVERED

5d ago

2026-04-06

PUBLISHED

6d ago

2026-04-06

RELEVANCE

8 / 10

AUTHOR

rosaccord