Qwen3-Coder-Next leads SWE-rebench pass@5

// BENCHMARK RESULT

Qwen3-Coder-Next posts the strongest pass@5 among standalone models on the January 2026 SWE-rebench leaderboard: 64.6%, alongside a 40.0% resolved rate. That makes Alibaba’s open-source coding model one of the clearest signs yet that local and self-hosted coding stacks are closing the gap with frontier closed systems.

// ANALYSIS

The interesting part is not just that Qwen3-Coder-Next is good. It is that an instruct-style open model is now competitive in exactly the multi-step, recovery-heavy workflows where coding agents usually break.

– SWE-rebench’s January 2026 leaderboard explicitly calls Qwen3-Coder-Next the best open-source model by pass@5, and the site highlights that it achieves this with only ~3B active parameters.
– Its 64.6% pass@5 beats every non-harness model on the board; Claude Code and Junie still rank higher, but they are full agent systems rather than raw models.
– The result matters for developers running private coding workflows locally, where open weights, controllable inference, and lower operational risk can outweigh absolute frontier polish.
– The benchmark notes a practical catch: hosted providers often lack token or prefix caching support for Qwen3-Coder-Next, which can hurt real-world agent efficiency even when raw capability is excellent.
– This also strengthens the case that Qwen’s coding line is iterating unusually fast: Qwen3-Coder-Next materially outperforms earlier Qwen coding and general-purpose variants on agentic software tasks.
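
The gap between the 64.6% pass@5 and the 40.0% resolved rate is easier to read with the two metrics side by side. The sketch below is illustrative only: it assumes each task is attempted several times independently, that resolved rate is the average per-attempt success, and that pass@5 counts a task as solved if any of five attempts succeeds. The leaderboard's exact procedure may differ, and the task names and outcomes here are invented.

```python
from typing import Dict, List

# Hypothetical data: task id -> outcome of each independent attempt.
# These values are made up for illustration, not taken from SWE-rebench.
toy_runs: Dict[str, List[bool]] = {
    "task-a": [True, False, False, True, False],
    "task-b": [False, False, False, False, False],
    "task-c": [False, True, False, False, False],
    "task-d": [True, True, True, True, True],
}


def resolved_rate(runs: Dict[str, List[bool]]) -> float:
    """Per-attempt success rate, averaged over tasks (assumed definition)."""
    return sum(sum(attempts) / len(attempts) for attempts in runs.values()) / len(runs)


def pass_at_k(runs: Dict[str, List[bool]], k: int = 5) -> float:
    """Share of tasks solved at least once in the first k attempts (assumed definition)."""
    return sum(any(attempts[:k]) for attempts in runs.values()) / len(runs)


print(f"resolved rate: {resolved_rate(toy_runs):.1%}")  # 40.0%
print(f"pass@5:        {pass_at_k(toy_runs, 5):.1%}")   # 75.0%
```

Under these assumptions, a model that solves many tasks only occasionally shows a large spread between the two numbers, which is why pass@5 reads as a ceiling with retries and resolved rate as single-shot reliability.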

// TAGS

qwen3-coder-next, llm, ai-coding, benchmark, open-weights

DISCOVERED: 2026-03-10 (125d ago)

PUBLISHED: 2026-03-07 (129d ago)

RELEVANCE: 9/10

AUTHOR: BitterProfessional7p
