Kimi K2.7-Code ranks second on ErdosBench
Moonshot AI's Kimi K2.7-Code achieved second place on ErdosBench, demonstrating high precision with 13/14 coverage and zero major false or unsafe partials. The model matched the top-performing Claude Fable 5 max on all solved results, highlighting the growing reasoning capabilities of Chinese AI laboratories.
Kimi K2.7-Code's competitive result suggests that Chinese AI labs are closing the reasoning gap with US frontier models.
- Placing right behind Claude Fable 5 max and ahead of other major models demonstrates significant progress in agentic reasoning.
- Achieving 13/14 coverage with zero major false or unsafe partials indicates high precision, which makes the model a more dependable choice for complex tasks.
- This result highlights Moonshot AI's focus on reasoning token efficiency, suggesting that reduced token overhead can coexist with frontier-level performance.
DISCOVERED
2026-06-14
PUBLISHED
2026-06-14
AUTHOR
mark_k