MiroThinker-1.7 tops BrowseComp with verification agents

// 133d ago · MODEL RELEASE

MiroMind AI releases MiroThinker-1.7 (235B, open-source) and MiroThinker-H1 (proprietary), a new generation of deep research agents built around a verification-centric architecture. H1 scores 88.2% on OpenAI's BrowseComp benchmark, the highest reported among known models, while the 30B mini variant sets the open-source state of the art on BrowseComp-ZH at 72.3%.

// ANALYSIS

MiroThinker is making the case that scaling *interaction depth*, rather than just parameters or context, is the missing axis for agents that do real research instead of merely plausible retrieval.

– The verification-centric architecture is the real differentiator: local verification counters probability bias at each reasoning step, and global verification audits the full evidence chain end-to-end. Counterintuitively, verified runs use *fewer* steps than unverified ones because actions with no information gain are filtered out.
– BrowseComp 88.2% (H1) is a meaningful signal: the benchmark tests genuine web research ability rather than memorization, which makes it harder to game than static evals.
– The 30B mini outperforming Kimi-K2-Thinking (1T parameters) on BrowseComp-ZH at roughly 1/20th the inference cost ($0.07 vs $1.40 per call) is a striking efficiency claim worth watching.
– The full open-source release includes weights, training code, and the MiroVerse 147K-sample dataset, which is unusual transparency for a frontier research agent.
– 6,700+ GitHub stars suggest strong developer traction despite the project flying under the mainstream radar.
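
To make the first point concrete, here is a toy Python sketch of how step-level filtering can shorten a run. It is not MiroThinker's implementation: the `Step` and `Trace` types, the `local_verify` and `global_verify` functions, and the idea of modelling evidence as a set of fact labels are all assumptions invented for illustration. Local verification here simply rejects any action that would add no new evidence, and global verification checks at the end that every required fact is backed by a recorded step.

```python
from dataclasses import dataclass, field


@dataclass
class Step:
    action: str          # hypothetical action label, e.g. a search query
    evidence: frozenset  # toy stand-in for the facts this action returns


@dataclass
class Trace:
    steps: list = field(default_factory=list)
    known: set = field(default_factory=set)


def local_verify(trace: Trace, step: Step) -> bool:
    """Accept a step only if it contributes evidence the trace lacks."""
    return bool(step.evidence - trace.known)


def global_verify(trace: Trace, required: set) -> bool:
    """Audit the whole chain: each required fact must be backed by a step."""
    supported = set().union(*(s.evidence for s in trace.steps))
    return required <= supported


def run(candidates: list, required: set, verify: bool) -> Trace:
    """Execute candidate actions in order until all required facts are known."""
    trace = Trace()
    for step in candidates:
        if verify and not local_verify(trace, step):
            continue  # filtered out: no information gain
        trace.steps.append(step)
        trace.known |= step.evidence
        if required <= trace.known:
            break
    return trace


# Hypothetical candidate actions; two of them are redundant.
candidates = [
    Step("search A", frozenset({"a"})),
    Step("search A again", frozenset({"a"})),
    Step("open page B", frozenset({"b"})),
    Step("re-open page B", frozenset({"a", "b"})),
    Step("search C", frozenset({"c"})),
]
required = {"a", "b", "c"}

verified = run(candidates, required, verify=True)
unverified = run(candidates, required, verify=False)

print(len(verified.steps), len(unverified.steps))  # prints "3 5"
print(global_verify(verified, required))           # prints "True"
```

In this toy setup both runs end with the same evidence, but the verified run skips the two redundant actions. A real agent would need a far richer notion of information gain than set difference.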

// TAGS

mirothinker, agent, llm, open-source, open-weights, benchmark, reasoning

DISCOVERED

133d ago

2026-03-14

PUBLISHED

135d ago

2026-03-12

RELEVANCE

8/10

AUTHOR

wuqiao
