MiniMax M3 Tops GPT 5.5 on SWE-Bench Pro
The recently announced MiniMax M3 model has reportedly beaten GPT 5.5 on the SWE-Bench Pro coding benchmark, scoring 59.0% compared to GPT 5.5's 58.6%. Operating at $0.30 per million input and $1.20 per million output tokens, M3 offers immense price-to-performance potential for highly affordable agentic coding workflows.
MiniMax M3 proves that frontier-level coding intelligence is no longer locked behind premium pricing, though benchmark contamination concerns remain a crucial caveat.
* MiniMax M3 achieves a 59.0% resolve rate on SWE-Bench Pro, slightly edging out GPT 5.5's 58.6%.
* At $0.30/M input and $1.20/M output, M3 delivers a massive price-to-performance breakthrough for developers.
* Concerns persist regarding SWE-Bench Pro's susceptibility to contamination, meaning real-world testing is required to validate these claims.
* The model's low latency and cheap pricing make it a prime candidate for developer agents and automated software engineering workflows.
DISCOVERED
1h ago
2026-06-01
PUBLISHED
1h ago
2026-06-01
RELEVANCE
AUTHOR
bridgemindai