Gemini 3.5 Flash tops 3.1 Pro on GDPval
Google releases Gemini 3.5 Flash, a high-speed model that leapfrogs Gemini 3.1 Pro's agentic performance while maintaining 280+ tps. The release signals a shift in focus toward the "Intelligence-vs-Speed" Pareto frontier for production agents.
Gemini 3.5 Flash establishes itself as the new standard for efficient agentic workflows, demonstrating that mid-tier models can handle complex reasoning tasks previously reserved for flagships. It achieved 1656 Elo on GDPval-AA, significantly outperforming Gemini 3.1 Pro while maintaining a throughput exceeding 280 tokens per second. High scores on Terminal-Bench 2.1 and the MCP Atlas benchmark, combined with a new encrypted reasoning context for its Thinking Mode, make it a formidable choice for production-grade IDE and CLI agents.
DISCOVERED
2h ago
2026-05-22
PUBLISHED
2h ago
2026-05-22
RELEVANCE
AUTHOR
demishassabis