Anthropic drops Claude Opus 4.8
Anthropic's Claude Opus 4.8 sets a new frontier with 69.2% on SWE-Bench Pro and 83.4% on agentic computer use. The generational upgrade reportedly beats GPT-5.5 on almost every benchmark.
Opus 4.8 establishes a terrifying new baseline for autonomous engineering and computer use capabilities.
- 69.2% on SWE-Bench Pro means it resolves roughly two-thirds of the benchmark's real-world software issues without human intervention
- 83.4% on agentic computer use indicates a major leap in its ability to directly drive desktop applications
- Reportedly beating GPT-5.5 on almost every benchmark would strengthen Anthropic's lead in the frontier model race
- Scoring 57.9% on Humanity's Last Exam with tools highlights advanced reasoning on difficult, expert-level questions
DISCOVERED: 2026-05-28
PUBLISHED: 2026-05-28
AUTHOR: bridgemindai