Anthropic drops Claude Opus 4.8

// 57d agoMODEL RELEASE

Anthropic drops Claude Opus 4.8

Anthropic's Claude Opus 4.8 sets a new frontier with 69.2% on SWE-Bench Pro and 83.4% on agentic computer use. The generational upgrade reportedly destroys GPT-5.5 across almost every benchmark.

// ANALYSIS

Opus 4.8 establishes a terrifying new baseline for autonomous engineering and computer use capabilities.

–69.2% on SWE-Bench Pro suggests it can resolve the vast majority of real-world software issues without human intervention
–83.4% on agentic computer use indicates a massive leap in its ability to directly drive desktop applications
–Beating GPT-5.5 across the board solidifies Anthropic's lead in the frontier model race
–Scoring 57.9% on Humanity's Last Exam with tools highlights advanced reasoning on complex edge cases

// TAGS

claude-opus-4-8llmreasoningai-codingagentcomputer-usetool-usebenchmark

DISCOVERED

57d ago

2026-05-28

PUBLISHED

57d ago

2026-05-28

RELEVANCE

10/ 10

AUTHOR

bridgemindai

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

VIDEO2h ago

Granola CEO demonstrates OpenAI Codex browser automation

In a video demonstration presented by Every, Granola's CEO showcases OpenAI Codex functioning as an autonomous agent executing complex, multi-step browser workflows. Drawing upon saved user context, Codex navigates web applications and customer support chats to negotiate an internet plan migration and eliminate extra fees.

LAUNCH3h ago

Moonshot AI introduces Kimi K3 Agent Swarm

Moonshot AI has introduced Agent Swarm mode for Kimi K3, a horizontal scaling architecture capable of coordinating up to 300 parallel sub-agents to tackle complex software engineering tasks. By dividing web development across autonomous agent teams working concurrently, the system can generate multi-page websites and frontend applications significantly faster than traditional single-agent approaches.

OPEN SOURCE4h ago

Jakub Antalik releases thinking-orbs for AI UI states

thinking-orbs is an open-source animation library designed by Jakub Antalik to replace static spinners with state-aware visual loading indicators for AI agents. Built for React and Tailwind CSS, the SSR-safe library provides six hand-tuned canvas states with automatic theme switching and preset sizing.