Claude Sonnet 4.6 sharpens computer use
Anthropic’s Sonnet 4.6 is a broad upgrade across coding, computer use, long-context reasoning, agent planning, knowledge work, and design, with a 1M-token context window in beta. It’s now the default on Claude’s free and Pro plans and available across Claude, Claude Code, the API, and major clouds, making the desktop-app demo feel more like product direction than a stunt.
Hot take: this is the first Sonnet release that feels like a plausible desktop worker instead of a smarter chat model. The real story is reliability, not just raw capability. Anthropic says Sonnet 4.6 is a major step up on OSWorld-style computer use and hit 94% on a complex insurance computer-use benchmark, so the gains look real, not theatrical. The 1M-token context window matters most for long-horizon planning, where agents usually lose the thread before the work is done. Prompt-injection resistance is the gating factor for real-world app driving, and Anthropic is clearly treating safety as part of the product, not an afterthought. Keeping Sonnet 4.5 pricing while raising the ceiling makes this the model most teams will test first for browser and desktop automation.
DISCOVERED
18d ago
2026-03-24
PUBLISHED
18d ago
2026-03-24
RELEVANCE
AUTHOR
Bijan Bowen