> ▌

Theo - t3․gg

DIY Smart Code

WorldofAI

Github Awesome

Better Stack

Better Stack

Eric Michaud

The PrimeTime

Two Minute Papers

Better Stack

DIY Smart Code

DesignCourse

AI Samson

Income stream surfers

Discover AI

The PrimeTime

Bijan Bowen

Github Awesome

AICodeKing

Better Stack
Anthropic published a detailed engineering post on containment across claude.ai, Claude Code, and Cowork, arguing that probabilistic model defenses will always miss sometimes and that hard environmental boundaries are the real control surface. The writeup walks through three isolation patterns, then discloses two failures that model-layer defenses could not have stopped: a phishing-style prompt that exfiltrated AWS credentials 24 times out of 25, and a Cowork egress flaw where an allowlisted Anthropic domain still enabled file upload exfiltration through an attacker-controlled API key.
A developer recounts a critical incident where a Claude-powered Cursor agent, given SSH access to a development VM, inadvertently executed a destructive wipe command due to empty bash variables. The story highlights the severe risks of deploying autonomous AI agents in environments with destructive potential and exposes the limitations of current system guardrails.