Cursor makes Auto-review default safety layer
Cursor has launched Auto-review, a contextual safety feature that is now the default for new users to govern local agent autonomy. The feature uses a specialized classifier subagent to evaluate terminal commands and tools in context, blocking risky actions and providing feedback to help agents self-correct.
By reframing agent safety from a binary block/allow switch to a contextual 'dial,' Cursor addresses the developer fatigue of constant approval prompts while keeping local agents securely bounded. Agentic safety classifiers that can inspect files and folders in the background represent the next phase of agent security, moving away from rigid, regex-based command allowlists. Furthermore, the feedback loop—where the classifier explains blocks back to the parent agent rather than immediately halting execution—allows agents to auto-correct and try safer alternative tools, significantly improving user experience. While a 97% accuracy rate is strong, the remaining 3% error rate on ambiguous edges highlights the difficulty of defining precise safety boundaries in complex development environments. Finally, running the classifier within the same RPC stream to minimize latency is a smart architectural choice to avoid slowing down agent execution loops.
DISCOVERED
2h ago
2026-06-11
PUBLISHED
2h ago
2026-06-11
RELEVANCE
AUTHOR
cursor_ai