Claude Code soft_deny policy hits human review gap

// 45d agoNEWS

Claude Code soft_deny policy hits human review gap

Hedgineer's enterprise rollout of Claude Code reveals that natural language 'soft_deny' rules, while doubling automated rejections, fail to catch many risky bash commands that developers still manually veto. The findings highlight a persistent gap between AI intent classification and human risk assessment in autonomous coding environments.

// ANALYSIS

Automation is only as good as its telemetry, and currently, Claude's 'soft_deny' is a blunt instrument that misses subtle context.

–Soft_deny rules are bypassable by explicit user intent, making them "negotiable" guardrails rather than hard blocks
–Classifier-driven rejections jumped 123% post-policy, yet Bash remains the top tool rejected by humans in the loop
–Current OTEL spans don't distinguish between hard, soft, and permission denials, making it impossible to surgically tune rules
–The "trap" of omitting "$defaults" in config can inadvertently allow dangerous operations like force pushes
–Enterprise safety relies on identifying "bad vibes" in telemetry and encoding them back into natural language policy

// TAGS

claude-codeai-codingagentsafetyobservabilitymcpdevtool

DISCOVERED

45d ago

2026-05-30

PUBLISHED

45d ago

2026-05-30

RELEVANCE

8/ 10

AUTHOR

dani_avila7

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

OPEN SOURCE3h ago

scroll-world launches scroll-driven 3D flight skill

scroll-world is an open-source, framework-agnostic agent skill that leverages Higgsfield to generate immersive, scroll-driven 3D camera flights through diorama scenes for landing pages. By rendering seamless connection clips between neighboring frames, it allows developers to build interactive 3D narrative websites navigated simply by scrolling, without requiring heavy game engines.

MODEL4h ago

OpenAI GPT-5.6 hits Amazon Bedrock

OpenAI's GPT-5.6 model family—including Sol, Terra, and Luna—is now generally available on Amazon Bedrock. Running on Bedrock's next-generation inference engine, the models support prompt caching with a 90% discount and match OpenAI's first-party pricing.

UPDATE5h ago

OpenRouter splits rankings by model weight

OpenRouter has updated its rankings platform by introducing separate leaderboards for open-weight and closed-weight models. This allows developers to track and compare usage statistics of proprietary, API-exclusive models against downloadable open-weight models.