NVIDIA drops Nemotron 3.5 Content Safety
NVIDIA released Nemotron 3.5 Content Safety, a 4-billion parameter guardrail model fine-tuned from Google's Gemma-3-4B to moderate LLM inputs and outputs. It classifies text and images across 23 safety categories in 12 languages and features a chain-of-thought "THINK Mode" to explain safety decisions.
AI safety is evolving from a static blocklist into a programmable policy runtime.
- –Built-in reasoning (THINK Mode) addresses the black-box problem of LLM moderation, making decisions auditable and easier to debug.
- –The 4B parameter size allows the model to run efficiently as a sidecar alongside main application LLMs without causing unacceptable latency.
- –Relying on the Aegis v2 taxonomy provides a standardized framework that fits enterprise safety compliance workflows.
DISCOVERED
2h ago
2026-06-06
PUBLISHED
2h ago
2026-06-06
RELEVANCE
AUTHOR
dball1126