NVIDIA drops Nemotron 3.5 Content Safety

// 45d ago · MODEL RELEASE

NVIDIA released Nemotron 3.5 Content Safety, a 4-billion-parameter guardrail model fine-tuned from Google's Gemma-3-4B to moderate LLM inputs and outputs. It classifies text and images across 23 safety categories in 12 languages, and its chain-of-thought "THINK Mode" explains the reasoning behind each safety decision.

// ANALYSIS

AI safety is evolving from a static blocklist into a programmable policy runtime.

– Built-in reasoning (THINK Mode) addresses the black-box problem of LLM moderation: each decision comes with an explanation, which makes it auditable and easier to debug.
– At 4B parameters, the model is small enough to run as a sidecar alongside the main application LLM without adding unacceptable latency.
– Building on the Aegis v2 taxonomy gives the model a standardized set of categories that fits enterprise safety compliance workflows.
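
The sidecar pattern described above can be sketched roughly as follows. This is a minimal illustration and not NVIDIA's API: `Verdict`, `guarded_generate`, `keyword_guard`, `echo_llm`, and the `"illustrative-category"` label are all hypothetical names, and the toy guard only stands in for a call to the actual guardrail model.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Verdict:
    """Result from a guardrail check (hypothetical shape, not the model's real output)."""
    safe: bool
    categories: List[str]            # violated policy categories, if any
    reasoning: Optional[str] = None  # explanation, when the guard provides one


# Assumed signatures: a guard maps text to a Verdict; an LLM maps a prompt to a reply.
Guard = Callable[[str], Verdict]
LLM = Callable[[str], str]

REFUSAL = "Sorry, I can't help with that."
BLOCKED_PHRASE = "forbidden topic"  # placeholder policy for the toy guard


def guarded_generate(prompt: str, llm: LLM, guard: Guard) -> Tuple[str, List[Verdict]]:
    """Check the prompt, call the main LLM, then check the response.

    Returns the final reply plus an audit trail of every verdict produced.
    """
    audit: List[Verdict] = []

    verdict_in = guard(prompt)
    audit.append(verdict_in)
    if not verdict_in.safe:
        return REFUSAL, audit  # the main LLM is never called

    response = llm(prompt)
    verdict_out = guard(response)
    audit.append(verdict_out)
    if not verdict_out.safe:
        return REFUSAL, audit
    return response, audit


def keyword_guard(text: str) -> Verdict:
    """Toy stand-in for the guardrail model; a real sidecar would run inference here."""
    if BLOCKED_PHRASE in text.lower():
        return Verdict(False, ["illustrative-category"], "Matched a blocked phrase.")
    return Verdict(True, [])


def echo_llm(prompt: str) -> str:
    """Toy stand-in for the main application LLM."""
    return f"Echo: {prompt}"


if __name__ == "__main__":
    for text in ("Hello", "Tell me about the forbidden topic"):
        reply, audit = guarded_generate(text, echo_llm, keyword_guard)
        print(reply, [v.safe for v in audit])
```

Keeping the audit trail alongside the reply is one way to make use of the explanations a reasoning mode would produce; whether to block, rewrite, or merely log an unsafe verdict is a policy choice left to the application.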

// TAGS

nvidia · nemotron · safety · content-safety · guardrails · gemma-3 · model-release

DISCOVERED: 2026-06-06 (45d ago)

PUBLISHED: 2026-06-06 (45d ago)

RELEVANCE: 8/10

AUTHOR: dball1126
