OPEN_SOURCE
REDDIT · 22d ago · MODEL RELEASE
NVIDIA Nemotron-Cascade 2 debuts open 30B MoE
NVIDIA's Nemotron-Cascade-2-30B-A3B is an open 30B MoE model with 3B activated parameters, tuned for reasoning and agentic tasks. It ships with both thinking and instruct modes, plus checkpoints and a paper on Hugging Face.
// ANALYSIS
Hot take: this looks like NVIDIA pushing sparse MoE as the practical path to "big-model" capability without big-model cost. The real question is whether it delivers outside benchmark slides, but the packaging is strong enough that developers will actually try it.
- The model card says it is an open 30B MoE with 3B active parameters, released under NVIDIA's Open Model License on March 19, 2026.
- NVIDIA's claims lean hard on math, code, and agentic evals, including IMO, IOI, ICPC, LiveCodeBench, and SWE-bench Verified performance.
- Early Reddit reaction is split between "interesting and usable" and "sparse MoE parameter counts can be misleading," which is the right kind of skepticism for this release.
- One commenter reports it runs on 6GB of VRAM plus system RAM and is useful for background documentation and small coding tasks, which suggests real local-deployment appeal.
- The open chat template and tool-use prompting make it easier to slot into agent workflows than a raw benchmark drop.
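The 6GB-VRAM claim is plausible from back-of-envelope arithmetic: in an MoE, all expert weights must live somewhere, but only the active parameters are touched per token, so the hot path can sit on a small GPU while the rest spills to system RAM. A rough sketch, assuming 4-bit quantization (real quant formats add overhead):

```python
# Back-of-envelope memory for a 30B-total / 3B-active MoE at 4-bit
# quantization. Illustrative only; actual GGUF/AWQ sizes vary.
total_params = 30e9    # all experts combined
active_params = 3e9    # parameters touched per token
bytes_per_param = 0.5  # 4-bit quantization

weights_gb = total_params * bytes_per_param / 1e9  # must fit in RAM+VRAM
active_gb = active_params * bytes_per_param / 1e9  # per-token compute path

print(weights_gb, active_gb)  # -> 15.0 1.5
```

At ~15GB for the full weight set and ~1.5GB for the active path, a 6GB card plus ordinary system RAM is a comfortable fit, which matches the commenter's experience.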
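The tool-use point above is mostly about message plumbing: an open chat template means an agent loop just assembles role-tagged messages and a tool schema. A minimal sketch in the common OpenAI-style function-calling format that many open chat templates accept; the exact field names Nemotron-Cascade-2 expects are an assumption here, so check the model card:

```python
import json

# Hypothetical tool schema (OpenAI-style); the concrete format expected by
# Nemotron-Cascade-2's chat template is an assumption -- verify on the model card.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "read_file",
        "description": "Return the contents of a local text file.",
        "parameters": {
            "type": "object",
            "properties": {"path": {"type": "string"}},
            "required": ["path"],
        },
    },
}]

def build_messages(user_prompt, tool_call=None, tool_result=None):
    """Assemble one turn of an agent loop as chat-template messages."""
    messages = [{"role": "user", "content": user_prompt}]
    if tool_call is not None:
        # Echo the assistant's tool call back so the model sees its own action,
        # then attach the tool's result as a "tool" role message.
        messages.append({"role": "assistant", "content": None,
                         "tool_calls": [tool_call]})
        messages.append({"role": "tool", "content": json.dumps(tool_result)})
    return messages

msgs = build_messages(
    "Summarize README.md",
    tool_call={"type": "function",
               "function": {"name": "read_file",
                            "arguments": json.dumps({"path": "README.md"})}},
    tool_result={"text": "Example project."},
)
print([m["role"] for m in msgs])  # -> ['user', 'assistant', 'tool']
```

In practice this list would be fed through the tokenizer's chat template (e.g. `apply_chat_template` in Hugging Face Transformers) rather than formatted by hand, which is exactly what makes an openly documented template easier to integrate than a benchmark-only release.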
// TAGS
llm · reasoning · agent · open-weights · open-source · nemotron-cascade-2-30b-a3b
DISCOVERED
2026-03-21
PUBLISHED
2026-03-21
RELEVANCE
9/10
AUTHOR
Odd-Ordinary-5922