DeepInfra hosts NVIDIA Nemotron 3.x
DeepInfra has introduced day-zero support for NVIDIA's newly released Nemotron 3.x models, hosting both Nemotron 3 Ultra and Nemotron 3.5 Content Safety. The open models are live on DeepInfra's zero-retention, enterprise-grade inference platform, offering up to 5x faster inference for agentic reasoning and robust multimodal safety filtering.
Serverless inference providers are competing fiercely on day-zero model availability; hosting NVIDIA's Nemotron 3.x allows DeepInfra to capture early developer interest for highly optimized, cost-effective agentic workflows.
- –Up to 5x faster inference and 30% lower cost on Nemotron 3 Ultra significantly lowers the barrier for running complex, long-running agent tasks.
- –Immediate hosting of Nemotron 3.5 Content Safety gives developers a robust, multimodal safety evaluation tool directly integrated with high-throughput API endpoints.
- –Day-zero access eliminates cold-start infrastructure challenges, enabling rapid prototyping of safety guardrails and frontier reasoning applications.
DISCOVERED
2h ago
2026-06-04
PUBLISHED
2h ago
2026-06-04
RELEVANCE
AUTHOR
DeepInfra