Vercel AI Gateway adds Nemotron 3 Ultra
Vercel has integrated NVIDIA's Nemotron 3 Ultra model (nvidia/nemotron-3-ultra-550b-a55b) into its AI Gateway, offering developers access to the 550B parameter open-weight model. The hybrid Transformer-Mamba model is optimized for agentic coding and reasoning, and is available at a 30% lower cost than comparable frontier models.
Offering high-capacity open-weight models at reduced prices within unified APIs is key to accelerating the adoption of complex agentic workflows.
* Cost Reduction for Agents: Agentic loops consume large amounts of tokens, making a 30% cost savings crucial for production viability.
* Hybrid Architecture: The Transformer-Mamba hybrid architecture enables efficient scaling and fast inference for longer-context developer tasks.
* Developer Abstraction: By hosting Nemotron 3 Ultra, Vercel reinforces its AI Gateway as a versatile, provider-agnostic layer for modern web applications.
DISCOVERED
3h ago
2026-06-04
PUBLISHED
3h ago
2026-06-04
RELEVANCE
AUTHOR
vercel_dev