NVIDIA Cosmos 3 hits DeepInfra serverless

// 45d ago · INFRASTRUCTURE

DeepInfra has added serverless inference support for NVIDIA's Cosmos 3 physical AI foundation model family, starting with the 16B Cosmos 3 Nano model. The Mixture-of-Transformers architecture enables sub-second reasoning and generation for physical AI applications like robotics and autonomous vehicles without local GPU requirements.

// ANALYSIS

Hosting Cosmos 3 on DeepInfra gives developers without local GPUs access to low-latency physical AI reasoning, but the real test is whether serverless round-trip latency can meet the strict real-time requirements of edge robotics.

* Cosmos 3 Nano's 16B parameter size is optimized for sub-second inference, making it suitable for latency-sensitive applications.

* The Mixture-of-Transformers architecture represents a shift towards models that reason about physics and action before generating output.

* Serverless hosting on DeepInfra removes the need to provision and manage GPUs, which lowers cost and infrastructure complexity for developers prototyping in robotics and simulation.
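
The latency question raised above can be made concrete with a small check: compare a tail percentile of measured round-trip latencies against the period of the robot's control loop. The sketch below is illustrative only. The sample latencies are hypothetical values, not measurements of Cosmos 3 or DeepInfra, and the helper names `percentile` and `fits_control_loop` are assumptions introduced for this example.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: smallest sample with at least pct% of values at or below it."""
    if not samples:
        raise ValueError("need at least one sample")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def fits_control_loop(latencies_ms, loop_hz, pct=99):
    """Return True if the pct-th percentile latency fits within one control-loop period."""
    budget_ms = 1000.0 / loop_hz  # one loop period in milliseconds
    return percentile(latencies_ms, pct) <= budget_ms


# Hypothetical round-trip latencies in milliseconds (illustrative, not measured).
samples_ms = [180, 210, 195, 240, 900, 205, 190, 220, 215, 200]

print("p50 (ms):", percentile(samples_ms, 50))
print("fits 1 Hz planning loop:", fits_control_loop(samples_ms, loop_hz=1))
print("fits 10 Hz control loop:", fits_control_loop(samples_ms, loop_hz=10))
```

Checking the tail rather than the mean is a deliberate choice here: a single slow response (such as a cold start) can miss a control deadline even when the typical latency is sub-second.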

// TAGS

nvidia, nvidia-cosmos-3, deepinfra, physical-ai, serverless-inference, robotics, model-hosting

DISCOVERED

45d ago

2026-06-03

PUBLISHED

45d ago

2026-06-03

RELEVANCE

8/10

AUTHOR

DeepInfra
