YOU ARE VIEWING ONE ITEM FROM THE AICRIER FEED

NVIDIA GB300 NVL72 hits 20x AgentPerf throughput

AICrier tracks AI developer news across Product Hunt, GitHub, Hacker News, YouTube, X, arXiv, and more. This page keeps the article you opened front and center while giving you a path into the live feed.

// WHAT AICRIER DOES

7+

TRACKED FEEDS

24/7

SCRAPED FEED

Short summaries, external links, screenshots, relevance scoring, tags, and featured picks for AI builders.

NVIDIA GB300 NVL72 hits 20x AgentPerf throughput
OPEN LINK ↗
// 2h agoBENCHMARK RESULT

NVIDIA GB300 NVL72 hits 20x AgentPerf throughput

NVIDIA's GB300 NVL72 platform achieved record performance on Artificial Analysis's new AA-AgentPerf benchmark, delivering up to 20x more concurrent agents per megawatt than previous H200 systems. The platform leverages full-stack optimizations like high-speed NVLink fabrics and DeepGEMM to handle the complex, multi-turn reasoning requirements of advanced AI agents.

// ANALYSIS

As the industry shifts from simple chatbots to long-horizon agentic workflows, hardware optimization is evolving from raw token latency to high-density, multi-turn reasoning throughput.

* The shift to "relay-style" agent workflows makes traditional token-per-second benchmarks less relevant compared to multi-turn agent trajectories.

* Fusing 72 GPUs over a single NVLink domain is crucial for maintaining low-latency state sharing in complex, iterative reasoning tasks.

* Hardware-software co-design using DeepGEMM, MXFP4 precision, and SGLang routing is proving to be a major differentiator for scaling agent concurrency.

// TAGS
nvidiagb300-nvl72agentperfagentbenchmarkgpublackwelldeepgemm

DISCOVERED

2h ago

2026-06-22

PUBLISHED

2h ago

2026-06-22

RELEVANCE

9/ 10

AUTHOR

DIY Smart Code