Grok 4.20 multi-agent beta burns 333k tokens on trivial prompt

// 74d agoNEWS

Grok 4.20 multi-agent beta burns 333k tokens on trivial prompt

A user testing Grok 4.20 Multi-Agent Beta via OpenRouter triggered 333,210 input tokens and 53,859 output tokens — costing $0.47 — just by asking how many agents were working on a response. The token explosion on a near-zero-complexity prompt raises questions about whether xAI's multi-agent coordination overhead is a bug or an inherent architectural cost.

// ANALYSIS

This is a red flag for anyone considering Grok's multi-agent tier in production: the cost-per-query economics appear broken at launch.

–333k input tokens on a three-word follow-up question suggests the system is passing full conversation context to every spawned agent simultaneously — a naive fan-out architecture
–At that rate, real workloads could cost 10-100x more than equivalent single-agent calls with no clear quality benefit
–The original poster noted Grok 4.2 single-agent scored reasonably on the AA-Index, making the multi-agent overhead even harder to justify
–OpenRouter users are effectively beta-testing internal xAI orchestration plumbing — pricing and behavior may change without warning
–This mirrors early complaints about other multi-agent frameworks (AutoGen, CrewAI) where agent-to-agent context passing silently multiplies token costs

// TAGS

grokllmagentinferencebenchmark

DISCOVERED

74d ago

2026-03-15

PUBLISHED

74d ago

2026-03-14

RELEVANCE

6/ 10

AUTHOR

rnahumaf

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

NEWS2h ago

Dev lets Claude trade BTC overnight, nets $95 profit

A developer gave Claude a $20 budget to autonomously script and execute Bitcoin trades overnight, waking up to a functional trading bot and a $95 profit across five trades.

OPEN SOURCE2h ago

Plannotator 0.19.24 adds Amp support and configurable storage

Plannotator 0.19.24 is a substantial release that expands the tool beyond Claude Code with native Amp support, adds a `PLANNOTATOR_DATA_DIR` override so users can move the default `~/.plannotator` data directory, introduces Auto Mode in the permission selector for newer Claude Code versions, and fixes a Pi approval crash after plan acceptance. The update folds multiple stacked PRs into one release and pushes the project further toward a multi-agent review layer rather than a single-agent hook utility.

NEWS3h ago

Aaronson says AI turns mathematicians into curators

Scott Aaronson says recent AI results in mathematics, including a GPT-5.5 Pro solution to Erdős’s Unit Distance Problem, suggest humans may increasingly focus on choosing questions and interpreting model outputs. He extends the argument to AI-written fiction and the Vatican’s AI encyclical as signs of a broader cultural shift.