Parakeet TDT v3 adds 25 languages, 3000x speed

// MODEL RELEASE (112d ago)

NVIDIA's latest Token-and-Duration Transducer (TDT) models achieve 3000x real-time throughput, with v3 expanding support to 25 languages while v2 remains the precision choice for English-only tasks.

// ANALYSIS

Parakeet TDT is the speed king of ASR, effectively rendering Whisper-based pipelines obsolete for high-volume tasks, but it prioritizes "clean" readability over a verbatim record of the audio. Version 2 remains the precision choice for English-only tasks, with a 6.05% word error rate (WER) against v3's 6.32%. v3's primary value lies in its 25-language multilingual engine and its improved robustness against hallucinated text on non-speech audio. Both models aggressively filter filler words such as "um" and "uh," making them unsuitable for legal or clinical work that requires verbatim transcripts. An RTFx (inverse real-time factor) of 3000 means one hour of audio is processed in about 1.2 seconds (3600 s / 3000), enabling massive-scale transcription at very low compute cost per audio hour.
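
The throughput and accuracy figures above reduce to simple arithmetic. The sketch below is illustrative only: the helper names `processing_seconds` and `relative_wer_gap` are hypothetical and not part of any NVIDIA tooling, and it assumes RTFx is defined as audio duration divided by processing time, with the quoted 3000x sustained for the whole job.

```python
def processing_seconds(audio_seconds: float, rtfx: float) -> float:
    """Wall-clock seconds to transcribe `audio_seconds` of audio.

    Assumes RTFx = audio duration / processing time (hypothetical helper).
    """
    if rtfx <= 0:
        raise ValueError("rtfx must be positive")
    return audio_seconds / rtfx


def relative_wer_gap(wer_baseline: float, wer_other: float) -> float:
    """Relative WER increase of `wer_other` over `wer_baseline`, as a fraction."""
    if wer_baseline <= 0:
        raise ValueError("wer_baseline must be positive")
    return (wer_other - wer_baseline) / wer_baseline


if __name__ == "__main__":
    # One hour of audio at the quoted 3000x RTFx.
    print(f"1 h of audio: {processing_seconds(3600, 3000):.1f} s")
    # 10,000 hours of audio at the same rate, expressed in hours of compute.
    print(f"10,000 h of audio: {processing_seconds(10_000 * 3600, 3000) / 3600:.2f} h")
    # v2 (6.05% WER) versus v3 (6.32% WER), figures taken from the analysis above.
    print(f"v3 relative WER increase over v2: {relative_wer_gap(6.05, 6.32):.1%}")
```

On these numbers, v3's English WER is roughly 4 to 5 percent higher than v2's in relative terms, which is the cost of the added multilingual coverage.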

// TAGS

speech, open-source, llm, nvidia-parakeet-tdt

DISCOVERED

2026-04-04 (112d ago)

PUBLISHED

2026-04-03 (112d ago)

RELEVANCE

8/10

AUTHOR

walleynguyen
