OPEN_SOURCE
REDDIT · REDDIT // 20d ago · OPENSOURCE RELEASE
NVIDIA PersonaPlex: Open-source full-duplex voice hits 170ms
NVIDIA's new 7B full-duplex AI model enables real-time, simultaneous voice interactions with sub-200ms latency. The open-weights release supports custom personas and zero-shot voice cloning but requires significant VRAM.
// ANALYSIS
PersonaPlex shifts conversational AI from slow, sequential turns to fluid, human-like dialogue by listening and speaking simultaneously.
- Full-duplex architecture handles interruptions and "backchanneling" naturally, a major hurdle for current voice assistants.
- Sub-200ms latency outperforms commercial leaders like Gemini Live, effectively removing the "thinking" pause between exchanges.
- Open-weights distribution on GitHub and Hugging Face allows for developer fine-tuning and local experimentation.
- High hardware floor (24GB VRAM) remains the bottleneck for local users, though 4-bit quantization may soon lower the entry barrier.
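To put the quantization point in rough numbers, here is a back-of-the-envelope sketch of weight memory for a 7B-parameter model at different precisions. It is an estimate under stated assumptions, not a measurement of PersonaPlex: it counts weights only and ignores activations, caches, the audio codec, and framework overhead, all of which add to the real footprint. The function name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes.

    Assumption: every parameter is stored at the same precision and
    nothing else (activations, caches, runtime overhead) is counted.
    """
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 1e9


PARAMS = 7e9  # "7B" from the summary above

for bits in (16, 8, 4):
    print(f"{bits}-bit weights: ~{weight_memory_gb(PARAMS, bits):.1f} GB")
```

Under these assumptions, 4-bit weights take a quarter of the space of 16-bit weights, which is why quantization could bring the model within reach of smaller GPUs. The actual VRAM requirement depends on the runtime and would need to be measured.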
// TAGS
personaplex · llm · speech · audio-gen · open-source · nvidia
DISCOVERED
20d ago
2026-03-23
PUBLISHED
20d ago
2026-03-22
RELEVANCE
9/10
AUTHOR
iKontact