NVIDIA PersonaPlex: Open-source full-duplex voice hits 170ms
REDDIT // 20d ago // OPEN-SOURCE RELEASE


NVIDIA's new 7B full-duplex AI model enables real-time, simultaneous voice interactions with sub-200ms latency. The open-weights release supports custom personas and zero-shot voice cloning but requires significant VRAM.

// ANALYSIS

PersonaPlex shifts conversational AI from slow, sequential turns to fluid, human-like dialogue by listening and speaking simultaneously.

  • Full-duplex architecture handles interruptions and "backchanneling" naturally, both of which remain major hurdles for current voice assistants.
  • Sub-200ms latency outperforms commercial leaders like Gemini Live, effectively removing the "thinking" pause between exchanges.
  • Open-weights distribution on GitHub and Hugging Face allows for developer fine-tuning and local experimentation.
  • High hardware floor (24GB VRAM) remains the bottleneck for local users, though 4-bit quantization may soon lower the entry barrier.
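
For a rough sense of why quantization matters for the VRAM point above, here is a back-of-the-envelope sketch of weight memory at different precisions. It assumes exactly 7e9 parameters (the "7B" figure is rounded) and counts only the weights; activations, the attention cache, and any audio codec are ignored, so real usage will be higher and this does not by itself account for the 24GB figure. The function name `weight_memory_gib` is illustrative and not part of any PersonaPlex tooling.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumption: "7B" taken as exactly 7e9 parameters.
PARAMS = 7e9

# Compare common storage precisions (weights only, no runtime overhead).
for label, bits in [("fp16/bf16", 16), ("int8", 8), ("4-bit", 4)]:
    print(f"{label:>9}: {weight_memory_gib(PARAMS, bits):.1f} GiB")
```

Under these assumptions, 16-bit weights need about 13 GiB and 4-bit weights about a quarter of that, which is why 4-bit quantization could plausibly bring the model within reach of smaller consumer GPUs.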
// TAGS
personaplex · llm · speech · audio-gen · open-source · nvidia

DISCOVERED

20d ago

2026-03-23

PUBLISHED

20d ago

2026-03-22

RELEVANCE

9/10

AUTHOR

iKontact