VocoLoco launches local TTS, voice cloning
VocoLoco is an open-source, browser-based text-to-speech studio that runs entirely on-device with no server or API keys. It uses OmniVoice via ONNX Runtime Web, WebGPU, and WASM to provide local speech generation, voice design controls, and voice cloning, with saved voices and generation history stored in the browser.
Hot take: this is a strong privacy-first demo of where local AI audio is headed, but the hardware bar is real and will limit adoption outside desktop-class machines.
- The main appeal is zero-data-leaves-device execution, which is a meaningful differentiator versus hosted TTS services.
- Voice cloning plus local persistence makes it more than a toy; it is positioned as a personal studio, not just a demo.
- The biggest constraint is compatibility: the project explicitly targets WebGPU-capable desktop browsers, so mid-range phones and weaker GPUs are likely out.
- The product is best framed as an open-source research/engineering release rather than a mass-market consumer app.
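To make the compatibility point concrete, here is a minimal sketch of how a browser app might choose between a WebGPU backend and a WASM fallback. It is not taken from VocoLoco's source: `providerOrder`, `detectProviders`, and the `NavigatorLike` type are hypothetical names used for illustration. The only real APIs assumed are `navigator.gpu.requestAdapter()` from WebGPU and the `"webgpu"` and `"wasm"` execution provider names from ONNX Runtime Web.

```typescript
// Minimal structural types so the sketch compiles without WebGPU type definitions.
// (Hypothetical helper types, not part of any library.)
interface GpuLike {
  requestAdapter(): Promise<unknown>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

type Provider = "webgpu" | "wasm";

// Pure decision step: return backends in order of preference.
// WASM is always kept as the last resort.
function providerOrder(hasAdapter: boolean): Provider[] {
  return hasAdapter ? ["webgpu", "wasm"] : ["wasm"];
}

// Probe the browser: the WebGPU API may be missing entirely, or present
// but unable to supply an adapter (requestAdapter resolves to null).
async function detectProviders(nav: NavigatorLike): Promise<Provider[]> {
  if (!nav.gpu) {
    return providerOrder(false);
  }
  try {
    const adapter = await nav.gpu.requestAdapter();
    return providerOrder(adapter != null);
  } catch {
    // Treat any probing failure as "no usable GPU".
    return providerOrder(false);
  }
}
```

In a browser, the result of `detectProviders(navigator)` could be passed as the `executionProviders` option when creating an ONNX Runtime Web inference session. Whether WASM-only inference is fast enough for a speech model is a separate question, which is the hardware bar the hot take refers to.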
DISCOVERED
62d ago
2026-04-09
PUBLISHED
62d ago
2026-04-09
AUTHOR
Zestyclose_Salary738
