Google has launched Gemini 3.5 Live Translate, a new audio model offering low-latency, real-time speech translation across over 70 languages.
Google has released Gemini 3.5 Live Translate, a new audio model designed to process streaming speech in near real time, providing low-latency translation across more than 70 languages. The model is aimed at developers building real-time communication tools and is supported out of the box by ecosystem partners like Agora, LiveKit, Pipecat AI, and Software Mansion. Developers can test its capabilities within Google AI Studio's live playground.
Native streaming audio-to-audio models are a major step forward, bypassing high-latency cascaded translation pipelines to enable true conversational speed.
- –Native processing lowers latency and preserves speech nuances compared to traditional cascading speech-to-text-to-translation methods.
- –Out-of-the-box partnerships with WebRTC platforms like Agora and LiveKit simplify production deployment.
- –Translating across 70+ languages establishes strong coverage, though real-world accuracy will depend on audio quality and ambient noise.
DISCOVERED
3h ago
2026-06-09
PUBLISHED
3h ago
2026-06-09
RELEVANCE
AUTHOR
googleaidevs