OpenAI adds realtime translation model
OpenAI introduced GPT-Realtime-Translate in the Realtime API, a live speech translation model that supports 70+ input languages and 13 output languages. It ships alongside new realtime voice and transcription models for developers building low-latency voice apps.
OpenAI is turning translation into an API primitive instead of a separate pipeline step, which is the right move if it wants voice apps to feel native rather than stitched together.
- –Live translation inside the same realtime stack reduces the need to chain ASR, MT, and TTS services
- –The 70+ to 13 language matrix makes this immediately useful for support, events, education, and cross-border sales
- –Pricing per minute suggests OpenAI wants production adoption, not just a demo loop
- –The real test is latency plus accent robustness in messy real-world audio, not benchmark fluency
- –Bundling translation with GPT-Realtime-2 and GPT-Realtime-Whisper signals a broader voice platform, not a one-off feature
DISCOVERED
1h ago
2026-05-07
PUBLISHED
2h ago
2026-05-07
RELEVANCE
AUTHOR
OpenAI