Grok TTS API debuts, five voices
xAI has opened a Text to Speech beta inside Grok Voice, with five voices, expressive speech tags, and telephony-ready output via `POST /v1/tts`. It pushes Grok beyond chat into a more complete voice platform for developers.
This looks less like a cute TTS add-on and more like xAI staking out the full voice stack around Grok.
- –xAI now exposes voice agents, text to speech, and speech to text under one umbrella, which reduces the need to stitch together multiple vendors.
- –The API returns raw audio and supports inline expressive tags, so it is built for product UX, not just demo playback.
- –Five voices and a beta label suggest the surface area is still narrow, but the 20-language support makes it immediately usable.
- –Pricing at $4.20 per 1M characters is a clear production signal, even if the product is still early.
- –Product Hunt’s Grok launch trail suggests xAI is methodically turning Grok into an API family, not a single chatbot feature.
DISCOVERED
70d ago
2026-03-18
PUBLISHED
70d ago
2026-03-18
RELEVANCE