Fish Audio S2 open-sources expressive TTS

// 136d ago OPENSOURCE RELEASE

Fish Audio has open-sourced S2, a new text-to-speech model with natural-language emotion control, native multi-speaker generation, and a production-minded streaming stack built on SGLang. It stands out because Fish shipped a full developer package (model weights, fine-tuning code, API access, benchmarks, and self-hosting paths) instead of just a flashy demo.

// ANALYSIS

This is one of the more serious voice AI releases of the year: Fish Audio is pitching S2 as an actually deployable stack, not just a consumer voice toy. The caveat is that free use is covered by a research license, so teams need to check the commercial terms before treating it as a fully permissive open-source drop-in.

– Natural-language inline tags like [whisper], [laugh], and custom prosody cues make S2 much more steerable than TTS systems built around fixed emotion presets
– Fish is leaning hard on performance as a differentiator, claiming roughly 100 ms time-to-first-audio and strong streaming throughput on H200 hardware
– The company says S2 posts leading results on Seed-TTS Eval and EmergentTTS-Eval, beating Seed-TTS, MiniMax Speech, and a GPT-4o-mini-tts baseline on multiple measures
– Shipping the GitHub repo, Hugging Face weights, blog writeup, and hosted API together gives developers multiple adoption paths: experiment locally, self-host, or just call the service
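
To make the inline-tag idea concrete, here is a minimal sketch of how bracketed cues such as [whisper] and [laugh] can sit alongside the spoken text in a single script. The helper name `split_inline_tags` and the regular expression are assumptions made for illustration; they are not part of any Fish Audio SDK, and S2's actual tag grammar may differ.

```python
import re

# Hypothetical helper, not part of any Fish Audio SDK: it only illustrates
# how bracketed cues such as [whisper] can sit inline with the spoken text.
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]")


def split_inline_tags(script: str) -> list[tuple[str, str]]:
    """Split a script into ("tag", name) and ("text", words) segments, in order."""
    segments = []
    cursor = 0
    for match in TAG_PATTERN.finditer(script):
        # Any text between the previous tag and this one is spoken content.
        spoken = script[cursor:match.start()].strip()
        if spoken:
            segments.append(("text", spoken))
        segments.append(("tag", match.group(1).strip()))
        cursor = match.end()
    # Keep whatever spoken text follows the last tag.
    tail = script[cursor:].strip()
    if tail:
        segments.append(("text", tail))
    return segments


script = "[whisper] Don't wake them. [laugh] Too late."
for kind, value in split_inline_tags(script):
    print(f"{kind:>4}: {value}")
```

Because the cues are free text rather than a fixed enum, a script like this can carry any descriptive phrase inside the brackets, which is the contrast with preset-based emotion controls drawn above.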

// TAGS

fish-audio-s2 speech open-source inference api research

DISCOVERED

136d ago

2026-03-11

PUBLISHED

137d ago

2026-03-10

RELEVANCE

8/10

AUTHOR

[REDACTED]
