Fish Speech S2 Pro open-sources 15k inline tags

// 121d agoOPENSOURCE RELEASE

Fish Speech S2 Pro open-sources 15k inline tags

Fish Audio open-sources S2 Pro, a 4.4B parameter text-to-speech model featuring a dual-autoregressive architecture and unprecedented word-level emotional control via 15,000+ natural language inline tags.

// ANALYSIS

Fish Speech S2 Pro is a direct shot at ElevenLabs, offering production-grade latency and deep prosodic control that was previously locked behind proprietary APIs.

–Dual-AR architecture (4B Slow AR + 400M Fast AR) balances linguistic structure with high-fidelity acoustic detail for more natural phrasing
–Massive library of 15,000+ inline tags like [whisper] and [sigh] allows for granular emotional directing without external conditioning models
–Optimized for sub-150ms latency on H100/H200 hardware, making it viable for real-time conversational agents and interactive gaming
–Multilingual support for 80+ languages trained on 10M+ hours of audio puts it in the top tier of open-weights TTS models
–Early user reports suggest a learning curve for local hardware optimization, but the underlying model quality is a significant leap for the open-source audio ecosystem

// TAGS

fish-speechttsaudio-genopen-sourceopen-weightsspeechai-audio

DISCOVERED

121d ago

2026-03-26

PUBLISHED

121d ago

2026-03-26

RELEVANCE

9/ 10

AUTHOR

iKontact

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

VIDEO13m ago

Granola CEO demonstrates OpenAI Codex browser automation

In a video demonstration presented by Every, Granola's CEO showcases OpenAI Codex functioning as an autonomous agent executing complex, multi-step browser workflows. Drawing upon saved user context, Codex navigates web applications and customer support chats to negotiate an internet plan migration and eliminate extra fees.

LAUNCH1h ago

Moonshot AI introduces Kimi K3 Agent Swarm

Moonshot AI has introduced Agent Swarm mode for Kimi K3, a horizontal scaling architecture capable of coordinating up to 300 parallel sub-agents to tackle complex software engineering tasks. By dividing web development across autonomous agent teams working concurrently, the system can generate multi-page websites and frontend applications significantly faster than traditional single-agent approaches.

OPEN SOURCE2h ago

Jakub Antalik releases thinking-orbs for AI UI states

thinking-orbs is an open-source animation library designed by Jakub Antalik to replace static spinners with state-aware visual loading indicators for AI agents. Built for React and Tailwind CSS, the SSR-safe library provides six hand-tuned canvas states with automatic theme switching and preset sizing.