SILMA AI drops open-source Arabic-English TTS model
REDDIT // 27d ago // OPENSOURCE RELEASE

SILMA AI has released the weights of its 150M-parameter bilingual text-to-speech model under Apache 2.0. The model supports Modern Standard Arabic and English and can clone a voice from under 8 seconds of reference audio. Built on the F5-TTS diffusion architecture and trained from scratch on tens of thousands of hours of audio, it runs at a real-time factor (RTF) of about 0.12 on an RTX 4090.
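
To put the RTF figure in concrete terms: RTF is conventionally the time spent synthesizing divided by the duration of the audio produced, so values below 1 mean faster than playback. The sketch below works through that arithmetic. The 1.2 s / 10 s timing is a hypothetical measurement chosen to match the quoted ~0.12, not a benchmark from the release, and the function names are illustrative.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent synthesizing / duration of the audio produced."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def speedup_over_playback(rtf: float) -> float:
    """Seconds of audio produced per second of compute (inverse of RTF)."""
    if rtf <= 0:
        raise ValueError("rtf must be positive")
    return 1.0 / rtf


# Hypothetical measurement: 1.2 s of GPU time to synthesize a 10 s clip.
rtf = real_time_factor(1.2, 10.0)
print(f"RTF: {rtf:.2f}")                                    # RTF: 0.12
print(f"Speedup: {speedup_over_playback(rtf):.1f}x")        # Speedup: 8.3x
print(f"Compute for 60 s of speech: {60 * rtf:.1f} s")      # Compute for 60 s of speech: 7.2 s
```

Note that RTF describes throughput only; time to first audio chunk also depends on how the model is chunked and served.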

// ANALYSIS

Arabic is badly underserved in open-source TTS. By filling that gap with a commercially permissive, production-ready release, SILMA makes a meaningful contribution to the ecosystem.

  • At 150M parameters, it is roughly half the size of the base F5-TTS configuration while retaining voice cloning and full Arabic diacritization (Tashkeel) support, a real engineering win for edge and low-resource deployments
  • An RTF of ~0.12 means audio is synthesized roughly 8x faster than it plays back, which makes real-time streaming viable and keeps latency competitive with ElevenLabs and PlayHT, even on a single consumer GPU
  • The Apache 2.0 license removes the friction that hampered commercial adoption of F5-TTS, whose pretrained weights were released under a non-commercial license; developers can ship with far less legal review overhead
  • Arabic NLP has lagged behind English by years; a high-quality open bilingual TTS directly unblocks voice assistants, accessibility tools, and content generation for 400M+ Arabic speakers
  • Available via `pip install silma-tts` with a one-command demo app, an unusually smooth developer experience for an academic-style model drop
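
As a rough check on the edge-deployment point above, the weight footprint of a 150M-parameter model can be estimated from the bytes stored per parameter. This is a back-of-the-envelope sketch, not a measurement of the released checkpoint: it counts weights only (no activations, vocoder, or runtime overhead), and the int8 row is a hypothetical quantization that the release does not claim to ship.

```python
PARAMS = 150_000_000  # parameter count quoted in the release

# Bytes per parameter for common storage formats.
# int8 is a hypothetical quantized variant, included only for comparison.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_megabytes(n_params: int, dtype: str) -> float:
    """Approximate size of the weights alone, in megabytes (1 MB = 1e6 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e6


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_megabytes(PARAMS, dtype):.0f} MB")
# float32: 600 MB
# float16: 300 MB
# int8: 150 MB
```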

// TAGS

silma-tts · speech · audio-gen · open-source · open-weights

DISCOVERED: 2026-03-15 (27d ago)
PUBLISHED: 2026-03-15 (27d ago)
RELEVANCE: 7/10
AUTHOR: oudak2019