Scenema Audio drops open-source emotional voice cloning

// 45d ago · OPENSOURCE RELEASE

Scenema Audio is a new open-source, zero-shot voice cloning model that decouples voice identity from emotional performance. Built on Gemma 3 and an LTX diffusion transformer, it uses XML-style stage directions to make any cloned voice perform complex emotions like rage or grief alongside scene-aware background audio.

// ANALYSIS

Scenema shifts the TTS focus from phonetic accuracy to acting, targeting the persistent "robotic" feel of most open-source audio generators. Because identity is decoupled from emotion, you can clone a flat 10-second reference clip and make that voice scream, whisper, or cry. XML-based action tags give creators fine-grained control over mid-sentence emotional shifts, pacing, and breath control. Co-generating speech and ambient environmental audio in a single pass simplifies audio-first video generation workflows. The 16GB VRAM requirement keeps this high-fidelity, performative audio within reach of developers on consumer hardware.
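To make the stage-direction idea concrete, here is a minimal Python sketch of how a script with per-segment emotion tags might be assembled. The `<speak>` and `<action>` element names, the `emotion` attribute, and the `build_script` helper are all hypothetical, chosen for illustration only. The source does not document Scenema Audio's actual tag vocabulary, so check the project's own documentation before relying on any of these names.

```python
import xml.etree.ElementTree as ET


def build_script(segments):
    """Wrap (emotion, text) pairs in XML-style direction tags.

    Tag and attribute names are hypothetical placeholders,
    not Scenema Audio's documented schema.
    """
    root = ET.Element("speak")  # hypothetical root element
    for emotion, text in segments:
        # One element per segment, so the emotion can change mid-script.
        node = ET.SubElement(root, "action", {"emotion": emotion})
        node.text = text
    # encoding="unicode" returns a str rather than bytes.
    return ET.tostring(root, encoding="unicode")


script = build_script([
    ("whisper", "I thought you had left."),
    ("rage", "You lied to me!"),
])
print(script)
```

Building the markup with an XML library rather than string concatenation keeps the tags well-formed and escapes any special characters in the spoken text.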

// TAGS

scenema-audio, tts, speech, audio-gen, open-weights, open-source

DISCOVERED

45d ago

2026-05-17

PUBLISHED

45d ago

2026-05-17

RELEVANCE

8/10

AUTHOR

AI Search
