REDDIT // 29d ago // OPENSOURCE RELEASE
SongGeneration 2 beats Suno on lyric accuracy
Tencent AI Lab's SongGeneration v2 (LeVo 2) is an open-weight music generation model that produces full songs up to 4.5 minutes with vocals and full accompaniment from structured lyrics. It's the first open-source music AI to outperform Suno v5 on lyric accuracy, hitting 8.55% phoneme error rate against Suno's 12.4%, validated by 20 professional music evaluators.
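For context on the metric: phoneme error rate is commonly computed as the edit distance between the reference and recognized phoneme sequences, divided by the reference length. The sketch below assumes that standard definition. The card does not say how the evaluation extracts phonemes from generated audio, and the function name and sample phonemes are illustrative, not taken from the release.

```python
def phoneme_error_rate(reference, hypothesis):
    """Levenshtein distance between two phoneme lists, divided by len(reference).

    Assumes the common PER definition; the exact pipeline behind the
    reported 8.55% figure is not described in this summary.
    """
    if not reference:
        raise ValueError("reference must not be empty")

    # prev[j] = edit distance between the reference prefix seen so far
    # and the first j hypothesis phonemes.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_ph in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_ph in enumerate(hypothesis, start=1):
            cost = 0 if ref_ph == hyp_ph else 1
            curr.append(min(
                prev[j] + 1,         # phoneme missing from the hypothesis
                curr[j - 1] + 1,     # extra phoneme in the hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(reference)


# Illustrative ARPAbet-style phonemes for "hello world"; the sung version drops one "L".
ref = ["HH", "AH", "L", "OW", "W", "ER", "L", "D"]
hyp = ["HH", "AH", "L", "OW", "W", "ER", "D"]
print(f"PER = {phoneme_error_rate(ref, hyp):.3f}")
```

One dropped phoneme out of eight gives a PER of 0.125, so a reported 8.55% corresponds to roughly one phoneme error in every twelve reference phonemes.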
// ANALYSIS
Open-source music AI just crossed the commercial quality threshold — and the gap to Suno and Udio is now measurable and real.
- PER benchmark puts SongGeneration v2 ahead of the commercial competitors cited: 8.55% vs. 9.96% for Mureka v8 and 12.4% for Suno v5. Lyric hallucination has been the Achilles heel of open-source music AI, and DPO training on 200K preference pairs sharply reduces it
- Hybrid LLM-diffusion architecture uses dual parallel token streams (mixed, plus separated vocal/accompaniment) for structural coherence, then a diffusion renderer for high-fidelity audio; a smart two-stage design borrowed from the image-gen playbook
- Three model variants span 10–28 GB of VRAM, making it accessible across consumer and datacenter GPUs; the large model runs at 0.82x real-time on an H20
- Multilingual from day one (Chinese, English, Spanish, Japanese), with structured lyric tags for verse/chorus/bridge that give producers actual compositional control, not just vibes prompting
- CC BY 4.0 license and local deployment mean no per-generation fees; the community has already shipped UI wrappers, and WaveSpeed added a hosted API at $0.05/gen within days of launch
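The PER figures quoted in the analysis can be restated as relative error reductions, which makes the size of the lead easier to read. This is a small arithmetic sketch using only the three percentages from this card; the helper name is illustrative.

```python
# Phoneme error rates (%) as quoted in the analysis.
PER = {"SongGeneration v2": 8.55, "Mureka v8": 9.96, "Suno v5": 12.4}


def relative_reduction(baseline, candidate):
    """Percent by which `candidate` lowers the error rate relative to `baseline`."""
    return (baseline - candidate) / baseline * 100


ours = PER["SongGeneration v2"]
for name, value in PER.items():
    if name != "SongGeneration v2":
        print(f"vs {name}: {relative_reduction(value, ours):.1f}% fewer phoneme errors")
```

On these numbers the relative reduction is about 31.0% against Suno v5 and about 14.2% against Mureka v8.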
// TAGS
songgeneration, audio-gen, open-source, open-weights, llm, benchmark
DISCOVERED
29d ago
2026-03-14
PUBLISHED
31d ago
2026-03-11
RELEVANCE
8/10
AUTHOR
foldl-li