BACK_TO_FEEDAICRIER_2
SongGeneration 2 beats Suno on lyric accuracy
OPEN_SOURCE ↗
REDDIT · REDDIT// 29d agoOPENSOURCE RELEASE

SongGeneration 2 beats Suno on lyric accuracy

Tencent AI Lab's SongGeneration v2 (LeVo 2) is an open-weight music generation model that produces full songs up to 4.5 minutes with vocals and full accompaniment from structured lyrics. It's the first open-source music AI to outperform Suno v5 on lyric accuracy, hitting 8.55% phoneme error rate against Suno's 12.4%, validated by 20 professional music evaluators.

// ANALYSIS

Open-source music AI just crossed the commercial quality threshold — and the gap to Suno and Udio is now measurable and real.

  • PER benchmark puts SongGeneration v2 ahead of every commercial competitor: 8.55% vs. Mureka v8's 9.96% and Suno v5's 12.4% — lyric hallucination has been the Achilles heel of open-source music AI, and DPO training on 200K preference pairs finally kills it
  • Hybrid LLM-Diffusion architecture uses dual parallel token streams (mixed + separated vocal/accompaniment) for structural coherence, then a diffusion renderer for high-fidelity audio — smart two-stage design borrowed from image gen playbook
  • Three model variants span 10–28GB VRAM, making it accessible across consumer and datacenter GPUs; the large model runs 0.82x real-time on an H20
  • Multilingual from day one (Chinese, English, Spanish, Japanese) with structured lyric tags for verse/chorus/bridge — gives producers actual compositional control, not just vibes prompting
  • CC BY 4.0 license and local deployment mean zero per-generation cost; community already shipped UI wrappers and WaveSpeed added hosted API at $0.05/gen within days of launch
// TAGS
songgenerationaudio-genopen-sourceopen-weightsllmbenchmark

DISCOVERED

29d ago

2026-03-14

PUBLISHED

31d ago

2026-03-11

RELEVANCE

8/ 10

AUTHOR

foldl-li