Alibaba HappyHorse 1.1 hits Replicate
Alibaba has launched its HappyHorse 1.1 video generation model on Replicate, enabling developers to run it via API. The model supports text-to-video, image animation, and up to nine reference images alongside unified audio synthesis and multilingual lip-sync.
Unified audio-video generation and reference-to-video capabilities are becoming the baseline for production-ready AI video tools. With HappyHorse 1.1 on Replicate, developers can reach both through a hosted API instead of running the model themselves.
- Unified A/V Synthesis: By generating synchronized dialogue, music, and ambient audio in a single pass, it eliminates the complex post-processing pipelines typically required with video-only models.
- Reference-Guided Consistency: Supporting up to nine reference images allows developers to maintain strict character and scene consistency across different generations, a major pain point in video editing.
- Lip-Sync Capabilities: Native support for multilingual lip-sync (across seven languages including English, Mandarin, and French) makes it highly suitable for global marketing and localized content workflows.
- Developer Accessibility: Launching on Replicate lowers the barrier to entry, letting teams integrate cinematic video generation into applications without managing complex GPU infrastructure.
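As a rough illustration of what "run it via API" could look like, here is a minimal sketch using Replicate's Python client (`replicate.run`). The model slug `alibaba/happyhorse-1.1` and the input field names `prompt`, `reference_images`, and `language` are assumptions made for illustration, not taken from the model's published schema; the model page on Replicate is the place to check the real identifiers. Only the nine-image limit comes from the summary above.

```python
from typing import Any, Dict, List, Optional

# Limit stated in the article: up to nine reference images per generation.
MAX_REFERENCE_IMAGES = 9

# Hypothetical model slug, used here for illustration only.
MODEL_REF = "alibaba/happyhorse-1.1"


def build_input(
    prompt: str,
    reference_images: Optional[List[str]] = None,
    language: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble an input payload; all field names are assumed, not official."""
    refs = list(reference_images or [])
    if len(refs) > MAX_REFERENCE_IMAGES:
        raise ValueError(
            f"at most {MAX_REFERENCE_IMAGES} reference images, got {len(refs)}"
        )
    payload: Dict[str, Any] = {"prompt": prompt}
    if refs:
        payload["reference_images"] = refs  # assumed field name
    if language:
        payload["language"] = language  # assumed field name for lip-sync language
    return payload


def generate(prompt: str, **kwargs: Any) -> Any:
    """Send the request to Replicate. Requires `pip install replicate` and
    the REPLICATE_API_TOKEN environment variable."""
    import replicate  # imported lazily so build_input works without the client

    return replicate.run(MODEL_REF, input=build_input(prompt, **kwargs))
```

Calling `generate("a chestnut horse galloping along a beach", reference_images=["https://example.com/horse.png"])` would submit one request. Keeping the payload builder separate from the network call lets the reference-image limit be checked locally before any paid request is made.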
DISCOVERED: 2026-06-25
PUBLISHED: 2026-06-22
AUTHOR: replicate