OpenMOSS unveils MOVA for synced video-audio generation
YOUTUBE // 37d ago // MODEL RELEASE


OpenMOSS has released MOVA, an open-source Mixture-of-Experts (MoE) model with 32B total parameters, 18B of them active, that generates video and audio together, including lip-synced speech and aligned sound effects. The release includes a technical report plus open code, weights, and inference workflows, positioning MOVA as a rare open alternative to closed audiovisual generators.

// ANALYSIS

MOVA is a notable open release in multimodal generation, especially for teams that need controllable, end-to-end audio-video output rather than black-box APIs.

  • Joint video-audio generation in one pass can reduce sync drift and pipeline complexity versus cascaded setups.
  • Open weights and training/inference code make MOVA useful for research replication and downstream customization.
  • Separate 360p and 720p checkpoints give developers a practical quality-versus-compute trade-off.
  • The project directly targets a gap where most top-tier synchronized AV systems remain closed.
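
To put rough numbers on the quality-versus-compute point, here is a minimal back-of-envelope sketch. Only the 32B/18B parameter counts and the "360p"/"720p" labels come from the summary above. The 16:9 frame sizes and the 2-bytes-per-parameter (bf16) storage are assumptions made for illustration, and the helper names are hypothetical, not part of the MOVA codebase.

```python
# Back-of-envelope figures for the numbers quoted in the summary.
# Parameter counts come from the summary; everything else is an assumption.

TOTAL_PARAMS_B = 32   # total parameters, in billions (from the summary)
ACTIVE_PARAMS_B = 18  # active parameters, in billions (from the summary)

# ASSUMPTION: standard 16:9 frame sizes. The summary only says "360p" and "720p".
FRAME_SIZES = {"360p": (640, 360), "720p": (1280, 720)}


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the model's parameters that are active."""
    return active_b / total_b


def pixel_ratio(high: str, low: str) -> float:
    """How many times more pixels per frame `high` has than `low`."""
    high_w, high_h = FRAME_SIZES[high]
    low_w, low_h = FRAME_SIZES[low]
    return (high_w * high_h) / (low_w * low_h)


def weight_memory_gb(params_b: float, bytes_per_param: int = 2) -> float:
    """Approximate weight storage in GB (1e9 bytes).

    ASSUMPTION: 2 bytes per parameter (bf16/fp16) and all experts resident
    in memory; the real footprint depends on the released checkpoint format.
    """
    return params_b * bytes_per_param


if __name__ == "__main__":
    print(f"active fraction: {active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B):.4f}")
    print(f"720p vs 360p pixels per frame: {pixel_ratio('720p', '360p'):.1f}x")
    print(f"weights at 2 bytes/param: ~{weight_memory_gb(TOTAL_PARAMS_B):.0f} GB")
```

Under these assumptions, a 720p frame carries four times the pixels of a 360p frame, which is why the lower-resolution checkpoint is the cheaper starting point. Actual latency and memory use depend on details the summary does not give, such as clip length and how video is compressed before generation.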

// TAGS

mova multimodal video-gen audio-gen open-source open-weights research

DISCOVERED

37d ago

2026-03-05

PUBLISHED

37d ago

2026-03-05

RELEVANCE

9/10

AUTHOR

AI Search