OpenMOSS unveils MOVA for synced video-audio generation

// 128d ago · MODEL RELEASE

OpenMOSS released MOVA, an open-source 32B MoE (18B active) model for synchronized video and audio generation with lip-synced speech and aligned sound effects. The release includes a technical report plus open code, weights, and inference workflows, positioning MOVA as a rare open alternative to closed audiovisual generators.
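For a rough sense of what "32B MoE (18B active)" means in practice, the back-of-envelope sketch below computes the share of parameters used per forward pass and the memory needed just to hold the weights. The 2 bytes per parameter (bf16/fp16 storage) is an assumption, not something the release summary states, and the function name `moe_footprint` is purely illustrative. The estimate ignores activations and any auxiliary components such as text encoders or VAEs.

```python
def moe_footprint(total_params_b, active_params_b, bytes_per_param=2):
    """Rough MoE sizing from parameter counts given in billions.

    bytes_per_param=2 assumes bf16/fp16 weights (an assumption, not a
    figure from the MOVA release). Activations are not counted.
    """
    return {
        # Share of all parameters that take part in one forward pass.
        "active_fraction": active_params_b / total_params_b,
        # 1e9 params * bytes per param / 1e9 bytes per GB = billions * bytes.
        "weights_gb": total_params_b * bytes_per_param,
        "active_weights_gb": active_params_b * bytes_per_param,
    }


stats = moe_footprint(total_params_b=32, active_params_b=18)
print(stats)
```

Under these assumptions, all 32B parameters still have to be stored (about 64 GB at 2 bytes each), while per-step compute scales with the 18B active parameters, roughly 56% of the total.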

// ANALYSIS

This is a meaningful open-source milestone in multimodal generation, especially for teams that need controllable end-to-end audio-video outputs instead of black-box APIs.

– Joint video-audio generation in one pass can reduce sync drift and pipeline complexity versus cascaded setups.
– Open weights and training/inference code make MOVA useful for research replication and downstream customization.
– Support for 360p and 720p checkpoints gives developers a practical quality-vs-compute path.
– The project directly targets a gap where most top-tier synchronized AV systems remain closed.

// TAGS

mova, multimodal, video-gen, audio-gen, open-source, open-weights, research

DISCOVERED

128d ago

2026-03-05

PUBLISHED

128d ago

2026-03-05

RELEVANCE

9/10

AUTHOR

AI Search
