OPEN_SOURCE
YT · YOUTUBE // 14d ago · MODEL RELEASE
daVinci-MagiHuman drops open-source human video model
daVinci-MagiHuman is a 15B open-source audio-video foundation model for human-centric generation. It uses a single-stream Transformer to sync speech, facial performance, and body motion, and ships the full stack with multilingual support plus fast-inference tooling.
// ANALYSIS
This is the kind of release that makes human-video generation feel less like a demo and more like a system other teams can actually build on. The architecture is the real story: simplifying multimodal fusion may matter more than any single benchmark number.
- Single-stream self-attention removes cross-attention plumbing, which should make training and debugging simpler.
- The speed claims are real but hardware-bound: generating 5 seconds of 256p video in 2 seconds is strong, but 1080p still needs a much heavier second stage.
- Pairwise wins over Ovi 1.1 and LTX 2.3 suggest competitive quality, though the evaluation is still early and likely curated.
- Releasing the base, distilled, SR, and inference stack makes this much more useful to researchers than a paper-only announcement.
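To make the first point concrete: in a single-stream design, tokens from every modality are concatenated into one sequence and passed through ordinary self-attention, so audio-video alignment emerges from the same attention maps rather than from dedicated cross-attention layers. The NumPy sketch below is a toy illustration of that idea, not the model's actual code; the token counts, dimension, and the omission of projection matrices and multi-head structure are all simplifying assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, d):
    # toy single-head attention; real models add learned Q/K/V projections
    scores = x @ x.T / np.sqrt(d)
    return softmax(scores) @ x

rng = np.random.default_rng(0)
d = 8                                  # toy embedding dimension (assumption)
audio = rng.normal(size=(4, d))        # 4 audio tokens
video = rng.normal(size=(6, d))        # 6 video tokens

# Single-stream fusion: concatenate the modality tokens and run one joint
# self-attention pass. Every token attends to every other token, so
# audio-video interaction needs no separate cross-attention module.
stream = np.concatenate([audio, video], axis=0)
fused = self_attention(stream, d)
print(fused.shape)                     # one fused sequence: (10, 8)
```

The practical upside noted in the analysis is that there is only one attention code path to train, profile, and debug; the trade-off is that attention cost grows with the combined sequence length of all modalities.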
// TAGS
multimodal · video-gen · audio-gen · speech · open-source · davinci-magihuman
DISCOVERED
2026-03-28 (14d ago)
PUBLISHED
2026-03-28 (14d ago)
RELEVANCE
9/10
AUTHOR
Github Awesome