Get the latest tech news

OmniHuman: ByteDance’s new AI creates realistic videos from a single photo


ByteDance's new OmniHuman AI can turn a single photo into a realistic video of a person speaking, singing and moving naturally, trained on 18,700 hours of human motion data.

ByteDance researchers have developed an artificial intelligence system that transforms single photographs into realistic videos of people speaking, singing and moving naturally — a breakthrough that could reshape digital entertainment and communications. The new system, called OmniHuman, generates full-body videos showing people gesturing and moving in ways that match their speech, surpassing previous AI models that could only animate faces or upper bodies. Credit: ByteDance “Our key insight is that incorporating multiple conditioning signals, such as text, audio, and pose, during training can significantly reduce data wastage,” the research team explained.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of new ai

new ai

Photo of single photo

single photo

Photo of ByteDance

ByteDance

Related news:

News photo

New AI picks up 97% of lung diseases, and can tell pneumonia from COVID-19

News photo

Trae: AI Code Editor from ByteDance

News photo

JetBrains launches Junie, a new AI coding agent for its IDEs