Get the latest tech news

OmniHuman: ByteDance’s new AI creates realistic videos from a single photo

ByteDance's new OmniHuman AI can turn a single photo into a realistic video of a person speaking, singing and moving naturally, trained on 18,700 hours of human motion data.

ByteDance researchers have developed an artificial intelligence system that transforms single photographs into realistic videos of people speaking, singing and moving naturally — a breakthrough that could reshape digital entertainment and communications. The new system, called OmniHuman, generates full-body videos showing people gesturing and moving in ways that match their speech, surpassing previous AI models that could only animate faces or upper bodies. Credit: ByteDance “Our key insight is that incorporating multiple conditioning signals, such as text, audio, and pose, during training can significantly reduce data wastage,” the research team explained.

Get the Android app

Or read this on Venture Beat