OmniHuman: ByteDance’s new AI creates realistic videos from a single photo
ByteDance's new OmniHuman AI, trained on 18,700 hours of human motion data, can turn a single photo into a realistic video of a person speaking, singing and moving naturally.
ByteDance researchers have developed an artificial intelligence system that transforms single photographs into realistic videos of people speaking, singing and moving naturally, a breakthrough that could reshape digital entertainment and communications. The new system, called OmniHuman, generates full-body videos showing people gesturing and moving in ways that match their speech, surpassing previous AI models that could only animate faces or upper bodies.

“Our key insight is that incorporating multiple conditioning signals, such as text, audio, and pose, during training can significantly reduce data wastage,” the research team explained.
Or read this on Venture Beat