Get the latest tech news

Microsoft's VASA-1 Can Deepfake a Person With One Photo and One Audio Track


Microsoft Research Asia earlier this week unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. ArsTechnica: In the future, it could power virtual avatars that render locally and don't require video ...

Microsoft Research Asia earlier this week unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track. ArsTechnica: In the future, it could power virtual avatars that render locally and don't require video feeds -- or allow anyone with similar tools to take a photo of a person found online and make them appear to say whatever they want. The VASA framework (short for "Visual Affective Skills Animator") uses machine learning to analyze a static image along with a speech audio clip.

Get the Android app

Or read this on Slashdot

Read more on:

Photo of Microsoft

Microsoft

Photo of person

person

Photo of photo

photo

Related news:

News photo

Microsoft’s new VASA-1 AI model can turn photos into ‘talking faces’

News photo

Microsoft's VASA-1 can deepfake a person with one photo and one audio track

News photo

Microsoft Does Not Want You To Use iPerf3 To Measure Network Performance on Windows