Get the latest tech news

Microsoft's VASA-1 is a new AI model that turns photos into 'talking faces'


Impressive lip-syncing

A new AI research paper from Microsoft promises a future where you can upload a photo, a sample of your voice and create a live, animated talking head of your own face. Similar lip sync and head movement technology is already available from Runway and Nvidia but this seems to be of a much higher quality and realism, reducing mouth artifacts. The model also seems to have a high degree of control, capable of taking eye gaze direction, head distance and even emotion as an input to steer the generation.

Get the Android app

Or read this on r/technology

Read more on:

Photo of Microsoft

Microsoft

Photo of Photos

Photos

Photo of faces

faces

Related news:

News photo

Microsoft aims to triple datacenter capacity to fuel AI boom

News photo

Microsoft shows off VASA-1, an AI framework that makes human headshots talk, sing

News photo

October 2025 will be a support massacre for a bunch of Microsoft products