Get the latest tech news

Microsoft's VASA-1 is a new AI model that turns photos into 'talking faces'

Impressive lip-syncing

A new AI research paper from Microsoft promises a future where you can upload a photo, a sample of your voice and create a live, animated talking head of your own face. Similar lip sync and head movement technology is already available from Runway and Nvidia but this seems to be of a much higher quality and realism, reducing mouth artifacts. The model also seems to have a high degree of control, capable of taking eye gaze direction, head distance and even emotion as an input to steer the generation.

Get the Android app

Or read this on r/technology