Get the latest tech news

Pushing the frontiers of audio generation


Our pioneering speech generation technologies are helping people around the world interact with more natural, conversational and intuitive digital assistants and AI tools.

Over the past few years, we’ve been pushing the frontiers of audio generation, developing models that can create high quality, natural speech from a range of inputs, like text, tempo controls and particular voices. This technology powers single-speaker audio in many Google products and experiments — including Gemini Live, Project Astra, Journey Voices and YouTube’s auto dubbing — and is helping people around the world interact with more natural, conversational and intuitive digital assistants and AI tools. Authors of this work: Zalán Borsos, Matt Sharifi, Brian McWilliams, Yunpeng Li, Damien Vincent, Félix de Chaumont Quitry, Martin Sundermeyer, Eugene Kharitonov, Alex Tudor, Victor Ungureanu, Sertan Girgin, Jonas Rothfuss, Jake Walker and Marco Tagliasacchi.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of audio generation

audio generation

Photo of Frontiers

Frontiers

Related news:

News photo

Try Avatar: Frontiers of Pandora for free right now

News photo

Bungie teases Destiny 2 codenamed ‘Frontiers’ for 2025

News photo

Game of the Week: Avatar: Frontiers of Pandora and true immersion in a game world