Get the latest tech news

Nvidia Fugatto: "World's Most Flexible Sound Machine"


Fugatto generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.

For example, it can create a music snippet based on a text prompt, remove or add instruments from an existing song, change the accent or emotion in a voice — even let people produce sounds never heard before. “We wanted to create a model that understands and generates sound like humans do,” said Rafael Valle, a manager of applied audio research at NVIDIA and one of the dozen-plus people behind Fugatto, as well as an orchestral conductor and composer. Plus, unlike most models, which can only recreate the training data they’ve been exposed to, Fugatto allows users to create soundscapes it’s never seen before, such as a thunderstorm easing into a dawn with the sound of birds singing.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of World

World

Photo of nvidia fugatto

nvidia fugatto

Related news:

News photo

Margrethe Vestager, the World’s Top Tech Cop, Is Making Her Exit

News photo

Chemists Create World's Thinnest Spaghetti

News photo

World Agrees on $300B Climate Aid Financial Deal - After COP29 Summit 'Nearly Implodes'