Nvidia claims a new AI audio generator can make sounds never heard before

Nov 26, 2024 12:38 AM - 1 week ago 16068

Nvidia says its caller AI euphony editor tin create “sounds ne'er heard before” — for illustration a trumpet that meows. The tool, called Fugatto, is tin of generating music, sounds, and reside utilizing matter and audio inputs it’s ne'er been trained on.

As shown successful this video embedded below, this allows Fugatto to put together songs based connected chaotic prompts, for illustration “Create a saxophone howling, barking past physics euphony pinch dogs barking.”

Some different examples shared by the institution see the expertise to nutrient unsocial sound effects based connected a description, for illustration “Deep, rumbling bass pulses paired pinch intermittent, high-pitched integer chirps, for illustration the sound of a monolithic sentient instrumentality waking up.”

It tin moreover toggle shape the sound of someone’s voice, changing their accent aliases giving them a different tone, for illustration angry aliases calm. There are ways to edit music, too, arsenic Fugatto tin isolate the vocals successful a song, adhd instruments, and moreover alteration up a melody by swapping retired a soft for an opera singer.

A paper released pinch the announcement shows the agelong database of each the datasets Nvidia says Fugatto was trained on, 1 of which includes a room of sound effects from the BBC.

There are already respective different AI audio devices retired there, including those from Stability AI, OpenAI, Google DeepMind, ElevenLabs, and Adobe, but not ones claiming to create wholly caller and unheard-of sounds. Some AI startups are moreover facing copyright lawsuits complete their euphony creation tools, while a caller study recovered that Nvidia and different companies trained AI models on subtitles from thousands of YouTube videos.

To build Fugatto, Nvidia says researchers had to put together a dataset pinch millions of audio samples. They past created instructions “that considerably expanded the scope of tasks the exemplary could perform, while achieving much meticulous capacity and enabling caller tasks without requiring further data.” Nvidia doesn’t opportunity erstwhile — aliases if — the instrumentality will beryllium wide available.

More