viralamo

Menu
  • Technology
  • Science
  • Money
  • Culturs
  • Trending
  • Video

Subscribe To Our Website To Receive The Last Stories

Join Us Now For Free
Home
Technology
Google’s SoundFilter AI separates any sound or voice from mixed-audio recordings
Technology

Google’s SoundFilter AI separates any sound or voice from mixed-audio recordings

11/11/2020

Researchers at Google claim to have developed a machine learning model that can separate a sound source from noisy, single-channel audio based on only a short sample of the target source. In a paper, they say their SoundFilter system can be tuned to filter arbitrary sound sources, even those it hasn’t seen during training.

The researchers believe a noise-eliminating system like SoundFilter could be used to create a range of useful technologies. For instance, Google drew on audio from thousands of its own meetings and YouTube videos to train the noise-canceling algorithm in Google Meet. Meanwhile, a team of Carnegie Mellon researchers created a “sound-action-vision” corpus to anticipate where objects will move when subjected to physical force.

SoundFilter treats the task of sound separation as a one-shot learning problem. The model receives as input the audio mixture to be filtered and a single short example of the kind of sound to be filtered out. Once trained, SoundFilter is expected to extract this kind of sound from the mixture if present.

SoundFilter adopts what’s known as a wave-to-wave neural network architecture that can be trained using audio samples without requiring labels that denote the type of source. A conditioning encoder takes the conditioning audio and computes the corresponding embedding (i.e., numerical representation), while a conditional generator takes the mixture audio and the conditioning embedding as input and produces the filtered output. The system assumes that the original audio collection consists of many clips a few seconds in length that contain the same type of sound for the whole duration. Beyond this, SoundFilter assumes that each such clip contains a single audio source, such as one speaker, one musical instrument, or one bird singing.

The model is trained to produce the target audio, given the mixture and the conditioning audio as inputs. A SoundFilter training example consists of three parts.:

  1. The target audio, which contains only one sound
  2. A mixture, which contains two different sounds, one of which is the target audio
  3. A conditioning audio signal, which is another example containing the same kind of sound as the target audio

In experiments, the researchers trained SoundFilter on two open source datasets: FSD50L (a collection of over 50,000 sounds) and LibriSpeech (around 1,000 hours of English speech). They report that the conditioning encoder learned to produce embeddings that represent the acoustic characteristics of the conditioning audio, enabling SoundFilter to successfully separate voices from mixtures of speakers, sounds from mixtures of sounds, and individual speakers/sounds from mixtures of speakers and sounds.

Here’s one sample before SoundFilter processed it:


https://venturebeat.com/wp-content/uploads/2020/11/download-1.wav

Here’s the sample post-processing:

https://venturebeat.com/wp-content/uploads/2020/11/download.wav

Here’s another sample:

https://venturebeat.com/wp-content/uploads/2020/11/download-6.wav

And here’s the post-processed result:

https://venturebeat.com/wp-content/uploads/2020/11/download-7.wav

“Our work could be extended by exploring how to use the embedding learned as part of SoundFilter as a representation for an audio event classifier,” the researchers wrote. “In addition, it would be of interest to extend our approach from one-shot to many-shot.”


How startups are scaling communication:

The pandemic is making startups take a close look at ramping up their communication solutions. Learn how


Source link

Share
Tweet
Pinterest
Linkedin
Stumble
Google+
Email
Prev Article
Next Article

Related Articles

Here’s what the ‘new normal’ remote sales stack looks like
There has been a sea change of late in B2B …

Here’s what the ‘new normal’ remote sales stack looks like

Google’s MixIT AI isolates speakers in audio recordings
Google today released MinDiff, a new framework for mitigating (but …

Google’s MinDiff aims to mitigate unfair biases in classifiers

Leave a Reply Cancel reply

Find us on Facebook

Related Posts

  • Stanford researchers propose AI in-home system that can monitor for coronavirus symptoms
    Stanford researchers propose AI in-home system that …
    06/04/2020
  • PlayStation 5 gets Godfall looter-slasher from Gearbox Publishing
    Amazon asks all employees to work from …
    13/03/2020
  • Black Friday 2019: The best AI smartphones
    Black Friday 2019: The best AI smartphones
    27/11/2019
  • AMD reveals Ryzen 3 chips that open Zen 2 up to budget gaming PCs
    AMD reveals Ryzen 3 chips that open …
    21/04/2020
  • Boeing’s Starliner crew spacecraft nails desert landing, a first for a U.S.-made, human-rated capsule – TechCrunch
    Boeing’s Starliner crew spacecraft nails desert landing, …
    22/12/2019

Popular Posts

  • DDoSers are abusing Microsoft RDP to make attacks more powerful
    DDoSers are abusing Microsoft RDP to make …
    23/01/2021 0
  • Top 10 Amazing Actors Who Are Always …
    25/12/2020 0
  • 13 acquisitions highlight Big Tech’s AI talent grab in 2020
    13 acquisitions highlight Big Tech’s AI talent …
    25/12/2020 0
  • The Last of Us Part II takes Game of the Year at The Game Awards
    The DeanBeat: My favorite games of 2020
    26/12/2020 0
  • How to build tech products for a diverse user base
    How to build tech products for a …
    26/12/2020 0

viralamo

Pages

  • Contact Us
  • Privacy Policy
Copyright © 2021 viralamo
Theme by MyThemeShop.com

Ad Blocker Detected

Our website is made possible by displaying online advertisements to our visitors. Please consider supporting us by disabling your ad blocker.

Refresh
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.I AgreePrivacy policy