Facebook AI’s Demucs teaches AI to hear in a more human-like way

Music source separation can be a tricky task for machines, while it’s easier for humans to distinguish the vocals, bass or drums. To help with this task, Facebook AI research scientist Alexandre Defossez has developed Demucs (deep extractor for music sources).

Spectrograms vs. waveforms

Most commonly, as Defossez points out, AI separates music sources by analyzing spectrograms. While this method is well suited for instruments that resonate on a single frequency, spectrogram-based methods have their weaknesses. For examples, saxophone and guitar frequencies may cancel each other out.

This is where Demucs comes into play—an AI-based waveform model that is designed to work in a similar way to how computer vision detects patterns in images. “It detects patterns in the waveforms and then adds higher-scale structure,” as Defossez explains. Or in other words: “Demucs can re-create the audio that it thinks is there but got lost in the mix.”

Defossez based Demucs on Wave-U-Net, an earlier AI-powered waveform model, and then went on to fine-tune his model. It now not only outperforms Wave-U-Net, but is also “‘way beyond’ state-of-the-art spectrograms.”

In the future, technology like Demucs may improve the abilities of AI assistants to hear voice commands in loud environments. Additionally, it could also be used for hearing aids or noise-canceling headphones.

Facebook AI’s Demucs teaches AI to hear in a more human-like way

SEE ALSO: Deep learning in 3D with Facebook AI’s new tool PyTorch3D

Spectrograms vs. waveforms

ML Conference – The Conference
for Machine Learning Innovation

Protecting AI Solutions From Attacks

AI for Decision Makers

Reinforcement Learning and Imitation Learning Workshop

SEE ALSO: Using AI for managing images and videos at scale

You may also like...

Random Post

Recent

Facebook AI’s Demucs teaches AI to hear in a more human-like way

SEE ALSO: Deep learning in 3D with Facebook AI’s new tool PyTorch3D

Spectrograms vs. waveforms

ML Conference – The Conference for Machine Learning Innovation

Protecting AI Solutions From Attacks

AI for Decision Makers

Reinforcement Learning and Imitation Learning Workshop

SEE ALSO: Using AI for managing images and videos at scale

You may also like...

The Best Books of 2019 So Far: Critical Linking, June 9, 2019

The Duke’s Daughter; and, The Fugitives; vol. 1/3 by Mrs. Oliphant

The Bedbug [1916] by C. L. Marlatt

Random Post

Recent

Tags

ML Conference – The Conference
for Machine Learning Innovation