Deep attractor network for single-microphone speaker separation
@article{Chen2017DeepAN,
  title={Deep attractor network for single-microphone speaker separation},
  author={Zhuo Chen and Yi Luo and Nima Mesgarani},
  journal={2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2017},
  pages={246--250}
}
Despite the overwhelming success of deep learning in various speech processing tasks, the problem of separating simultaneous speakers in a mixture remains challenging. [...] Attractor points in this study are created by finding the centroids of the sources in the embedding space, which are subsequently used to determine the similarity of each bin in the mixture to each source. The network is then trained to minimize the reconstruction error of each source by optimizing the embeddings.
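The attractor step described in the abstract can be illustrated with a small NumPy sketch. This is only a sketch under stated assumptions, not the authors' implementation: the function names `attractor_masks` and `reconstruction_error` are hypothetical, the embeddings are taken as given (in the paper they come from a trained network), similarity is assumed to be a dot product, and a softmax over sources is assumed for turning similarities into masks.

```python
import numpy as np

def attractor_masks(embeddings, assignments):
    """Form attractors as per-source centroids and derive soft masks.

    embeddings:  (n_bins, emb_dim) embedding of each time-frequency bin
    assignments: (n_bins, n_sources) one-hot source membership per bin
    """
    # Centroid of the embeddings assigned to each source: (n_sources, emb_dim)
    counts = assignments.sum(axis=0)
    attractors = (assignments.T @ embeddings) / np.maximum(counts, 1e-8)[:, None]

    # Similarity of every bin to every attractor (assumed: dot product)
    similarity = embeddings @ attractors.T

    # Assumed: softmax over sources turns similarities into masks
    shifted = similarity - similarity.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    masks = exp / exp.sum(axis=1, keepdims=True)
    return attractors, masks

def reconstruction_error(mixture, sources, masks):
    """Mean squared error between masked mixture and the clean sources.

    mixture: (n_bins,) mixture magnitudes
    sources: (n_bins, n_sources) clean source magnitudes
    """
    estimates = masks * mixture[:, None]
    return float(np.mean((sources - estimates) ** 2))

# Toy example: four bins, two sources, two-dimensional embeddings
toy_embeddings = np.array([[1.0, 0.0], [3.0, 0.0], [0.0, 2.0], [0.0, 4.0]])
toy_assignments = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
toy_attractors, toy_masks = attractor_masks(toy_embeddings, toy_assignments)
```

In training, the error returned by `reconstruction_error` would be the quantity minimized with respect to the embeddings, which is what ties the embedding space to separation quality rather than to a clustering objective.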
245 Citations
Speaker-Independent Speech Separation With Deep Attractor Network
- Computer Science
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- 2018
- 128
Online Deep Attractor Network for Real-time Single-channel Speech Separation
- Computer Science
- ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2019
- 7
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
- Computer Science, Engineering
- INTERSPEECH
- 2018
- 45
Listening to Each Speaker One by One with Recurrent Selective Hearing Networks
- Computer Science
- 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2018
- 34
- Highly Influenced
Speaker Attractor Network: Generalizing Speech Separation to Unseen Numbers of Sources
- Computer Science
- IEEE Signal Processing Letters
- 2020
- Highly Influenced
Cracking the cocktail party problem by multi-beam deep attractor network
- Computer Science, Engineering
- 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
- 2017
- 24
Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation
- Computer Science
- 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2018
- 17
- Highly Influenced
Multi-channel Speech Separation Using Deep Embedding Model with Multilayer Bootstrap Networks
- Computer Science, Engineering
- ArXiv
- 2019
- 1
References
Deep clustering: Discriminative embeddings for segmentation and separation
- Computer Science, Mathematics
- 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2016
- 613
- Highly Influential
Single-Channel Multi-Speaker Separation Using Deep Clustering
- Computer Science, Mathematics
- INTERSPEECH
- 2016
- 257
Permutation invariant training of deep models for speaker-independent multi-talker speech separation
- Computer Science
- 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2017
- 339
- Highly Influential
The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition
- Computer Science
- 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
- 2015
- 48
Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks
- Computer Science
- IEEE Transactions on Multimedia
- 2015
- 266
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
- Computer Science
- INTERSPEECH
- 2014
- 1,607
Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
- Computer Science
- IEEE Transactions on Audio, Speech, and Language Processing
- 2012
- 2,452
Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks
- Computer Science
- INTERSPEECH
- 2015
- 101
Acoustic modelling with CD-CTC-SMBR LSTM RNNS
- Computer Science
- 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
- 2015
- 101