Deep attractor network for single-microphone speaker separation

  title={Deep attractor network for single-microphone speaker separation},
  author={Zhuo Chen and Yi Luo and Nima Mesgarani},
  journal={2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  • Zhuo Chen, Yi Luo, Nima Mesgarani
  • Published 2017
  • Computer Science, Medicine
  • 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • Despite the overwhelming success of deep learning in various speech processing tasks, the problem of separating simultaneous speakers in a mixture remains challenging. [...] Key Method Attractor points in this study are created by finding the centroids of the sources in the embedding space, which are subsequently used to determine the similarity of each bin in the mixture to each source. The network is then trained to minimize the reconstruction error of each source by optimizing the embeddings.Expand Abstract
    245 Citations
    Speaker-Independent Speech Separation With Deep Attractor Network
    • 128
    • PDF
    Online Deep Attractor Network for Real-time Single-channel Speech Separation
    • C. Han, Yi Luo, Nima Mesgarani
    • Computer Science
    • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    • 2019
    • 7
    • PDF
    Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
    • 45
    • PDF
    Listening to Each Speaker One by One with Recurrent Selective Hearing Networks
    • 34
    • Highly Influenced
    • PDF
    Speaker Attractor Network: Generalizing Speech Separation to Unseen Numbers of Sources
    • Highly Influenced
    Cracking the cocktail party problem by multi-beam deep attractor network
    • Zhuo Chen, J. Li, +4 authors Y. Gong
    • Computer Science, Engineering
    • 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
    • 2017
    • 24
    • PDF
    Deep Speech Denoising with Vector Space Projections
    • PDF
    Improved Source Counting and Separation for Monaural Mixture
    • 2
    • PDF
    Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation
    • 17
    • Highly Influenced
    • PDF
    Multi-channel Speech Separation Using Deep Embedding Model with Multilayer Bootstrap Networks
    • 1
    • PDF


    Deep clustering: Discriminative embeddings for segmentation and separation
    • 613
    • Highly Influential
    • PDF
    Single-Channel Multi-Speaker Separation Using Deep Clustering
    • 257
    • PDF
    Permutation invariant training of deep models for speaker-independent multi-talker speech separation
    • 339
    • Highly Influential
    • PDF
    The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition
    • 48
    • PDF
    Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks
    • 266
    • PDF
    Long short-term memory recurrent neural network architectures for large scale acoustic modeling
    • 1,607
    • PDF
    Speech enhancement based on deep denoising autoencoder
    • 487
    • PDF
    Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
    • 2,452
    • PDF
    Acoustic modelling with CD-CTC-SMBR LSTM RNNS
    • 101
    • PDF