Corpus ID: 46116832

Enhancing LSTM RNN-Based Speech Overlap Detection by Artificially Mixed Data

@inproceedings{Hagerer2017EnhancingLR,
  title={Enhancing LSTM RNN-Based Speech Overlap Detection by Artificially Mixed Data},
  author={Gerhard Hagerer and Vedhas Pandit and F. Eyben and B. Schuller},
  booktitle={Semantic Audio},
  year={2017}
}
  • Gerhard Hagerer, Vedhas Pandit, F. Eyben, B. Schuller
  • Published in Semantic Audio 2017
  • Computer Science
  • This paper presents a new method for Long Short-Term Memory Recurrent Neural Network (LSTM) based speech overlap detection. To this end, speech overlap data is created artificially by mixing large amounts of speech utterances. Our elaborate training strategies and presented network structures demonstrate performance surpassing the considered state-of-the-art overlap detectors. Thereby we target the full ternary task of non-speech, speech, and overlap detection. Furthermore, speakers’ gender is…
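The abstract's idea of creating overlap data by mixing utterances, labelled for the ternary task, might be sketched as follows. This is only an illustration under stated assumptions, not the authors' procedure: the function name `mix_with_labels`, the sample-level activity masks, the fixed sample offset, and the label encoding (0 = non-speech, 1 = speech, 2 = overlap) are all hypothetical choices that the abstract does not specify.

```python
import numpy as np

# Hypothetical label encoding for the ternary task named in the abstract.
NON_SPEECH, SPEECH, OVERLAP = 0, 1, 2


def mix_with_labels(utt_a, utt_b, offset, active_a, active_b):
    """Add utt_b onto utt_a starting at sample `offset` (assumed >= 0).

    active_a / active_b are boolean masks marking where each utterance
    contains speech. Returns the mixed signal and per-sample ternary labels.
    """
    length = max(len(utt_a), offset + len(utt_b))

    # Sum the two waveforms on a common time axis.
    mixture = np.zeros(length, dtype=np.float64)
    mixture[:len(utt_a)] += utt_a
    mixture[offset:offset + len(utt_b)] += utt_b

    # Count active speakers per sample: 0, 1, or 2.
    count = np.zeros(length, dtype=np.int64)
    count[:len(utt_a)] += active_a.astype(np.int64)
    count[offset:offset + len(utt_b)] += active_b.astype(np.int64)

    # With two sources the count already equals the ternary label;
    # the clip keeps the mapping valid if more sources were added.
    labels = np.minimum(count, OVERLAP)
    return mixture, labels
```

Because the labels follow directly from the known activity of each source utterance, a mixture of this kind needs no manual overlap annotation, which is presumably what makes large-scale artificial mixing attractive.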
    12 Citations
    • Overlap-Aware Diarization: Resegmentation Using Neural End-to-End Overlapped Speech Detection
    • CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning
    • Gender Classification Based on the Non-Lexical Cues of Emergency Calls with Recurrent Neural Networks (RNN)
    • DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs
    • Spherediar: An Effective Speaker Diarization System for Meeting Data
    • Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation
    • Robust Laughter Detection for Wearable Wellbeing Sensing
    • "Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers
    • A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits
