Multimodal Multi-Channel On-Line Speaker Diarization Using Sensor Fusion Through SVM

@article{Minotto2015MultimodalMO,
  title={Multimodal Multi-Channel On-Line Speaker Diarization Using Sensor Fusion Through SVM},
  author={Vicente P. Minotto and Cl{\'a}udio Rosito Jung and Bowon Lee},
  journal={IEEE Transactions on Multimedia},
  year={2015},
  volume={17},
  pages={1694-1705}
}
Speaker diarization (SD) is the process of assigning speech segments of an audio stream to its corresponding speakers, thus comprising the problem of voice activity detection (VAD), speaker labeling/identification, and often sound source localization (SSL). Most research activities in the past aimed towards applications as broadcast news, meetings, conversational telephony, and automatic multimodal data annotation, where SD may be performed off-line. However, a recent research focus is human… CONTINUE READING
10 Extracted Citations
70 Extracted References
Similar Papers

Citing Papers

Publications influenced by this paper.
Showing 1-10 of 10 extracted citations

Referenced Papers

Publications referenced by this paper.
Showing 1-10 of 70 references

Similar Papers

Loading similar papers…