DiarTk : An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings

@inproceedings{Vijayasenan2012DiarTkA,
  title={DiarTk : An Open Source Toolkit for Research in Multistream Speaker Diarization and its Application to Meetings Recordings},
  author={Deepu Vijayasenan and Fabio Valente},
  booktitle={INTERSPEECH},
  year={2012}
}
The speaker diarization task consists of inferring “who spoke when” in an audio stream without any prior knowledge and has been the object of several NIST international evaluation campaigns in recent years. A common trend for improving performance has been the use of several different feature streams as diverse as speaker location features, visual features, or noise-robust acoustic features. This paper describes an open source toolkit released under the GPL license, aiming at facilitating research in…
This paper has 41 citations.

Citations

Publications citing this paper.

Enhancement and Analysis of Conversational Speech: JSALT 2017

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018

A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018

Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion

IEEE Transactions on Pattern Analysis and Machine Intelligence • 2018

Robust Feature Extraction from AD-HOC Microphones for Meeting Diarization

2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC) • 2018

References

Publications referenced by this paper.

An overview of automatic speaker diarization systems

IEEE Transactions on Audio, Speech, and Language Processing • 2006

Multimodal Speaker Diarization

IEEE Transactions on Pattern Analysis and Machine Intelligence • 2012

Speaker Diarization: A Review of Recent Research

IEEE Transactions on Audio, Speech, and Language Processing • 2012

Online diarization of streaming audio-visual data for smart environments

J. Schmalenstroeer, R. Haeb-Umbach
J. Sel. Topics Signal Processing, vol. 4, no. 5, pp. 845–856 • 2010

Using audio and visual cues for speaker diarisation initialisation

2010 IEEE International Conference on Acoustics, Speech and Signal Processing • 2010

An Information Theoretic Approach to Speaker Diarization of Meeting Data

IEEE Transactions on Audio, Speech, and Language Processing • 2009