Segmental Modeling for Audio Segmentation

  title={Segmental Modeling for Audio Segmentation},
  author={Hagai Aronowitz},
  journal={2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07},
Trainable speech/non-speech segmentation and music detection algorithms usually consist of a frame based scoring phase combined with a smoothing phase. This paper suggests a framework in which both phases are explicitly unified in a segment based classifier. We suggest a novel segment based generative model in which audio segments are modeled as supervectors and each class (speech, silence, music) is modeled by a distribution over the supervector space. Segmental speech classes can then be… CONTINUE READING

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 11 extracted citations


Publications referenced by this paper.
Showing 1-10 of 14 references

The MIT Lincoln laboratory RT-04F diarization systems: applications to broadcast news and telephone conversations

  • D. A. Reynolds, P. Torres-Carrasquillo
  • in Proc. RT-04F Workshop,
  • 2004
Highly Influential
5 Excerpts

Modeling intraspeaker variability for speaker recognition "

  • D. Irony H. Aronowitz, D. Burshtein
  • 2005

Similar Papers

Loading similar papers…