Fast speaker diarization using a high-level scripting language

@article{Gonina2011FastSD,
  title={Fast speaker diarization using a high-level scripting language},
  author={Ekaterina Gonina and Gerald Friedland and Henry Cook and Kurt Keutzer},
  journal={2011 IEEE Workshop on Automatic Speech Recognition & Understanding},
  year={2011},
  pages={553-558}
}
Most current speaker diarization systems use agglomerative clustering of Gaussian Mixture Models (GMMs) to determine “who spoke when” in an audio recording. While state-of-the-art in accuracy, this method is computationally costly, mostly due to the GMM training, and thus limits the performance of current approaches to be roughly real-time. Increased sizes of current datasets require processing of hundreds of hours of data and thus make more efficient processing methods highly desirable. With… CONTINUE READING

References

Publications referenced by this paper.
SHOWING 1-10 OF 26 REFERENCES

Similar Papers

Loading similar papers…