Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition

  title={Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition},
  author={Marijn Huijbregts and Roeland Ordelman and Franciska de Jong},
This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data. 
Highly Cited
This paper has 82 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 56 extracted citations

82 Citations

Citations per Year
Semantic Scholar estimates that this publication has 82 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 18 references

Evaluation campaigns and TRECVid

Multimedia Information Retrieval • 2006
View 1 Excerpt

Huijbregts . The ami speaker diarization system for nist rt 06 s meeting data

D. A. van Leeuwen, H. M.A.
NIST Rich Transcription • 2006

BBN/LIMSI 10xRT English Broadcast News Transcription System

L. Nguyen, S. Abdou, +9 authors F. Lefevre
In Proc. DARPA RT04, • 2004
View 1 Excerpt

Dutch Speech Recognition in Multimedia Information Retrieval

Roeland Ordelman
PhD thesis, • 2003
View 1 Excerpt

Fast LM look-ahead for large vocabulary continuous speech recognition using perfect hashing

2002 IEEE International Conference on Acoustics, Speech, and Signal Processing • 2002
View 2 Excerpts

Similar Papers

Loading similar papers…