TED-LIUM: an Automatic Speech Recognition dedicated corpus

@inproceedings{Rousseau2012TEDLIUMAA,
  title={TED-LIUM: an Automatic Speech Recognition dedicated corpus},
  author={Anthony Rousseau and Paul Del{\'e}glise and Yannick Est{\`e}ve},
  booktitle={LREC},
  year={2012}
}
This paper presents the corpus developed by the LIUM for Automatic Speech Recognition (ASR), based on the TED Talks. The corpus was built during the IWSLT 2011 Evaluation Campaign and comprises 118 hours of speech with accompanying automatically aligned transcripts. We describe the content of the corpus, how the data was collected and processed, how it will be made publicly available, and how we built an ASR system using this data, leading to a WER score of 17.4%. The official results we…
Highly Influential
This paper has highly influenced 11 other papers.
Highly Cited
This paper has 129 citations.

Citations

Publications citing this paper.

TEDxSK and JumpSK: A New Slovak Speech Recognition Dedicated Corpus

2018

Zero-Shot Learning for Speech Recognition with Universal Phonetic Model

2018

Building and using multimodal comparable corpora for machine translation

Natural Language Engineering • 2016

Scaling recurrent neural network language models

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2015

References

Publications referenced by this paper.

Quicknet

ICSI.
http://www.icsi.berkeley.edu/Speech/qn.html. • 2012

The MIT-LL/AFRL IWSLT-2011 MT System

A. Ryan Aminzadeh, Tim Anderson, +5 authors Terry Gleason.
Proceedings of the International Workshop on Spoken Language Translation (IWSLT), San Fran- • 2011

Optimizing bottle-neck features for LVCSR

2008 IEEE International Conference on Acoustics, Speech and Signal Processing • 2008

Construction automatique du vocabulaire d’un système de transcription

A. Allauzen, J.-L. Gauvain.
Journées d’Étude sur la Parole. • 2004