The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News

@inproceedings{Estve2010TheEC,
  title={The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News},
  author={Yannick Est{\`e}ve and Thierry Bazillon and Jean-Yves Antoine and Fr{\'e}d{\'e}ric B{\'e}chet and J{\'e}r{\^o}me Farinas},
  booktitle={LREC},
  year={2010}
}
This paper presents the EPAC corpus, which is composed of 100 hours of manually transcribed conversational speech and of the outputs of automatic tools (automatic segmentation, transcription, POS tagging, etc.) applied to the entire French ESTER 1 audio corpus: about 1700 hours of audio recordings from radio shows. The corpus was built during the EPAC project, funded by the French Research Agency (ANR) from 2007 to 2010. This corpus significantly increases the amount…
This paper has 45 citations.

Key Quantitative Results

  • For example, on the EPAC test data set, our ASR system yields a word error rate of 17.25%.
