Lexicon Design for Transcription of Spontaneous Voice Messages

@inproceedings{Gishri2010LexiconDF,
  title={Lexicon Design for Transcription of Spontaneous Voice Messages},
  author={Michal Gishri and Vered Silber-Varod and Ami Moyal},
  booktitle={LREC},
  year={2010}
}
Building a comprehensive pronunciation lexicon is a crucial element in the success of any speech recognition engine. The first stage of lexicon design involves the compilation of a comprehensive word list that keeps the Out-Of-Vocabulary (OOV) word rate to a minimum. The second stage involves providing optimized phonemic representations for all lexical items on the list. The research presented here focuses on the first stage of lexicon design – word list compilation, and describes the… CONTINUE READING

Figures, Results, and Topics from this paper.

Key Quantitative Results

  • The conclusion is for a design goal of 90% coverage of words in each domain.

References

Publications referenced by this paper.
SHOWING 1-10 OF 10 REFERENCES

The Talkbank Project

VIEW 13 EXCERPTS
HIGHLY INFLUENTIAL

Specification of corpora and word lists in 12 languages

U. Ziegenhain
  • Last retrieved March
  • 2004
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

Conversation SBCSAE Corpus

J. DuBois, R. Englebretson
  • Retrieved on November
  • 2004
VIEW 2 EXCERPTS

Voicemail Corpus - Part II (LDC2002S35). CD-ROM. Philadelphia: Linguistic Data Consortium

M Padmanabhan
  • 2002
VIEW 1 EXCERPT

Automatic transcription of voicemail at AT&T

  • 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
  • 2001
VIEW 1 EXCERPT

Voicemail Corpus - Part I (LDC98S77). CD-ROM. Philadelphia: Linguistic Data Consortium

M Padmanabhan
  • 1998
VIEW 1 EXCERPT

On designing pronunciation lexicons for large vocabulary continuous speech recognition

  • Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
  • 1996
VIEW 2 EXCERPTS