Automatic detection of sentence boundaries and disfluencies based on recognized words

  title={Automatic detection of sentence boundaries and disfluencies based on recognized words},
  author={Andreas Stolcke and Elizabeth Shriberg and Rebecca A. Bates and Mari Ostendorf and Dilek Z. Hakkani-T{\"u}r and Madelaine Plauch{\'e} and G{\"o}khan T{\"u}r and Yu Lu},
We study the problem of detecting linguistic events at interword boundaries, such as sentence boundaries and disfluency locations, in speech transcribed by an automatic recognizer. Recovering such events is crucial to facilitate sp eech understanding and other natural language processing tasks. Our approach is based on a combination of prosodic cues modeled by decision trees, and word-based event N-gram language models. Several model combination approaches are investigated. The techniques are… CONTINUE READING
Highly Cited
This paper has 135 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 84 extracted citations

Resource-limited sentence boundary detection

View 7 Excerpts
Highly Influenced

Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches

2016 IEEE Spoken Language Technology Workshop (SLT) • 2016
View 4 Excerpts
Highly Influenced

Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2017
View 2 Excerpts

136 Citations

Citations per Year
Semantic Scholar estimates that this publication has 136 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 16 references

et al

M. Metee
Dysfluency annotation stylebook for the Switchboard corpus. Distributed by LDC,, • 1995
View 3 Excerpts
Highly Influenced

, and A . Stolcke . A prosody - only decision - tree model for disfluency detection

R. Bates Shriberg


M. Mast, R. Kompe, S. Harbeck, A. Kießling, H. Niemann
N ̈ oth, E. G. Schukat-Talamazzini, and V. Warnke. Dialog act classification with the help of prosody. In H. T. Bunnell and W. Idsardi, editors, Proc. ICSLP, vol. 3, pp. 1732–1735, Philadelphia, • 1996
View 2 Excerpts

SWITCHBOARD: Telephone speech corpus for research and development

J. J. Godfrey, E. C. Holliman, J. McDaniel
In Proc. ICASSP, vol. 1, pp. 517–520, San Francisco, • 1992
View 1 Excerpt

Similar Papers

Loading similar papers…