Bilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcripts

@article{Batista2012BilingualEO,
  title={Bilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcripts},
  author={Fernando Batista and Helena Moniz and Isabel Trancoso and Nuno J. Mamede},
  journal={IEEE Transactions on Audio, Speech, and Language Processing},
  year={2012},
  volume={20},
  pages={474-485}
}
This paper focuses on the tasks of recovering capitalization and punctuation marks from texts without that information, such as spoken transcripts, produced by automatic speech recognition systems. These two practical rich transcription tasks were performed using the same discriminative approach, based on maximum entropy, suitable for on-the-fly usage. Reported experiments were conducted both over Portuguese and English broadcast news data. Both force aligned and automatic transcripts were used… CONTINUE READING

13 Figures & Tables

Topics

Statistics

05101520112012201320142015201620172018
Citations per Year

52 Citations

Semantic Scholar estimates that this publication has 52 citations based on the available data.

See our FAQ for additional information.