Search of Spoken Documents Retrieves Well Recognized Transcripts

Abstract

This paper presents a series of analyses and experiments on spoken document retrieval systems: search engines that retrieve transcripts produced by speech recognizers. Results show that transcripts that match a query well tend to be recognized more accurately than transcripts that match it less well. This effect was described in past literature; however, no study or explanation of it has been provided until now. This paper provides such an analysis, showing a relationship between word error rate and query length. It expands on past research by increasing the number of recognition systems tested and by demonstrating the effect in an operational speech retrieval system. Potential future lines of enquiry are also described.

DOI: 10.1007/978-3-540-71496-5_45

Cite this paper

@inproceedings{Sanderson2007SearchOS,
  title={Search of Spoken Documents Retrieves Well Recognized Transcripts},
  author={Mark Sanderson and Xiao Mang Shou},
  booktitle={ECIR},
  year={2007}
}