Multi-Site Data Collection for a Spoken Language Corpus

  title={Multi-Site Data Collection for a Spoken Language Corpus},
  author={Lynette Hirschman},
This paper describes a recently collected spoken language corpus for the ATIS (Air Travel Information System) domain. This data collection effort has been co-ordinated by MADCOW (Multi-site ATIS Data COllection Working group). We summarize the motivation for this effort, the goals, the implementation of a multi-site data collection paradigm, and the accomplishments of MADCOW in monitoring the collection and distribution of 12,000 utterances of spontaneous speech from five sites for use in a… CONTINUE READING


Publications citing this paper.


Publications referenced by this paper.
Showing 1-10 of 15 references

DARPA ATIS Benchmark Test Results Summary,

  • D Pallett
  • "February
  • 1992
2 Excerpts

Evaluation of the CMU ATIS system

  • W. Ward
  • Proc. Fourth DARPA Speech and Language Workshop,
  • 1991
1 Excerpt

Podlozny, "A Template Marcher for Robust NL Interpretation,

  • E. Jackson, D. Appelt, J. Bear, A. R. Moore
  • Proc. DARPA Speech and Natural Language Workshop,
  • 1991
1 Excerpt

Similar Papers

Loading similar papers…