The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability

@article{Pitt2005TheBC,
  title={The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability},
  author={M. Pitt and K. Johnson and E. Hume and Scott F. Kiesling and W. Raymond},
  journal={Speech Commun.},
  year={2005},
  volume={45},
  pages={89-95}
}
  • M. Pitt, K. Johnson, +2 authors W. Raymond
  • Published 2005
  • Computer Science
  • Speech Commun.
  • This paper describes the Buckeye corpus of spontaneous American English speech, a 307,000-word corpus containing the speech of 40 talkers from central Ohio, USA. The method used to elicit and record the speech is described, followed by a description of the protocol that was developed to phonemically label what talkers said. The results of a test of labeling consistency are then presented. The corpus will be made available to the scientific community when labeling is completed. 
    213 Citations

    Figures, Tables, and Topics from this paper

    Explore Further: Topics Discussed in This Paper

    The buckeye corpus of speech: updates and enhancements
    • 6
    • PDF
    The Korean Corpus of Spontaneous Speech
    • 13
    • PDF
    GRASS: the Graz corpus of Read And Spontaneous Speech
    • 12
    • PDF
    The “Up” corpus: A corpus of speech samples across adulthood
    • 11
    • PDF
    SpiCE: A New Open-Access Corpus of Conversational Bilingual Speech in Cantonese and English
    • 1
    • Highly Influenced
    • PDF
    Phone Boundary Annotation in Conversational Speech
    • 5
    • PDF

    References

    SHOWING 1-10 OF 49 REFERENCES
    Phonetic analyses of word and segment variation using the TIMIT corpus of American english
    • 59
    • PDF
    Transcribing phonetic detail in the speech of unintelligible children: a comparison of procedures.
    • 50
    Informal speech : alphabetic & phonemic texts with statistical analyses and tables
    • 60
    Reduction of English function words in switchboard
    • 91
    • PDF
    Preliminary results on speaker-dependent variation in the TIMIT database.
    • D. Byrd
    • Computer Science, Medicine
    • The Journal of the Acoustical Society of America
    • 1992
    • 91
    • PDF
    Reliability studies in broad and narrow phonetic transcription
    • 243
    • Highly Influential
    An introduction to the pronunciation of English
    • 929
    • PDF
    An acoustic‐phonetic data base
    • 84
    Estimating the quality of phonetic transcriptions and segmentations of speech signals
    • 65
    • PDF
    Listening to spoken English
    • 386