Corpus ID: 17856037

An analysis of transcription consistency in spontaneous speech from the Buckeye corpus

@inproceedings{Raymond2002AnAO,
  title={An analysis of transcription consistency in spontaneous speech from the {B}uckeye corpus},
  author={W. Raymond and M. Pitt and K. Johnson and E. Hume and Matthew J. Makashay and Robin Dautricourt and Craig Hilts},
  booktitle={INTERSPEECH},
  year={2002}
}
  • W. Raymond, M. Pitt, +4 authors Craig Hilts
  • Published in INTERSPEECH 2002
  • Computer Science
  • We present a preliminary analysis of transcriber consistency in labeling and segmentation of words and phones in the Buckeye corpus of spontaneous, informal speech. We find that pairwise inter-transcriber agreement on exact phone label match was 76%, and segmentation agreement within 20% of phone pair length was 75%, though longer phones are more consistently segmented than shorter phones. Patterns of consistency variation in labeling are observed as a function of phonetic categories that are…
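The two agreement figures in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure: it assumes the transcripts are already aligned phone by phone (equal-length label sequences), and it reads "within 20% of phone pair length" as "the two boundary placements differ by at most 20% of the combined duration of the two phones flanking the boundary". The function names and the toy data are hypothetical.

```python
from itertools import combinations


def label_agreement(a, b):
    """Fraction of aligned phone positions where two transcribers chose the same label.

    Assumes `a` and `b` are already aligned, so they have equal length.
    """
    if len(a) != len(b):
        raise ValueError("transcripts must be aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return matches / len(a)


def mean_pairwise_agreement(transcripts):
    """Average exact-label agreement over every pair of transcribers."""
    pairs = list(combinations(transcripts, 2))
    return sum(label_agreement(a, b) for a, b in pairs) / len(pairs)


def boundary_agrees(t_a, t_b, dur_left, dur_right, tolerance=0.20):
    """True if two boundary times differ by at most `tolerance` times the
    combined duration of the two flanking phones (an assumed reading of
    "phone pair length"). All values are in seconds.
    """
    return abs(t_a - t_b) <= tolerance * (dur_left + dur_right)


# Hypothetical toy data: three transcribers labeling the same five phones.
t1 = ["dh", "ah", "k", "ae", "t"]
t2 = ["dh", "ix", "k", "ae", "t"]
t3 = ["dh", "ah", "k", "eh", "t"]

print(mean_pairwise_agreement([t1, t2, t3]))
print(boundary_agrees(0.50, 0.52, dur_left=0.06, dur_right=0.08))
print(boundary_agrees(0.50, 0.54, dur_left=0.06, dur_right=0.08))
```

Scaling the tolerance by the flanking phone durations, rather than using a fixed number of milliseconds, means the same absolute disagreement counts as a mismatch for short phones and as a match for long ones, which is consistent with the abstract's observation that longer phones are segmented more consistently.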
    27 Citations

    Phone Boundary Annotation in Conversational Speech
    • 5
    A comparison of ASR and human errors for transcription of non-native spontaneous speech
    • 2
    A Taiwan Southern Min spontaneous speech corpus for discourse prosody
    • 2
    Assessing the accuracy of existing forced alignment software on varieties of British English
    • 3
    Fast transcription of unstructured audio
    Informal speech processes can be categorical in nature, even if they affect many different words.
    • 25
    • Highly Influenced

    References

    Estimating the quality of phonetic transcriptions and segmentations of speech signals
    • 65
    The ViC transcriber's manual: Guidelines for transferring, transcribing, and labeling sound files for the Buckeye corpus
    • 2000
    The Aligner user's guide. Entropic Research Laboratory
    • 1997
    Estimating the quality of phonetic transcriptions and segmentations of speech signals
    • Proceedings of ICSLP
    • 1996