Highly Accurate Mandarin Tone Classification In The Absence of Pitch Information
@inproceedings{Ryant2014HighlyAM, title={Highly Accurate Mandarin Tone Classification In The Absence of Pitch Information}, author={Neville Ryant and M. Slaney and M. Liberman and E. Shriberg and Jiahong Yuan}, year={2014} }
A deep neural network (DNN) classifier based only on 40 mel-frequency cepstral coefficients (MFCCs) achieved 29.99% frame error rate (FER) and 16.86% segment error rate (SER) in recognizing five tonal categories in Mandarin Chinese broadcast news. With the addition of subband autocorrelation change detection (SACD) pitch-class features [1], the classifier scored 27.58% FER and 15.56% SER. These results are substantially better than the best previously reported results on broadcast news tone… CONTINUE READING
20 Citations
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks
- Computer Science
- J. Signal Process. Syst.
- 2018
- 11
An Investigation of the Target Approximation Model for Tone Modeling and Recognition in Continuous Mandarin Speech
- Computer Science
- INTERSPEECH
- 2020
- PDF
Pitch Range Estimation with Multi features and MTL-DNN Model
- Computer Science
- 2018 14th IEEE International Conference on Signal Processing (ICSP)
- 2018
- 2
Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations
- Computer Science
- 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
- 2016
- 4
Tone Classification in Mandarin Chinese Using Convolutional Neural Networks
- Computer Science
- INTERSPEECH
- 2016
- 12
- PDF
Joint Gender-, Tone-, Vowel- Classification Via Novel Hierarchical Classification for Annotation of Monosyllabic Mandarin Word Tokens
- Computer Science
- 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2018
- 2
- PDF
Improving Mandarin Prosody Boundary Detection by Using Phonetic Information and Deep LSTM Model
- Computer Science
- 2019 International Conference on Asian Language Processing (IALP)
- 2019
The influence of pitch and noise on the discriminability of filterbank features
- Computer Science
- INTERSPEECH
- 2014
- 1
- PDF
References
SHOWING 1-10 OF 35 REFERENCES
Mandarin tone classification without pitch tracking
- Computer Science
- 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2014
- 24
- PDF
Improved tone modeling for Mandarin broadcast news speech recognition
- Computer Science
- INTERSPEECH
- 2006
- 77
- Highly Influential
- PDF
Pitch tracking and tone features for Mandarin speech recognition
- Computer Science
- 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)
- 2000
- 67
Decision tree based tone modeling for Chinese speech recognition
- Computer Science
- 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing
- 2004
- 20
Prediction of Fundamental Frequency and Voicing From Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction
- Computer Science
- IEEE Transactions on Audio, Speech, and Language Processing
- 2007
- 53
Tone and pitch accent classification using auditory attention cues
- Computer Science
- 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2011
- 11
- PDF
Pitch-gesture modeling using subband autocorrelation change detection
- Computer Science
- INTERSPEECH
- 2013
- 5
- PDF
Large vocabulary Mandarin speech recognition with different approaches in modeling tones
- Computer Science
- INTERSPEECH
- 2000
- 86
- PDF
Noise Robust Pitch Tracking by Subband Autocorrelation Classification
- Computer Science
- INTERSPEECH
- 2012
- 78
- PDF
Temporal and spectral cues in Mandarin tone recognition.
- Medicine
- The Journal of the Acoustical Society of America
- 2006
- 56
- PDF