Pronunciation Modeling of Mandarin Casual Speech
@inproceedings{Fung2000PronunciationMO, title={Pronunciation Modeling of Mandarin Casual Speech}, author={Pascale Fung and William J. Byrne and Zheng Thomas and Teresa M. Kamm and Liu Yi and Song Zhanjiang and Veera Venkataramani and Umar Ruhi}, year={2000} }
Figures and Tables from this paper
table 1 figure 1 table 2 figure 2 table 3 figure 3 table 4 figure 4 table 5 figure 5 table 6 figure 6 table 7 figure 7 table 8 table 9 table 10 table 11 table 12 table 13 table 14 table 15 table 16 table 17 table 18 table 19 table 20 table 21 table 22 table 23 table 24 table 25 table 26 table 27 table 28 table 29 table 30 table 31 table 32 table 33 table 35 table 36 table 37 table 38 table 39
27 Citations
Model partial pronunciation variations for spontaneous Mandarin speech recognition
- Physics7th International Conference on Spoken Language Processing (ICSLP 2002)
- 2002
Modeling pronunciation variations is a critical part of spontaneous Mandarin speech recognition. Such variations include both complete changes and partial changes. Complete changes can usually be…
Pronunciation Modeling for Spontaneous Mandarin Speech Recognition
- PhysicsInt. J. Speech Technol.
- 2004
It is shown that partial changes are much less clear-cut than previously assumed and cannot be modelled by mere representation by alternate phone units and can be applied to any automatic speech recognition system based on subword units.
State-dependent phonetic tied mixtures with pronunciation modeling for spontaneous speech recognition
- PhysicsIEEE Transactions on Speech and Audio Processing
- 2004
A state-dependent phonetic tied-mixture model with variable codebook size that incorporates a state-level pronunciation model for better discrimination of phonetic and acoustic confusions, while reducing model complexity is proposed.
Modeling partial pronunciation variations for spontaneous Mandarin speech recognition
- PhysicsComput. Speech Lang.
- 2002
Towards Improved Assessment of Phonotactic Information for Automatic Language Identification
- Computer Science2006 IEEE Odyssey - The Speaker and Language Recognition Workshop
- 2006
This investigation makes use of the CallHome corpus, based on the premise it provides a better representation for the style of discourse and channel conditions encountered in the conversational telephone speech (CTS), which is now the focus of current NIST LID evaluations.
PARTIAL CHANGE ACCENT MODELS SPEECH RECOG
- Physics
- 2003
Regional accents in Mandarin speech result mostly from partial phone changes due to the interlanguage system of non-native speakers. We propose partial change accent models based on accent-specific…
Partial Change Phone Models for Pronunciation Variations in Spontaneous Mandarin Speech
- Physics
- 2002
The pre-trained acoustic model is reconstructed by sharing Gaussian mixtures between canonical phone models and partial change phone models at the state level and improves the resolution of the acoustic model to accommodate partial changes.
English-Chinese Name Machine Transliteration Using Search and Neural Network Models
- Computer Science
- 2018
It is found that search-based methods outperform deep learning ones, likely due to the relatively small number of English names with standard Chinese translations in the accessible dataset, and that incorporating syllable length heuristics and phonetic information into the search improves performance significantly.
Joint training methods for tandem and hybrid speech recognition systems using deep neural networks
- Business
- 2017
Cambridge International Scholarship, Cambridge Overseas Trust
Research funding, EPSRC Natural Speech Technology Project
Research funding, DARPA BOLT Program
Research funding, iARPA Babel Program
Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition
- Computer ScienceIEEE Transactions on Audio, Speech, and Language Processing
- 2013
The proposed DGMS framework is able to cover more multi-accent changes, thus reduce some performance loss in pruned beam search, without increasing the model size of the original acoustic model set.
20 References
Pronunciation modeling by sharing gaussian densities across phonetic models
- PhysicsEUROSPEECH
- 1999
The incorporation of pronunciation models into acoustic model training in addition to recognition is described, showing a 1.7 % improvement in recognition accuracy on the Switchboard corpus is presented.
Automatic Generation of Detailed Pronunciation Lexicons
- Linguistics
- 1996
This work explores different ways of “spelling” a word in a speech recognizer’s lexicon and how to obtain those spellings and describes how these different pronunciations are obtained from text-to-speech systems and from procedures that build decision trees trained on phonetically-labeled corpora.
A Status Report from WS97
- presented at IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA, USA, 1997.
- 1997
Pronunciation modeling for conversational speech recognition
- Physics
- 2001
This dissertation provides a fundamental and quantitative insight into pronunciation variability in spontaneous speech and demonstrates techniques for accommodating this variability within the framework of traditional automatic speech recognition systems that assume temporally non-overlapping phonetic segments.
Automatic Speech and Speaker Recognition: Advanced Topics
- Physics
- 1999
Automatic Speech and Speaker Recognition: Advanced Topics groups together in a single volume a number of important topics on speech and speaker recognition, topics which are of fundamental importance, but not yet covered in detail in existing textbooks.
Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling
- Computer Science2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)
- 2000
An accent adaptation approach using pronunciation variation modeling technology for the Mandarin accent was proposed in this paper and the syllable recognition error rate was reduced 15% by context-independent SPVD, and 20% bycontext-dependent SPVD.
An application of SAMPA-c for standard Chinese
- Computer ScienceINTERSPEECH
- 2000
The result shows that the labeling system presented is suitable for Standard Chinese and is used in two corpora labeling.
Japanese document recognition based on interpolated n-gram model of character
- Computer ScienceProceedings of 3rd International Conference on Document Analysis and Recognition
- 1995
A contextual postprocessing method using a trigram model of character for Japanese document recognition using a deleted interpolation method is described, and its advantage is revealed by practical experiments.
Statistically reliable deleted interpolation
- Computer ScienceIEEE Trans. Speech Audio Process.
- 1997
This work proposes a statistically reliable deleted interpolation (DI) approach that attempts to piecewise linearly approximate the interpolating weight curve based on some reasoning concerned with statistical reliability of sample-based estimates.
The phonetic labeling on read and spontaneous discourse corpora
- LinguisticsINTERSPEECH
- 2000
First the principles and conventions of transcription are presented, then these two speech styles are compared from phonetic and syntactic point of view, including the statistic results of different phonetic units got from the annotated corpora.