Exploiting prosodic and lexical features for tone modeling in a conditional random field framework

@article{Wei2008ExploitingPA,
  title={Exploiting prosodic and lexical features for tone modeling in a conditional random field framework},
  author={Hongxiu Wei and Xinhao Wang and Hao Wu and Dingsheng Luo and Meihong Wu},
  journal={2008 IEEE International Conference on Acoustics, Speech and Signal Processing},
  year={2008},
  pages={4549--4552}
}
Tonal cues play an important role in distinguishing ambiguous words in Mandarin speech recognition. This paper explores an innovative tone modeling framework using prosodic and lexical features, as well as syllable context information. A discriminative model, namely a Conditional Random Field (CRF), is adopted, which is sufficiently flexible to handle multiple interacting features and long-range dependencies of observations. After the first-pass search of a recognition system, the CRF-based…
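
For readers unfamiliar with the model named in the abstract, the standard linear-chain CRF can be written as below. This is the textbook form, not necessarily the exact parameterization used in the paper. The symbols are assumptions for illustration: y_t is the tone label of syllable t, x is the full observation sequence (for example prosodic and lexical features), f_k are feature functions, and λ_k are their learned weights.

```latex
p(\mathbf{y} \mid \mathbf{x})
  = \frac{1}{Z(\mathbf{x})}
    \exp\!\Big( \sum_{t=1}^{T} \sum_{k} \lambda_k \, f_k(y_{t-1}, y_t, \mathbf{x}, t) \Big),
\qquad
Z(\mathbf{x})
  = \sum_{\mathbf{y}'}
    \exp\!\Big( \sum_{t=1}^{T} \sum_{k} \lambda_k \, f_k(y'_{t-1}, y'_t, \mathbf{x}, t) \Big)
```

Because each f_k may inspect the whole sequence x at any position t, overlapping features and observations far from the current syllable can be combined without independence assumptions, which is the flexibility the abstract refers to.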

Key Quantitative Results

  • Experimental results show that the tonal cues yield relative character error rate reductions of 7.8% and 8.6% on two widely used Mandarin speech recognition tasks, the Hub-4 test and the 863 test.
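
The figures above are relative, not absolute, reductions. As a minimal sketch, assuming the usual definition (baseline minus new, divided by baseline), the calculation looks like this. The function name and the error rates in the example are hypothetical; the baseline character error rates are not given here.

```python
def relative_reduction(baseline_cer: float, new_cer: float) -> float:
    """Return the relative reduction in character error rate (CER).

    Both arguments are CERs in the same unit (e.g. percent).
    """
    if baseline_cer <= 0:
        raise ValueError("baseline_cer must be positive")
    return (baseline_cer - new_cer) / baseline_cer


# Hypothetical numbers chosen only to illustrate the arithmetic;
# they are NOT results reported for the paper.
hypothetical_baseline = 20.0   # CER in percent without tone modeling
hypothetical_with_tone = 18.44  # CER in percent with tone modeling

print(f"{relative_reduction(hypothetical_baseline, hypothetical_with_tone):.1%}")
```

With these made-up inputs, an absolute drop of 1.56 points corresponds to a 7.8% relative reduction, which shows why the two kinds of figure should not be confused.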
