Word embedding for recurrent neural network based TTS synthesis

@article{Wang2015WordEF,
  title={Word embedding for recurrent neural network based TTS synthesis},
  author={Peilu Wang and Yao Qian and Frank K. Soong and Lei He and Hai Zhao},
  journal={2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2015},
  pages={4879-4883}
}
The current state of the art TTS synthesis can produce synthesized speech with highly decent quality if rich segmental and suprasegmental information are given. However, some suprasegmental features, e.g., Tone and Break (TOBI), are time consuming due to being manually labeled with a high inconsistency among different annotators. In this paper, we investigate the use of word embedding, which represents word with low dimensional continuous-valued vector and being assumed to carry a certain… CONTINUE READING
Highly Cited
This paper has 53 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 29 extracted citations

Representing raw linguistic information in chinese text-to-speech system

2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) • 2017
View 2 Excerpts

54 Citations

010203020152016201720182019
Citations per Year
Semantic Scholar estimates that this publication has 54 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-10 of 23 references

Natural Language Processing (almost) from Scratch

Journal of Machine Learning Research • 2011
View 5 Excerpts
Highly Influenced

Neural net word representations for phrase-break prediction without a part of speech tagger

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2014
View 2 Excerpts

On the training aspects of Deep Neural Network (DNN) for parametric TTS synthesis

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2014
View 1 Excerpt

Statistical parametric speech synthesis using deep neural networks

2013 IEEE International Conference on Acoustics, Speech and Signal Processing • 2013
View 1 Excerpt