• Computer Science
  • Published 2015

Statistical parametric speech synthesis: from HMM to LSTM-RNN

@inproceedings{Zen2015StatisticalPS,
  title={Statistical parametric speech synthesis: from HMM to LSTM-RNN},
  author={Heiga Zen},
  year={2015}
}
Statistical parametric speech synthesis (SPSS) combines an acoustic model and a vocoder to render speech given a text. Typically decision tree-clustered context-dependent hidden Markov models (HMMs) are employed as the acoustic model, which represent a relationship between linguistic and acoustic features. Recently, artificial neural network-based acoustic models, such as deep neural networks, mixture density networks, and long short-term memory recurrent neural networks (LSTM-RNNs), showed… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 25 CITATIONS

Merlin: An Open Source Neural Network Speech Synthesis System

VIEW 6 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

A New GAN-based End-to-End TTS Training Algorithm

VIEW 1 EXCERPT
CITES METHODS

End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training

VIEW 2 EXCERPTS
CITES BACKGROUND

Sequence-to-sequence Modelling of F0 for Speech Emotion Conversion

VIEW 1 EXCERPT
CITES METHODS

Voice command generation using Progressive Wavegans

VIEW 1 EXCERPT
CITES BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-10 OF 88 REFERENCES

Statistical parametric speech synthesis using deep neural networks

VIEW 7 EXCERPTS

Product of Experts for Statistical Parametric Speech Synthesis

VIEW 5 EXCERPTS

Trainable speech synthesis with trended hidden Markov models

  • John Dines, Sridha Sridharan
  • Computer Science
  • 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
  • 2001
VIEW 16 EXCERPTS
HIGHLY INFLUENTIAL

Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis

  • Heiga Zen, Hasim Sak
  • Computer Science
  • 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2015
VIEW 5 EXCERPTS

A Tutorial on Hidden Markov Models and Selected Applications

VIEW 3 EXCERPTS
HIGHLY INFLUENTIAL