Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis

@inproceedings{Yoshimura1999SimultaneousMO,
  title={Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis},
  author={Takayoshi Yoshimura and Keiichi Tokuda and Takashi Masuko and Takao Kobayashi and Tadashi Kitamura},
  booktitle={EUROSPEECH},
  year={1999}
}
In this paper, we describe an HMM-based speech synthesis system in which spectrum, pitch and state duration are modeled simultaneously in a unified framework of HMM. In the system, pitch and state duration are modeled by multi-space probability distribution HMMs and multi-dimensional Gaussian distributions, respectively. The distributions for spectral parameter, pitch parameter and the state duration are clustered independently by using a decision-tree based context clustering technique… CONTINUE READING
Highly Influential
This paper has highly influenced 66 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 867 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 559 extracted citations

868 Citations

050100'99'03'08'13'18
Citations per Year
Semantic Scholar estimates that this publication has 868 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…