Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training

Abstract

Audiobook data is a freely available source of rich expressive speech data. To accurately generate speech of this form, expressiveness must be incorporated into the synthesis system. This paper investigates two parts of this process: the representation of expressive information in a statistical parametric speech synthesis system; and whether discrete…

Cite this paper

@inproceedings{Chen2012ExploringRE,
  title={Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training},
  author={Langzhou Chen and Mark J. F. Gales and Vincent Wan and Javier Latorre and Masami Akamine},
  booktitle={INTERSPEECH},
  year={2012}
}