Minimum error rate training for phrasing in speech synthesis

Abstract

Phrase break prediction models in speech synthesis are classifiers that predict whether or not each word boundary is a prosodic break. These classifiers are generally trained to optimize the likelihood of prediction, and their performance is evaluated in terms of classification accuracy. We propose a minimum error rate training method for phrase break…

3 Figures and Tables
