- Published 1996 in ICSLP

This report describes a method for estimating the separation degree at the bunsetsu boundary (SD) for Japanese text-to-speech synthesis. Our method gives us the prosodic symbol without using complicated linguistic analysis. First we classify bunsetsus according to the nal morpheme. Each classi ed bunsetsu has a temporary separation degree in advance. We call this \the estimated separation degree" (ESD). ESD is derived from the SD's statistical tendency regarding each bunsetsu. The SD is decided by rules that correct the ESD as an initial degree. Correction rules are constructed by comparing the ESD, and the SD is observed from natural speech to cancel the frequently occurring mismatches. An absolute evaluation test of ve grades was performed upon 300 sentences with prosodic symbols given by our method. As a result, the ratio of \Natural" and \Somewhat unnatural but tolerable" exceeded 2/3. The proportion of \Serious error" was less than 10%, thus giving us satisfactory results.

@inproceedings{Magata1996AMF,
title={A method for estimating prosodic symbol from text for Japanese text-to-speech synthesis},
author={Ken-ichi Magata and Tomoki Hamagami and Mitsuo Komura},
booktitle={ICSLP},
year={1996}
}