Matthias Eichner

Learn More
In the past, several approaches have been proposed for voice conversion in TTS systems. Mostly, conversion is done by modification of the spectral properties and pitch to match a certain target voice. This conversion causes distortions that deteriorate the quality of the synthesized speech. In this paper we investigate a very simple and straightforward(More)
Pronunciation dictionaries are the interface between ortho-graphic and phonetic representation of the speech signal and are thereby a substantial component of speech recognition systems. In many systems simple canonical pronunciation forms are used within the dictionary. They represent the " correct " pronunciation as they are found in lexicons and neither(More)
This paper presents: a multi-level concept to generate the speech rhythm in the Dresden TTS system for German (DreSS). The rhythm control includes the phrase, the syllabic and the phonemic level. The concept allows the alternative use of rule-based or statistical, but also data driven methods on these levels. To create the rules and to train a neural(More)
State-of-the-art speech synthesis systems achieve a high overall quality. However, synthesized speech still lacks naturalness. To produce more natural and colloquial synthetic speech, our research focuses on integration of effects present in spontaneous speech. Conventional speech synthesis systems do not consider the probability of a word in its context.(More)
With the progress of speech synthesis towards the development of complete TTS systems, the databases of speech synthesizers obtain more and more similarity with databases of speech recognizers. This offers new possibilities in combining systems for speech synthesis and recognition. In a new project, we are developing a speech dialogue system with the(More)
We investigate the deployment possibilities of speech enabled services in a Web based e-learning environment. The integration of speech technology is realized with a client/server architecture. Therefore, the services speech synthesis, speech recognition, and speaker verification are installed at a central SpeechServer. The client uses a Java applet(More)
In this paper we investigate measures for the evaluation of pronunciation dictionaries that can be used independently of the type of lexicon, the language, a specific recognizer and how the dictionary was generated. We will describe statistical measures, measures based on information theory and performance measures and give examples how these measures can(More)