• Corpus ID: 331207

The HMM-based speech synthesis system (HTS) version 2.0

@inproceedings{Zen2007TheHS,
  title={The HMM-based speech synthesis system (HTS) version 2.0},
  author={Heiga Zen and Takashi Nose and Junichi Yamagishi and Shinji Sako and Takashi Masuko and Alan W. Black and Keiichi Tokuda},
  booktitle={SSW},
  year={2007}
}
A statistical parametric speech synthesis system based on hidden Markov models (HMMs) has grown in popularity over the last few years. This system simultaneously models spectrum, excitation, and duration of speech using context-dependent HMMs and generates speech waveforms from the HMMs themselves. Since December 2002, we have publicly released an open-source software toolkit named HMM-based speech synthesis system (HTS) to provide a research and development platform for the speech synthesis… 
Development of an HMM-based speech synthesis system for Indian English language
TLDR
Experimental evaluation depicts that the developed text-to-speech system is capable of producing adequately natural speech in terms of intelligibility and intonation.
Hidden Markov Model based Speech Synthesis: A Review
TLDR
This paper reviews recent research advances in field of speech synthesis with related to statistical parametric approach to speech synthesis based on HMM, which finds the prosodic characteristics of the voice can be modified by simply varying the HMM parameters, thus reducing the large storage requirement.
FLEXIBLE HARMONIC / STOCHASTIC MODELING FOR HMM-BASED SPEECH SYNTHESIS
In this paper the preliminary results, of a new approach on speech modeling for statistical parametric HMM-based speech synthesis are presented. The proposed system is based on a flexible
Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis
TLDR
A speaker-adaptive HMM-based speech synthesis system that employs speaker adaptation, feature-space adaptive training, mixed-gender modeling, and full-covariance modeling using CSMAPLR transforms, in addition to several other techniques that have proved effective in previous systems are described.
Investigating the effect of speech features and the number of HMM mixtures in the quality HMM-based synthesizers
TLDR
The HMM-based speech synthesis system is described and applies to Arabic language using small size training speech database as an example, and it is shown that the resulting model database has the advantage of being small (can be less than 1MB).
Performance evaluation of the speaker-independent HMM-based speech synthesis system “HTS 2007” for the Blizzard Challenge 2007
TLDR
This paper describes a speaker-independent/adaptive HMM-based speech synthesis system developed for the Blizzard Challenge 2007, which employs speaker adaptation, feature-space adaptive training, mixed-gender modeling, and full-covariance modeling using CSMAPLR transforms, in addition to several other techniques that have proved effective in previous systems.
Evaluation of Finnish unit selection and HMM-based speech synthesis
TLDR
This work combines HMM-based signal generation with the front end originally designed for unit selection based Finnish TTS and the prosody of the output generated by the two synthesis techniques using the same speech database is evaluated.
Speech synthesis using articulatory-knowledge based HMM structure
TLDR
The proposed HMM structure is proposed to model the context-dependent spectral characteristics of a speech unit in order to improve synthetic speech fluency and reduce the huge amount of context combinations based on the articulatory knowledge of phonemes.
Hidden Markov Model ( HMM ) based Speech Synthesis for Urdu Language
This paper describes the development of HMM based speech synthesizer for Urdu language using the HTStoolkit. It describes the modifications needed to original HTS-Demo-scripts to port them, for Urdu
Arabic HMM-based speech synthesis
  • K. M. Khalil, Cherif Adnan
  • Computer Science
    2013 International Conference on Electrical Engineering and Software Applications
  • 2013
TLDR
The developed model improves the speech synthesis, naturalness and intelligibility quality in the Arabic language environment and is possible to play on the HMM parameters, change the producer voice characteristics.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 121 REFERENCES
A Hidden Semi-Markov Model-Based Speech Synthesis System
TLDR
Subjective listening test results show that use of HSMMs improves the reported naturalness of synthesized Speech Synthesis, which can be viewed as an HMM with explicit state duration PDFs.
Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
TLDR
It is demonstrated that a few sentences uttered by a target speaker are sufficient to adapt not only voice characteristics but also prosodic features, and synthetic speech generated from adapted models using only four sentences is very close to that from speaker dependent models trained using 450 sentences.
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
TLDR
An HMM-based speech synthesis system in which spectrum, pitch and state duration are modeled simultaneously in a unified framework of HMM is described.
Voice characteristics conversion for HMM-based speech synthesis system
TLDR
To transform the voice characteristics of synthesized speech to the target speaker, the maximum a posteriori estimation and vector field smoothing (MAP/VFS) algorithm was applied to the phoneme HMMs.
Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005
TLDR
The technical details, building processes, and performance of the basic HMM-based speech synthesis system, and new features integrated into Nitech-HTS 2005 such as STRAIGHT-based vocoding, HSMM- based acoustic modeling, and a speech parameter generation algorithm considering GV are described.
HSMM-Based Model Adaptation Algorithms for Average-Voice-Based Speech Synthesis
TLDR
Several speaker adaptation algorithms and MAP modification are described to develop consistent method for synthesizing speech in a unified way for arbitrary amount of the speech data.
Implementation and evaluation of an HMM-based Thai speech synthesis system
TLDR
The evaluation of the synthesized speech shows that tone correctness is significantly improved in some clustering styles, and the implemented system gives the better reproduction of prosody (or naturalness, in some sense) than the unit-selection-based system with the same speech database.
Speech parameter generation algorithms for HMM-based speech synthesis
This paper derives a speech parameter generation algorithm for HMM-based speech synthesis, in which the speech parameter sequence is generated from HMMs whose observation vector consists of a
Duration modeling for HMM-based speech synthesis
TLDR
This paper takes account of contextual factors such as stressrelated factors and locational factors in addition to phone identity factors to synthesize good quality speech with natural timing and the speaking rate can be varied easily.
HMM-based Trainable Speech Synthesis for Chinese
TLDR
A two-level based model is introduced for duration modeling and prediction, and the duration prediction RMSE was improved from 29.56ms to 27.01ms in order to improve the rhythm of synthetic speech.
...
1
2
3
4
5
...