Harmonics Plus Noise Model Based Vocoder for Statistical Parametric Speech Synthesis

Daniel Erro, Iñaki Sainz, Eva Navas, Inma Hernáez
IEEE Journal of Selected Topics in Signal Processing
This article explores the potential of the harmonics plus noise model of speech in the development of a high-quality vocoder applicable in statistical frameworks, particularly in modern speech synthesizers. It presents an extensive explanation of all the different alternatives considered during the design of the HNM-based vocoder, together with the corresponding objective and subjective experiments, and a careful description of its implementation details. Three aspects of the analysis have been…


Publications citing this paper.

A straightforward method for calculating the voicing cut-off frequency for streaming HNM TTS

2015 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech) • 2015

Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra

IEEE Signal Processing Letters • 2014

A waveform representation framework for high-quality statistical parametric speech synthesis

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) • 2015

Vocaine the vocoder and applications in speech synthesis

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2015

A Log Domain Pulse Model for Parametric Speech Synthesis

IEEE/ACM Transactions on Audio, Speech, and Language Processing • 2018

Semantic Scholar estimates that this publication has 92 citations based on the available data.



Publications referenced by this paper.

Iterative Estimation of Sinusoidal Signal Parameters

IEEE Signal Processing Letters • 2010

Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification

Y. Stylianou
Ph.D. dissertation, École Nationale Supérieure des Télécommunications, Paris, France, 1996. • 1996

Aholab Speech Synthesizers for Albayzin 2010

I. Sainz, D. Erro, +5 authors I. Luengo
Proc. FALA, 2010, pp. 343–347. • 2010

MFCC+F0 extraction and waveform reconstruction using HNM: Preliminary results in an HMM-based synthesizer

D. Erro, I. Sainz, I. Saratxaga, E. Navas, I. Hernaez
Proc. FALA, 2010, pp. 29–32. • 2010

HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering

IEEE Transactions on Audio, Speech, and Language Processing • 2011
