Statistical Voice Conversion with WaveNet-Based Waveform Generation

  title={Statistical Voice Conversion with WaveNet-Based Waveform Generation},
  author={Kazuhiro Kobayashi and Tomoki Hayashi and Akira Tamamori and Tomoki Toda},
This paper presents a statistical voice conversion (VC) technique with the WaveNet-based waveform generation. VC based on a Gaussian mixture model (GMM) makes it possible to convert the speaker identity of a source speaker into that of a target speaker. However, in the conventional vocoding process, various factors such as F0 extraction errors, parameterization errors and over-smoothing effects of converted feature trajectory cause the modeling errors of the speech waveform, which usually bring… CONTINUE READING
Highly Cited
This paper has 30 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 22 extracted citations


Publications referenced by this paper.
Showing 1-10 of 27 references

Speech waveform synthesis based on wavenet considering speech generation process

  • A. Tamamori, T. Hayashi, T. Toda, K. Takeda
  • IEICE Tech. Rep. SP2016-77 (Japanese edition), no…
  • 2016
2 Excerpts

Similar Papers

Loading similar papers…