Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory

@article{Muramatsu2007VoiceCB,
  title={Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory},
  author={Tomoki Toda and Alan W. Black and Keiichi Tokuda},
  journal={IEEE Transactions on Audio, Speech, and Language Processing},
  year={2007},
  volume={15},
  pages={2222--2235}
}
In this paper, we describe a novel spectral conversion method for voice conversion (VC). A Gaussian mixture model (GMM) of the joint probability density of source and target features is employed for performing spectral conversion between speakers. The conventional method converts spectral parameters frame by frame based on the minimum mean square error. Although it is reasonably effective, the deterioration of speech quality is caused by some problems: 1) appropriate spectral movements are not…
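The conventional frame-by-frame conversion mentioned in the abstract can be illustrated with a minimal sketch. It is not the method proposed in the paper and not the authors' code. It assumes scalar (one-dimensional) source and target features so that it needs only the standard library, and every name in it (`gaussian_pdf`, `mmse_convert_frame`, `convert_sequence`, the dictionary keys) is hypothetical. Each frame is mapped to the conditional mean of the target feature under a joint-density GMM, which is the minimum mean square error estimate, and no information from neighbouring frames is used.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def mmse_convert_frame(x, components):
    """Convert one source frame x to the MMSE estimate of the target frame.

    components: list of dicts, one per mixture component of the joint GMM,
    with hypothetical keys:
      weight  mixture weight
      mu_x    source mean
      mu_y    target mean
      var_xx  source variance
      cov_yx  cross-covariance between target and source
    """
    # Weighted likelihood of x under each component's source marginal.
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mu_x"], c["var_xx"]) for c in components
    ]
    total = sum(likelihoods)  # assumed non-zero in this sketch

    y_hat = 0.0
    for lik, c in zip(likelihoods, components):
        posterior = lik / total  # P(component | x)
        # Conditional mean of y given x within this component.
        cond_mean = c["mu_y"] + c["cov_yx"] / c["var_xx"] * (x - c["mu_x"])
        y_hat += posterior * cond_mean
    return y_hat


def convert_sequence(xs, components):
    """Convert a sequence frame by frame; each frame is treated independently."""
    return [mmse_convert_frame(x, components) for x in xs]


if __name__ == "__main__":
    # Toy two-component model with made-up parameters.
    gmm = [
        {"weight": 0.5, "mu_x": -1.0, "mu_y": -3.0, "var_xx": 1.0, "cov_yx": 0.5},
        {"weight": 0.5, "mu_x": 1.0, "mu_y": 3.0, "var_xx": 1.0, "cov_yx": 0.5},
    ]
    print(convert_sequence([-1.0, 0.0, 1.0], gmm))
```

Because `convert_sequence` handles every frame in isolation, the converted sequence carries no constraint linking consecutive frames, which is the frame-by-frame property the abstract attributes to the conventional method.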
This paper has highly influenced 129 other papers.
This paper has 920 citations.

Citations

Publications citing this paper.

Sparse representation for frequency warping based voice conversion

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2015

Voice conversion using deep neural networks with speaker-independent pre-training

2014 IEEE Spoken Language Technology Workshop (SLT) • 2014

Vocalistener

IPSJ SIG Technical Report

Electrolaryngeal Speech Enhancement with Statistical Voice Conversion based on CLDNN

2018 26th European Signal Processing Conference (EUSIPCO) • 2018

[Figure: Citations per Year, 2010 to 2019]

Semantic Scholar estimates that this publication has 920 citations based on the available data.
