Author pages are created from data sourced from our academic publisher partnerships and public sources.
- Publications
- Influence
Share This Author
Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions
- Jonathan Shen, Ruoming Pang, Yonghui Wu
- Computer ScienceIEEE International Conference on Acoustics…
- 16 December 2017
This paper describes Tacotron 2, a neural network architecture for speech synthesis directly from text. The system is composed of a recurrent sequence-to-sequence feature prediction network that maps…
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
- Daniel S. Park, William Chan, Quoc V. Le
- Computer ScienceINTERSPEECH
- 19 April 2019
TLDR
Conformer: Convolution-augmented Transformer for Speech Recognition
- Anmol Gulati, James Qin, Ruoming Pang
- Computer ScienceINTERSPEECH
- 16 May 2020
TLDR
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
- H. Zen, Viet-Trung Dang, Yonghui Wu
- Computer Science, PhysicsINTERSPEECH
- 5 April 2019
TLDR
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
- Yuxuan Wang, Daisy Stanton, R. Saurous
- Computer ScienceICML
- 23 March 2018
TLDR
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
- Ye Jia, Yu Zhang, Yonghui Wu
- Computer ScienceNeurIPS
- 1 June 2018
TLDR
WaveGrad: Estimating Gradients for Waveform Generation
- Nanxin Chen, Yu Zhang, H. Zen, Ron J. Weiss, Mohammad Norouzi, William Chan
- Computer ScienceICLR
- 2 September 2020
TLDR
Training RNNs as Fast as CNNs
- Tao Lei, Yu Zhang, Yoav Artzi
- Computer ScienceEMNLP
- 8 September 2017
TLDR
Hierarchical Generative Modeling for Controllable Speech Synthesis
- Wei-Ning Hsu, Yu Zhang, Ruoming Pang
- Computer ScienceICLR
- 16 October 2018
TLDR
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
- Yu Zhang, Ron J. Weiss, B. Ramabhadran
- Linguistics, Computer ScienceINTERSPEECH
- 9 July 2019
TLDR
...
...