Scikit-learn: Machine Learning in Python
- Fabian Pedregosa, G. Varoquaux, E. Duchesnay
- Computer Science
- J. Mach. Learn. Res.
- 1 February 2011
Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing…
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
- Jonathan Shen, Ruoming Pang, Yonghui Wu
- Computer Science
- IEEE International Conference on Acoustics…
- 16 December 2017
This paper describes Tacotron 2, a neural network architecture for speech synthesis directly from text. The system is composed of a recurrent sequence-to-sequence feature prediction network that maps…
Tacotron: Towards End-to-End Speech Synthesis
- Yuxuan Wang, R. Skerry-Ryan, R. Saurous
- Computer Science
- INTERSPEECH
- 29 March 2017
CNN architectures for large-scale audio classification
- Shawn Hershey, S. Chaudhuri, K. Wilson
- Computer Science
- IEEE International Conference on Acoustics…
- 29 September 2016
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
- H. Zen, Viet-Trung Dang, Yonghui Wu
- Computer Science, Physics
- INTERSPEECH
- 5 April 2019
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
- Ye Jia, Yu Zhang, Yonghui Wu
- Computer Science
- NeurIPS
- 1 June 2018
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
- R. Skerry-Ryan, Eric Battenberg, R. Saurous
- Physics
- ICML
- 24 March 2018
Model-Based Expectation-Maximization Source Separation and Localization
- Michael I. Mandel, Ron J. Weiss, D. Ellis
- Physics
- IEEE Transactions on Audio, Speech, and Language…
- 1 February 2010
State-of-the-Art Speech Recognition with Sequence-to-Sequence Models
- C. Chiu, T. Sainath, M. Bacchiani
- Computer Science
- IEEE International Conference on Acoustics…
- 5 December 2017
Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
- Yuxuan Wang, R. Skerry-Ryan, R. Saurous
- Computer Science
- ArXiv
- 29 March 2017