Corpus ID: 245853923

MR-SVS: Singing Voice Synthesis with Multi-Reference Encoder

Shoutong Wang, Jinglin Liu, Yi Ren, Zhen Wang, Changliang Xu, Zhou Zhao

Multi-speaker singing voice synthesis aims to generate singing voices for different speakers. To generalize to new speakers, previous zero-shot singing adaptation methods obtain the timbre of the target speaker as a fixed-size embedding from a single reference audio clip. However, they face several challenges: 1) the fixed-size speaker embedding is not powerful enough to capture the full details of the target timbre; 2) a single reference audio clip does not contain sufficient timbre information of the…


