HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

@inproceedings{Su2020HiFiGANHD,
  title={HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks},
  author={Jiaqi Su and Zeyu Jin and A. Finkelstein},
  booktitle={INTERSPEECH},
  year={2020}
}
Real-world audio recordings are often degraded by factors such as noise, reverberation, and equalization distortion. This paper introduces HiFi-GAN, a deep learning method to transform recorded speech to sound as though it had been recorded in a studio. We use an end-to-end feed-forward WaveNet architecture, trained with multi-scale adversarial discriminators in both the time domain and the time-frequency domain. It relies on the deep feature matching losses of the discriminators to improve the… Expand
13 Citations
Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement
  • 2
  • Highly Influenced
  • PDF
Context-Aware Prosody Correction for Text-Based Speech Editing
  • PDF
CDPAM: Contrastive learning for perceptual audio similarity
  • PDF
Enhancing Low-Quality Voice Recordings Using Disentangled Channel Factor and Neural Waveform Model
  • PDF
High Fidelity Speech Regeneration with Application to Speech Enhancement
  • 1
  • PDF
It$\hat{\text{o}}$TTS and It$\hat{\text{o}}$Wave: Linear Stochastic Differential Equation Is All You Need For Audio Generation
  • Shoule Wu, Ziqiang Shi
  • Computer Science, Engineering
  • 2021
  • PDF
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
  • PDF
Restoring degraded speech via a modified diffusion model
  • PDF
The Multi-speaker Multi-style Voice Cloning Challenge 2021
  • 3
  • PDF
All for One and One for All: Improving Music Separation by Bridging Networks
  • PDF
...
1
2
...

References

SHOWING 1-10 OF 43 REFERENCES
High Fidelity Speech Synthesis with Adversarial Networks
  • 57
  • PDF
SEGAN: Speech Enhancement Generative Adversarial Network
  • 549
  • PDF
Towards Generalized Speech Enhancement with Generative Adversarial Networks
  • 10
  • PDF
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
  • 107
  • PDF
Speech Denoising with Deep Feature Losses
  • 77
  • PDF
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis
  • 151
  • PDF
Perceptually-motivated Environment-specific Speech Enhancement
  • Jiaqi Su, A. Finkelstein, Zeyu Jin
  • Computer Science
  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2019
  • 5
  • PDF
Learning Spectral Mapping for Speech Dereverberation and Denoising
  • 108
  • PDF
Improving GANs for Speech Enhancement
  • 15
  • PDF
Data Augmentation and Deep Convolutional Neural Networks for Blind Room Acoustic Parameter Estimation
  • 4
  • PDF
...
1
2
3
4
5
...