Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
@article{Fu2020BoostingOS, title={Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing}, author={S. Fu and Chien-Feng Liao and Tsun-An Hsieh and Kuo-Hsuan Hung and S. Wang and Cheng Yu and Heng-Cheng Kuo and Ryandhimas E. Zezario and Y. Li and Shang-Yi Chuang and Y. Lu and Y. Tsao}, journal={ArXiv}, year={2020}, volume={abs/2006.10296} }
The Transformer architecture has shown its superior ability than recurrent neural networks on many different natural language processing applications. Therefore, this study applies a modified Transformer on the speech enhancement task. Specifically, the positional encoding may not be necessary and hence is replaced by convolutional layers. To further improve PESQ scores of enhanced speech, the L_1 pre-trained Transformer is fine-tuned by MetricGAN framework. The proposed MetricGAN can be… CONTINUE READING
Figures, Tables, and Topics from this paper
One Citation
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application
- Computer Science, Engineering
- ArXiv
- 2020
- PDF
References
SHOWING 1-10 OF 24 REFERENCES
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR
- Computer Science
- LVA/ICA
- 2015
- 349
- PDF
Multiple-target deep learning for LSTM-RNN based speech enhancement
- Computer Science
- 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA)
- 2017
- 85
- PDF
T-GSA: Transformer with Gaussian-Weighted Self-Attention for Speech Enhancement
- Computer Science, Engineering
- ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2020
- 22
- PDF
Stable Training of Dnn for Speech Enhancement Based on Perceptually-Motivated Black-Box Cost Function
- Computer Science, Engineering
- ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2020
- 7
- Highly Influential
- PDF
A Regression Approach to Speech Enhancement Based on Deep Neural Networks
- Computer Science
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- 2015
- 716
- PDF
Weighted Speech Distortion Losses for Neural-Network-Based Real-Time Speech Enhancement
- Computer Science, Engineering
- ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2020
- 24
- PDF
Speech Enhancement Using Self-Adaptation and Multi-Head Self-Attention
- Computer Science, Engineering
- ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2020
- 18
- PDF
Time-Frequency Masking-Based Speech Enhancement Using Generative Adversarial Network
- Computer Science
- 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2018
- 60
- PDF
Learning With Learned Loss Function: Speech Enhancement With Quality-Net to Improve Perceptual Evaluation of Speech Quality
- Computer Science, Engineering
- IEEE Signal Processing Letters
- 2020
- 9
- PDF