• Corpus ID: 239016583

NN3A: Neural Network supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications

  title={NN3A: Neural Network supported Acoustic Echo Cancellation, Noise Suppression and Automatic Gain Control for Real-Time Communications},
  author={Ziteng Wang and Yueyue Na and Biao Tian and Qiang Fu},
  • Ziteng Wang, Yueyue Na, +1 author Qiang Fu
  • Published 16 October 2021
  • Computer Science, Engineering
  • ArXiv
Acoustic echo cancellation (AEC), noise suppression (NS) and automatic gain control (AGC) are three often required modules for real-time communications (RTC). This paper proposes a neural network supported algorithm for RTC, namely NN3A, which incorporates an adaptive filter and a multi-task model for residual echo suppression, noise reduction and near-end speech activity detection. The proposed algorithm is shown to outperform both a method using separate models and an end-to-end alternative… 

Figures and Tables from this paper


Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System
Deep neural network (DNN)-based approaches to acoustic echo cancellation (AEC) and hybrid speech enhancement systems have gained increasing attention recently, introducing significant performance
Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On Percepnet
This work proposes a system combining a traditional acoustic echo canceller, and a low-complexity joint residual echo and noise suppressor based on a hybrid signal processing/deep neural network (DSP/DNN) approach that outperforms both traditional and other neural approaches while requiring only 5.5% CPU for real-time operation.
Deep Learning for Joint Acoustic Echo and Noise Cancellation with Nonlinear Distortions
A causal system to address acoustic echo and noise cancellation jointly as deep learning based speech separation, which incorporates a convolutional recurrent network (CRN) and a recurrent network with long short-term memory (LSTM).
Multiple-Input Neural Network-Based Residual Echo Suppression
A neural network-based approach that directly estimates the RES filter coefficients from multiple inputs, including the A EC output, the far-end speech, and/or the echo computed by the AEC is proposed.
Deep Residual Echo Suppression With A Tunable Tradeoff Between Signal Distortion And Echo Suppression
  • Amir Ivry, I. Cohen, Baruch Berdugo
  • Computer Science, Engineering
    ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2021
A residual echo suppression method using a UNet neural network that directly maps the outputs of a linear acoustic echo canceler to the desired signal in the spectral domain and embeds a design parameter that allows a tunable tradeoff between the desired-signal distortion and residual echo suppressed in double-talk scenarios.
Weighted Recursive Least Square Filter and Neural Network Based Residual ECHO Suppression for the AEC-Challenge
This paper presents a real-time Acoustic Echo Cancellation (AEC) algorithm submitted to the AEC-Challenge. The algorithm consists of three modules: Generalized Cross-Correlation with PHAse Transform
An Experimental Study on Speech Enhancement Based on Deep Neural Networks
This letter presents a regression-based speech enhancement framework using deep neural networks (DNNs) with a multiple-layer deep architecture that tends to achieve significant improvements in terms of various objective quality measures.
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks
A phase-sensitive objective function based on the signal-to-noise ratio (SNR) of the reconstructed signal is developed, and it is shown that in experiments it yields uniformly better results in terms of signal- to-distortion ratio (SDR).
CAD-AEC: Context-Aware Deep Acoustic Echo Cancellation
Experiments show that the proposed CAD-AEC can robustly achieve better echo return loss enhancement (ERLE) and perceptual speech quality compared to the previous classical and deep learning techniques.
Complex Ratio Masking for Monaural Speech Separation
The proposed approach improves over other methods when evaluated with several objective metrics, including the perceptual evaluation of speech quality (PESQ), and a listening test where subjects prefer the proposed approach with at least a 69% rate.