• Corpus ID: 18328317

# Joint Estimation of Reverberation Time and Direct-to-Reverberation Ratio from Speech using Auditory-Inspired Features

@article{Xiong2015JointEO,
title={Joint Estimation of Reverberation Time and Direct-to-Reverberation Ratio from Speech using Auditory-Inspired Features},
author={Feifei Xiong and Stefan Goetze and Bernd T. Meyer},
journal={ArXiv},
year={2015},
volume={abs/1510.04620}
}
• Published 15 October 2015
• Computer Science
• ArXiv
Blind estimation of acoustic room parameters such as the reverberation time $T_\mathrm{60}$ and the direct-to-reverberation ratio ($\mathrm{DRR}$) is still a challenging task, especially in case of blind estimation from reverberant speech signals. In this work, a novel approach is proposed for joint estimation of $T_\mathrm{60}$ and $\mathrm{DRR}$ from wideband speech in noisy conditions. 2D Gabor filters arranged in a filterbank are exploited for extracting features, which are then used as…
17 Citations

Joint Estimation of Reverberation Time and Early-To-Late Reverberation Ratio From Single-Channel Speech Signals
• Physics
IEEE/ACM Transactions on Audio, Speech, and Language Processing
• 2019
From state-of-the-art algorithms that were tested in the acoustic characterization of environments challenge, jROPE achieves comparable results among the best for all individual tasks (RT and ELR estimation from full-band and sub-band signals).
Single-Channel Blind Direct-to-Reverberation Ratio Estimation Using Masking
• Computer Science
INTERSPEECH
• 2020
Single-channel DRR estimation is formulated as an extraction task of two signal components from the recorded audio and outperforms state-of-the-art single- and multi-channel methods on the ACE challenge data corpus.
Exploring Auditory-Inspired Acoustic Features for Room Acoustic Parameter Estimation From Monaural Speech
• Physics
IEEE/ACM Transactions on Audio, Speech, and Language Processing
• 2018
Compared to state-of-the-art algorithms that were tested in the acoustic characterisation of environments (ACE) challenge, the ROPE model is the only one that is among the best for all individual tasks (RT and ELR estimation from fullband and subband signals).
Estimation of Room Acoustic Parameters: The ACE Challenge
• Physics
IEEE/ACM Transactions on Audio, Speech, and Language Processing
• 2016
The acoustic characterization of environments (ACE) challenge showed that T60 estimation is a mature field where analytical approaches dominate whilst DRR estimation is one of the less mature fields where machine learning approaches are currently more successful.
Binaural Direct-to-Reverberant Energy Ratio and Speaker Distance Estimation
• Physics
IEEE/ACM Transactions on Audio, Speech, and Language Processing
• 2020
Two novel approaches to estimate the direct-to-reverberant energy ratio (DRR) of binaural signals are presented, based on the interaural magnitude-squared coherence and stochastic maximum likelihood beamforming.
Joint Estimation Of Acoustic Parameters From Single-Microphone Speech Observations
• Physics, Computer Science
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 2020
Results show the estimator compares favourably with respect to the state-of-the-art for unseen and real acoustic scenarios.
Online Blind Reverberation Time Estimation Using CRNNs
• Computer Science
INTERSPEECH
• 2020
This work proposes to use a convolutional recurrent neural network (CRNN) for blind T60 estimation as it combines the parametric efficiency of CNNs with the online estimation of recurrent neural networks and, in contrast to CNNs, can process time-sequences of variable length.
Combination strategy based on relative performance monitoring for multi-stream reverberant speech recognition
• Computer Science
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 2017
A room parameter estimation model is proposed to establish a reliable combination strategy which performs on either DNN posterior probabilities or word lattices which is highly correlated to ASR performances between multiple streams, i.e., relative performance monitoring.
Performance comparison of real-time single-channel speech dereverberation algorithms
• Computer Science
2017 Hands-free Speech Communications and Microphone Arrays (HSCMA)
• 2017
Experimental results show that all four algorithms are capable of providing benefits in reverberant environments even with moderate background noises, and low complexity and latency indicate their potential for real-time applications.
