A robust frontend for VAD: exploiting contextual, discriminative and spectral cues of human voice


Reliable automatic detection of speech/non-speech activity in degraded, noisy audio signals is a fundamental and challenging task in robust signal processing. As various speech technology applications rely on the accuracy of a Voice Activity Detection (VAD) system for their effectiveness and robustness, the problem has gained considerable research interest…




