Frame-Wise Dynamic Threshold Based Polyphonic Acoustic Event Detection

@inproceedings{Xia2017FrameWiseDT,
  title={Frame-Wise Dynamic Threshold Based Polyphonic Acoustic Event Detection},
  author={Xianjun Xia and Roberto B. Togneri and Ferdous Sohel and David Huang},
  booktitle={Interspeech},
  year={2017}
}
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt multi-label classification techniques to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the global threshold has to be set manually and is highly dependent on the database being tested. To deal with this, we replaced the fixed threshold method… 

Figures and Tables from this paper

Confidence Based Acoustic Event Detection

Experimental results on the latest TUT sound event 2017 database of polyphonic events demonstrate the superior performance of the proposed approach compared to the multi-label classification based AED method.

Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information

Experimental results on the monophonic UPC-TALP and the polyphonic TUT Sound Event datasets demonstrate the superior performance of the joint learning method by achieving lower error rate and higher F-score compared to the baseline AED system.

Acoustic Event Detection with Sequential Attention and Soft Boundary Information

Experimental results on the latest TUT Sound Event database demonstrate the superior performance of the proposed technique, utilizing the sequential attention together with the soft boundary information.

Evaluation of Post-Processing Algorithms for Polyphonic Sound Event Detection

This paper evaluated different approaches for temporal segmentation, namely statistics-based and parametric methods and compared post-processing algorithms on the temporal prediction curves of two models: one based on the challenge’s baseline and onebased on Multiple Instance Learning (MIL).

Acoustic event detection utilizing event class and localization information

This dissertation investigates different techniques in pursuit of extending the frontiers of acoustic event detection with particular emphasis on utilizing the acoustic event class and acoustic event localization information.

CLASS WISE DISTANCE BASED ACOUSTIC EVENT DETECTION

Crossvalidation detection results on the development set of DCASE2017 show the efficiency of the proposed method by achieving a 4% absolute reduction in segment-based error rate compared to the baseline system.

Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization

A convolutional neural network transformer (CNN-Transfomer) is proposed for audio tagging and SED, and it is shown that CNN-Transformer performs similarly to a Convolutional recurrent neural network (CRNN).

A Survey: Neural Network-Based Deep Learning for Acoustic Event Detection

How deep learning methods benefit the acoustic event detection task and the potential issues that need to be addressed for prospective real-world scenarios are discussed.

A Survey: Neural Network-Based Deep Learning for Acoustic Event Detection

How deep learning methods benefit the acoustic event detection task and the potential issues that need to be addressed for prospective real-world scenarios are discussed.

References

SHOWING 1-10 OF 31 REFERENCES

Random forest classification based acoustic event detection

Evaluations on the UPC-TALP database which consists of highly variable acoustic events demonstrate the efficiency of the proposed approaches with improvements in detection error rate compared to the best baseline system.

Polyphonic sound event detection using multi label deep neural networks

Frame-wise spectral-domain features are used as inputs to train a deep neural network for multi label classification in this work and the proposed method improves the accuracy by 19% percentage points overall.

Real-world acoustic event detection

Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features

The proposed SED system is compared against the state of the art mono channel method on the development subset of TUT sound events detection 2016 database and the usage of spatial and harmonic features are shown to improve the performance of SED.

TUT database for acoustic scene classification and sound event detection

The recording and annotation procedure, the database content, a recommended cross-validation setup and performance of supervised acoustic scene classification system and event detection baseline system using mel frequency cepstral coefficients and Gaussian mixture models are presented.

Metrics for Polyphonic Sound Event Detection

This paper presents and discusses various metrics proposed for evaluation of polyphonic sound event detection systems used in realistic situations where there are typically multiple sound sources

Challenge on Detection and Classification of Acoustic Scenes and Events ACOUSTIC EVENT DETECTION USING SIGNAL ENHANCEMENT AND SPECTRO-TEMPORAL FEATURE EXTRACTION

In this paper, an acoustic event detection system is proposed. It consists of a noise reduction signal enhancement step based on the noise power spectral density estimator proposed in [1] and on the

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording

The work on Task 1 Acoustic Scene Classification and Task 3 Sound Event Detection in Real Life Recordings has low-level and high-level features, classifier optimization and other heuristics specific to each task.

Random Regression Forests for Acoustic Event Detection and Classification

This paper poses event detection and localization as a regression task and proposes an approach based on random forest regression which shows superior performance on two databases ITC-Irst and UPC-TALP.

A database and challenge for acoustic scene classification and event detection

This paper introduces a newly-launched public evaluation challenge dealing with two closely related tasks of the field: acoustic scene classification and event detection.