Overcoming label noise in audio event detection using sequential labeling
@article{Kim2020OvercomingLN, title={Overcoming label noise in audio event detection using sequential labeling}, author={Jae-Bin Kim and Seongkyu Mun and Myungwoo Oh and Soyeon Choe and Yong-Hyeok Lee and Hyung-Min Park}, journal={ArXiv}, year={2020}, volume={abs/2007.05191} }
This paper addresses the noisy label issue in audio event detection (AED) by refining strong labels as sequential labels with inaccurate timestamps removed. In AED, strong labels contain the occurrence of a specific event and its timestamps corresponding to the start and end of the event in an audio clip. The timestamps depend on subjectivity of each annotator, and their label noise is inevitable. Contrary to the strong labels, weak labels indicate only the occurrence of a specific event. They…
References
SHOWING 1-10 OF 18 REFERENCES
Connectionist Temporal Localization for Sound Event Detection with Sequential Labeling
- Computer ScienceICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2019
Evaluation on a subset of Audio Set shows that CTL closes a third of the gap between presence/ absence labeling and strong labeling, demonstrating the usefulness of the extra temporal information in sequential labeling.
Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments
- Computer ScienceDCASE
- 2018
This paper presents DCASE 2018 task 4.0, which evaluates systems for the large-scale detection of sound events using weakly labeled data (without time boundaries) and explores the possibility to exploit a large amount of unbalanced and unlabeled training data together with a small weakly labeling training set to improve system performance.
A first attempt at polyphonic sound event detection using connectionist temporal classification
- Computer Science2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2017
This paper presents a first attempt at using Connectionist temporal classification (CTC) for sound event detection, and shows that CTC is able to locate the boundaries of sound events on a very noisy corpus of consumer generated content with rough hints about their positions.
Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis
- Computer ScienceDCASE
- 2019
The paper introduces Domestic Environment Sound Event Detection (DESED) dataset mixing a part of last year dataset and an additional synthetic, strongly labeled, dataset provided this year that’s described more in detail.
Audio Set: An ontology and human-labeled dataset for audio events
- Computer Science2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2017
The creation of Audio Set is described, a large-scale dataset of manually-annotated audio events that endeavors to bridge the gap in data availability between image and audio research and substantially stimulate the development of high-performance audio event recognizers.
Audio tagging with noisy labels and minimal supervision
- Computer ScienceDCASE
- 2019
This paper presents the task setup, the FSDKaggle2019 dataset prepared for this scientific evaluation, and a baseline system consisting of a convolutional neural network.
A Comparison of Five Multiple Instance Learning Pooling Functions for Sound Event Detection with Weak Labeling
- Computer ScienceICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2019
This paper builds a neural network called TALNet, which is the first system to reach state-of-the-art audio tagging performance on Audio Set, while exhibiting strong localization performance on the DCASE 2017 challenge at the same time.
TUT database for acoustic scene classification and sound event detection
- Computer Science, Physics2016 24th European Signal Processing Conference (EUSIPCO)
- 2016
The recording and annotation procedure, the database content, a recommended cross-validation setup and performance of supervised acoustic scene classification system and event detection baseline system using mel frequency cepstral coefficients and Gaussian mixture models are presented.
Metrics for Polyphonic Sound Event Detection
- Computer Science
- 2016
This paper presents and discusses various metrics proposed for evaluation of polyphonic sound event detection systems used in realistic situations where there are typically multiple sound sources…
CLEAR Evaluation of Acoustic Event Detection and Classification Systems
- PhysicsCLEAR
- 2006
In this paper, the various systems for the tasks of AED and AEC and their results are presented.