TUT database for acoustic scene classification and sound event detection

@article{Mesaros2016TUTDF,
  title={TUT database for acoustic scene classification and sound event detection},
  author={Annamaria Mesaros and Toni Heittola and Tuomas Virtanen},
  journal={2016 24th European Signal Processing Conference (EUSIPCO)},
  year={2016},
  pages={1128-1132}
}
We introduce TUT Acoustic Scenes 2016 database for environmental sound research, consisting of binaural recordings from 15 different acoustic environments. A subset of this database, called TUT Sound Events 2016, contains annotations for individual sound events, specifically created for sound event detection. TUT Sound Events 2016 consists of residential area and home environments, and is manually annotated to mark onset, offset and label of sound events. In this paper we present the recording… 

Figures and Tables from this paper

Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events
TLDR
This study indicates that recognizing acoustic scenes by identifying distinct sound events is effective and paves the way for future studies that combine this strategy with previous ones.
Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)
TLDR
The proposed SED system is compared against the state of the art mono channel method on the development subset of TUT sound events detection 2016 database and the usage of spatial and harmonic features are shown to improve the performance of SED.
Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016), Budapest, Hungary, 3 Sep 2016.
TLDR
The proposed SED system is compared against the state of the art mono channel method on the development subset of TUT sound events detection 2016 database and the usage of spatial and harmonic features are shown to improve the performance of SED.
Joint Analysis of Acoustic Events and Scenes Based on Multitask Learning
TLDR
Experimental results obtained indicate that the proposed multitask learning for joint analysis of acoustic events and scenes improves the performance of acoustic event detection by 10.66 percentage points in terms of the F-score, compared with a conventional method based on a convolutional recurrent neural network.
Improving Sound Event Detection in Domestic Environments using Sound Separation
TLDR
This paper starts from a sound separation model trained on the Free Universal Sound Separation dataset and the DCASE 2020 task 4 sound event detection baseline, and explores different methods to combine separated sound sources and the original mixture within the sound event Detection.
Acoustic Scene Classification Using Deep Audio Feature and BLSTM Network
TLDR
This work presents the work of acoustic scene classification for the challenge of the Detection and Classification of Acoustic Scenes and Events 2017, i.e., DCASE2017 challenge, using a feature of Deep Audio Feature (DAF) for acoustic scene representation and a classifier of Bidirectional Long Short Term Memory (BLSTM) network foroustic scene classification.
Robust Feature Learning on Long-Duration Sounds for Acoustic Scene Classification
TLDR
A robust feature learning (RFL) framework to train the CNN and shows that the proposed RFL framework can obtain a more robust acoustic scene classifier towards unseen devices and cities.
Reverberation-based feature extraction for acoustic scene classification
TLDR
A strong low-complexity baseline system using a compact feature set that is improved with a novel class of audio features, which exploit the knowledge of sound behaviour within the scene - reverberation, which increases the classification accuracy.
Weakly labeled acoustic event detection using local detector and global classifier
TLDR
This paper proposes an acoustic event detection framework for weakly supervised data which is labeled with only the existence of events and shows that the proposed model has a lower EER on the CHiME Home dataset than other neural network based models.
...
...

References

SHOWING 1-10 OF 25 REFERENCES
Detection and classification of acoustic scenes and events: An IEEE AASP challenge
TLDR
An overview of systems submitted to the public evaluation challenge on acoustic scene classification and detection of sound events within a scene as well as a detailed evaluation of the results achieved by those systems are provided.
HOG and subband power distribution image features for acoustic scene classification
TLDR
This work proposes to use the Subband Power Distribution (SPD) as a feature to capture the occurrences of these events by computing the histogram of amplitude values in each frequency band of a spectrogram image by using the so-called Sinkhorn kernel.
Acoustic event detection for multiple overlapping similar sources
  • D. Stowell, David Clayton
  • Physics
    2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
  • 2015
TLDR
A simple method modelling the onsets, durations and offsets of acoustic events to avoid intrinsic limits on polyphony or on inter-event temporal patterns is introduced and evaluated in a case study with over 3000 zebra finch calls.
Events Detection for an Audio-Based Surveillance System
TLDR
The automatic shot detection system presented is based on a novelty detection approach which offers a solution to detect abnormality (abnormal audio events) in continuous audio recordings of public places and takes advantage of potential similarity between the acoustic signatures of the different types of weapons by building a hierarchical classification system.
Metrics for Polyphonic Sound Event Detection
This paper presents and discusses various metrics proposed for evaluation of polyphonic sound event detection systems used in realistic situations where there are typically multiple sound sources
Detecting audio events for semantic video search
TLDR
The experiments with SVM classifiers, and different features, using a 290-hour corpus of sound effects, which allowed us to build detectors for almost 50 semantic concepts, showed that the task is much harder in real-life videos, which so often include overlapping audio events.
Context-dependent sound event detection
TLDR
The two-step approach was found to improve the results substantially compared to the context-independent baseline system, and the detection accuracy can be almost doubled by using the proposed context-dependent event detection.
Automatic surveillance of the acoustic activity in our living environment
TLDR
An acoustic surveillance system comprised of a computer and microphone situated in a typical office environment that continuously analyzes the acoustic activity at the recording site, separates all interesting events, and stores them in a database is reported.
Acoustic context recognition using local binary pattern codebooks
TLDR
The paper introduces the use of local binary pattern (LBP) analysis which is used to complement MFCCs with acoustic texture information and relates to a bag-of-features extension which clusters LBPs into a small number of codewords.
...
...