• Corpus ID: 245502498

Acoustic scene classification using auditory datasets

@article{Kumpawat2021AcousticSC,
  title={Acoustic scene classification using auditory datasets},
  author={Jayesh Kumpawat and Shubhajit Dey},
  journal={ArXiv},
  year={2021},
  volume={abs/2112.13450}
}
The approach of using audio datasets to solve the problems revolving around land use patterns has gained decent amount of research exposure in the recent times. Although being a blooming domain in the field of AI and Data Analysis , it would not be wrong to declare that a significant amount of journey is yet to be covered. In this article, the project conducted to classify some pre-defined acoustic scene is discussed and explained. designed Output class the The approach used not only challenges… 

References

SHOWING 1-10 OF 15 REFERENCES

A Review of Deep Learning Based Methods for Acoustic Scene Classification

TLDR
This article summarizes and groups existing approaches for data preparation, i.e., feature representations, feature pre-processing, and data augmentation, and for data modeling, i.

Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network

TLDR
This paper explores the use of Multi-channel CNN for the classification task, which aims to extract features from different channels in an end-to-end manner, and explores the using of mixup method, which can provide higher prediction accuracy and robustness in contrast with previous models.

SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification

TLDR
This paper presents SpecAugment++, a novel data augmentation method for deep neural networks based acoustic scene classification (ASC), which can obtain 3 .

CQT-based Convolutional Neural Networks for Audio Scene Classification

TLDR
It is shown in this paper that a ConstantQ-transformed input to a Convolutional Neural Network improves results and a parallel (graphbased) neural network architecture is proposed which captures relevant audio characteristics both in time and in frequency.

Acoustic Scene Classification: Classifying environments from the sounds they produce

TLDR
An account of the state of the art in acoustic scene classification (ASC), the task of classifying environments from the sounds they produce, and a range of different algorithms submitted for a data challenge to provide a general and fair benchmark for ASC techniques.

Receptive-field-regularized CNN variants for acoustic scene classification

TLDR
This paper performs a systematic investigation of different RF configuration for various CNN architectures on the DCASE 2019 Task 1.A dataset, introduces Frequency Aware CNNs to compensate for the lack of frequency information caused by the restricted RF, and investigates if and in what RF ranges they yield additional improvement.

DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events

TLDR
The aim is to build an integrated system that can serve as a pretrained model to perform the three abovementioned tasks, and demonstrates that the proposed architecture, called DcaseNet, can be either directly used for any of the tasks while providing suitable results or fine-tuned to improve the performance of one task.

An evaluation of data augmentation methods for sound scene geotagging

TLDR
This work improves on the state-of-the-art city geotagging method by 23% in terms of classification accuracy, and evaluates a series of common audio data augmentation methods to evaluate which best improves the accuracy of audio geotagged classifiers.

Noise-Robust Sound-Event Classification System with Texture Analysis

TLDR
In this study, a noise-robust system for the classification of sound data is proposed that can be used either as a standalone solution or as a supplement to the known methods to obtain a more accurate solution.

Deep Neural Network Acoustic Models for ASR

TLDR
New acoustic models based on Deep Neural Networks (DNN) that have begun to replace GMMs that enable better generalization of learned features to new situations, even when only small amounts of training data are available are described.