An Introduction to Audio Content Analysis: Applications in Signal Processing and Music Informatics
@inproceedings{Lerch2012AnIT, title={An Introduction to Audio Content Analysis: Applications in Signal Processing and Music Informatics}, author={Alexander Lerch}, year={2012}, url={https://api.semanticscholar.org/CorpusID:60238484} }
This book provides quick access to different analysis algorithms and allows comparison between different approaches to the same task, making it useful for newcomers to audio signal processing and industry experts alike.
Figures and Tables from this paper
figure 1.1 figure 10.1 figure 2.1 figure 2.10 figure 2.11 figure 2.12 figure 2.13 figure 2.13 figure 2.14 figure 2.2 figure 2.3 figure 2.4 figure 2.5 figure 2.6 figure 2.7 figure 2.8 figure 2.9 figure 3.1 table 3.1 figure 3.10 table 3.10 figure 3.11 figure 3.12 figure 3.13 figure 3.14 figure 3.15 table 3.15 figure 3.16 figure 3.17 figure 3.18 figure 3.19 figure 3.2 figure 3.20 figure 3.3 table 3.3 figure 3.4 figure 3.5 figure 3.6 table 3.6 figure 3.7 table 3.7 table 3.7 figure 3.8 table 3.8 figure 3.9 table 3.9 figure 4.1 table 4.1 figure 4.2 figure 4.3 figure 4.4 figure 4.5 figure 4.6 figure 4.7 figure 5.1 figure 5.10 figure 5.11 figure 5.12 figure 5.13 figure 5.14 figure 5.15 figure 5.16 figure 5.17 figure 5.18 figure 5.19 figure 5.2 table 5.2 figure 5.20 figure 5.21 figure 5.22 figure 5.23 figure 5.24 figure 5.25 table 5.3 figure 5.4 figure 5.5 table 5.5 figure 5.6 table 5.6 figure 5.7 table 5.7 figure 5.8 figure 5.9 table 5.9 figure 6.1 table 6.1 figure 6.2 figure 6.3 figure 6.4 figure 6.5 figure 6.6 figure 7.1 figure 7.2 figure 7.3 figure 8.2 table 8.2 table 9.1 figure 9.1 figure 9.2 figure B.2
221 Citations
Pitch Detection Using Modified Autocorrelation and Web Audio API
- 2023
Computer Science
The approach to pitch detection using autocorrelation and the Web Audio API represents a promising avenue for future research in music signal processing and has the potential to facilitate the development of new applications in this field.
Audio Content Analysis
- 2021
Computer Science
This chapter defines musical audio content, introduces the general process of audio content analysis, and surveys basic approaches toaudio content analysis.
Music Performance Analysis: A Survey
- 2019
Computer Science
The field of Music Performance Analysis (MPA) is surveyed from various perspectives, its significance to the field of MIR is discussed, and opportunities for future research in this field are pointed out.
Music Instrument Identification using Convolutional Neural Networks
- 2018
Computer Science
The following master thesis presents an Instrument Identification system for polyphonic classical music material including a preprocessing method to reduce crosstalk within multi-track recordings and evaluates two different methods based on sliding classification and multiple instance learning to train a classifier on large time frames but eventually predict on smaller time frames.
Interactive Learning of Signal Processing Through Music: Making Fourier Analysis Concrete for Students
- 2021
Computer Science, Education
It is demonstrated how the music domain provides motivating and tangible applications that make learning signal processing an interactive pursuit and how software tools originally developed for music analysis provide students multiple entry points to delve deeper into classical signal processing techniques while bridging the gap between education and cutting-edge research.
libACA, pyACA, and ACA-Code: Audio Content Analysis in 3 Languages
- 2022
Computer Science
Automatic Spatial Audio Scene Classification in Binaural Recordings of Music
- 2019
Computer Science, Engineering
It is demonstrated that in addition to the binaural cues, the Mel-frequency cepstral coefficients constitute an important carrier of spatial information, imperative for the classification of spatial audio scenes.
Iracema: a Python library for audio content analysis
- 2020
Computer Science
An architecture that will provide to users an abstraction level that simplifies the manipulation of different kinds of time series, as well as the extraction of segments from them is proposed.
IMPROVEMENT OF AUDIO FEATURE EXTRACTION TECHNIQUES IN TRADITIONAL INDIAN STRING MUSICAL INSTRUMENT
- 2016
Computer Science, Engineering
Zero Forcing Equalizer (ZFE) was integrated with three audio feature extraction techniques, namely MFCC-ZFE, LPC-ZFE and ZCR-ZFE in order to improve the performance of the existing techniques.
Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD
- 2020
Computer Science, Engineering
Experimental results show that the hierarchical ACC system using hybrid feature-based SMD and entropy-based VAD is successfully evaluated against three available datasets and is comparable with existing methods even in a variable noise-level environment.
478 References
Multi-pitch and periodicity analysis model for sound separation and auditory scene analysis
- 1999
Computer Science, Engineering
A model for multi-pitch and periodicity analysis of complex audio signals is presented that is more efficient and practical than the Meddis and O'Mard unitary pitch perception model, yet exhibits very similar behavior.
Hierarchical Automatic Audio Signal Classification
- 2004
Computer Science
The design, implementation, and evaluation of a system for automatic audio signal classification is presented, differentiating between three speech classes, 13 musical genres, and background noise according to audio type.
Camel: a lightweight framework for content-based audio and music analysis
- 2010
Computer Science
A brief overview of the design decisions and characteristics of CAMEL (Content-based Audio and Music Extraction Library), an easy-to-use C++ framework developed for content-based audio and music analysis.
Towards Instrument Segmentation for Music Content Description: a Critical Review of Instrument Classification Techniques
- 2000
Computer Science
This survey discusses the necessity of developing new strategies for classifying sound mixes without a priori separation of sound sources, and evaluates their applicability to the more complex case of describing sound mixes.
Advanced Audio Identification Using MPEG-7 Content Description
- 2001
Computer Science
A system for automatic identification of audio material from a database of registered works is presented, designed to allow reliable, fast and robust detection ofaudio material with the resources provided by today’s standard computing platforms.
Automatic Song Identification in Noisy Broadcast Audio
- 2002
Computer Science, Engineering
A new method to minimize the effects of audio manipulation (i.e. radio edits) and distortions due to broadcast transmissions and is designed to give almost no false positives and achieve very high accuracy.
A HIERARCHICAL APPROACH TO AUTOMATIC MUSICAL GENRE CLASSIFICATION
- 2003
Computer Science
A system for the automatic classification of audio signals according to audio category is presented, which is motivated by the numerous advantages of such a tree-like structure, which include easy expansion capabilities, flexibility in the design of genre-dependent features and the ability to reduce the probability of costly errors.
Extracting audio cues in real time to understand musical expressiveness
- 2001
Computer Science
This paper presents a preliminary set of tools, able to extract some simple audio cues in real time from single-instrument musical performances, and an approach to map such cues to an expressive space to distinguish performances characterized by different expressive content.
A Robust Mid-Level Representation for Harmonic Content in Music Signals
- 2005
Computer Science
It is hoped that by utilizing the notion of a musically-motivated mid-level representation, this work may help bridge the gap between symbolic and audio research.
Evaluation of Features for Audio-to-Audio Alignment
- 2011
Computer Science
A new method for the objective evaluation of audio-to-audio alignment systems is proposed that enables the use of arbitrary kinds of music as ground truth data and showed that the feature weighting algorithm could improve the alignment accuracies compared to the results of the individual features.