Proposal of the Aesthetic Experience-Oriented Evaluation Framework for Field-recording Sound Retrieval System: Experiments using Acoustic Feature Signatures Based on Multiscale Fractal Dimension

Authors: Motohiro Sunouchi and Masaharu Yoshioka
Published in: 2021 3rd International Conference on Image, Video and Signal Processing

Sound designers and musicians often need to retrieve sound materials based on their similarity to aesthetic hearing experiences from sound databases such as Freesound. This study proposes an aesthetic experience-oriented evaluation framework for a field-recording sound retrieval system, using the sound clips extracted from Freesound. Furthermore, we discuss the features of the framework by analyzing the performance of the similarity search system for field-recording sound material using… 

Similarity Search Of Freesound Environmental Sound Based On Their Enhanced Multiscale Fractal Dimension

In this paper, we propose a new acoustic feature signature based on the multiscale fractal dimension extracted from sound signals for the content-based retrieval of environmental sounds such as

Diversity-Robust Acoustic Feature Signatures Based on Multiscale Fractal Dimension for Similarity Search of Environmental Sounds

This paper extends EMFD by using the kernel density estimation method (EMFD-KDE), which results in increased stability and robustness against small fluctuations in the parameters of sound sources, and newly proposes another acoustic feature signature based on MFD, namely the very-long-range multiscale fractal dimension signature (MFD-VL).

Environmental Sound Recognition With Time–Frequency Audio Features

An empirical feature analysis for audio environment characterization is performed, and a matching pursuit algorithm is proposed to obtain effective time-frequency features that yield higher recognition accuracy for environmental sounds.

Multiscale Fractal Analysis of Musical Instrument Signals With Application to Recognition

The multiscale fractal dimension (MFD) profile is proposed as a short-time descriptor, useful to quantify the multiscale complexity and fragmentation of the different states of the music waveform, and can discriminate several aspects among different music instruments.

Fractal dimensions of speech sounds: computation and application to automatic speech recognition.

The geometry of speech turbulence as reflected in the fragmentation of the time signal is quantified by using fractal models and an efficient algorithm for estimating the short-time fractal dimension of speech signals based on multiscale morphological filtering is described.
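
The idea summarized above, estimating a fractal dimension from multiscale morphological filtering, can be illustrated with a minimal sketch. It is not the cited paper's algorithm: the function name `morphological_fractal_dimension`, the flat structuring element, the `max_scale` parameter, and the single least-squares fit over all scales are assumptions made for illustration. The signal is dilated and eroded at each scale s, the area A(s) between the two envelopes is measured, and the dimension is taken as the slope of log(A(s)/s^2) against log(1/s).

```python
import math
import random


def morphological_fractal_dimension(signal, max_scale=10):
    """Estimate a fractal dimension from a morphological covering (sketch).

    Assumption: a flat structuring element of half-width s is used, so
    dilation and erosion reduce to a moving maximum and a moving minimum.
    """
    n = len(signal)
    xs, ys = [], []
    for s in range(1, max_scale + 1):
        area = 0.0
        for i in range(n):
            # Window is clamped at the signal boundaries.
            window = signal[max(0, i - s): i + s + 1]
            area += max(window) - min(window)  # dilation minus erosion
        if area <= 0.0:
            continue  # constant signal: this scale carries no information
        xs.append(math.log(1.0 / s))
        ys.append(math.log(area / (s * s)))
    if len(xs) < 2:
        raise ValueError("need at least two usable scales to fit a slope")
    # Ordinary least-squares slope of ys against xs.
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


if __name__ == "__main__":
    rng = random.Random(0)
    ramp = [0.01 * i for i in range(1000)]          # smooth signal
    noise = [rng.random() for _ in range(1000)]     # highly fragmented signal
    print("ramp :", morphological_fractal_dimension(ramp))
    print("noise:", morphological_fractal_dimension(noise))
```

A smooth ramp should yield an estimate close to 1, while white noise should yield a clearly larger value. A short-time descriptor would apply the same estimate to successive frames of the signal.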

Environmental sound recognition: A survey

This survey offers a qualitative and elucidatory overview of recent developments in environmental sound recognition (ESR) and comprises three parts: i) basic environmental sound processing schemes, ii) stationary ESR techniques, and iii) non-stationary ESR techniques.

Freesound 2: An Improved Platform for Sharing Audio Clips

Freesound is an online collaborative sound database where people from different disciplines share recorded sound clips under Creative Commons licenses. It was started in 2005 and it is being

A Phenomenological Experience of Sound: Notes on Francisco López

This article provides an overview of the output of sound artist Francisco López (b. Madrid, 1964) by examining his music (or soundworks), his writings, and his performances. Unlike electroacoustic

A Theoretical Analysis of NDCG Type Ranking Measures

This paper studies, from a theoretical perspective, the widely used Normalized Discounted Cumulative Gain (NDCG)-type ranking measures, and shows that NDCG with logarithmic discount has consistent distinguishability although it converges to the same limit for all ranking functions.
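
As a concrete reference point for the measure discussed above, the following is a minimal sketch of NDCG with the standard logarithmic discount. The function names `dcg` and `ndcg`, the linear gain (the relevance grade itself), and the optional cutoff `k` are assumptions made for illustration; the cited paper analyzes a broader family of discounts.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain with a log2(rank + 1) discount."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg(relevances, k=None):
    """NDCG of a ranked list of graded relevances, optionally cut at rank k.

    `relevances` lists the relevance grade of each retrieved item in the
    order the system returned them (assumed non-negative).
    """
    ranked = list(relevances)[:k]
    # The ideal ranking sorts the same grades in descending order.
    ideal = sorted(relevances, reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    return dcg(ranked) / ideal_dcg if ideal_dcg > 0 else 0.0


if __name__ == "__main__":
    print(ndcg([3, 2, 1]))       # already in ideal order
    print(ndcg([1, 2, 3]))       # reversed order scores lower
    print(ndcg([0, 3, 2], k=2))  # cutoff at rank 2
```

A perfectly ordered result list scores 1, and any list that places lower-graded items ahead of higher-graded ones scores below 1.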

An algorithm for suffix stripping

An algorithm for suffix stripping is described, which has been implemented as a short, fast program in BCPL and performs slightly better than a much more elaborate system with which it has been compared.