Exploring the distribution of statistical feature parameters for natural sound textures

@article{Mishra2020ExploringTD,
  title={Exploring the distribution of statistical feature parameters for natural sound textures},
  author={Ambika Prasad Mishra and Nicol S. Harper and Jan W. H. Schnupp},
  journal={PLoS ONE},
  year={2020},
  volume={16}
}
Sounds like “running water” and “buzzing bees” are classes of sounds which are a collective result of many similar acoustic events and are known as “sound textures”. Recent psychoacoustic study using sound textures by [1] reported that natural sounding textures can be synthesized from white noise by imposing statistical features such as marginals and correlations computed from the outputs of cochlear models responding to the textures. The outputs being the envelopes of bandpass filter responses… 
4 Citations

Trends in Audio Texture Analysis, Synthesis, and Applications

Audio signals are classified into speech, music, and environmental sounds. From the evo- lution of audio features, an adequate amount of work has been seen in speech and music processing. On the

Tinnitus-like “hallucinations” elicited by sensory deprivation in an entropy maximization recurrent neural network

An objective-driven learning algorithm is used to model an early auditory processing neuronal network, e.g., in the dorsal cochlear nucleus, and shows that attenuation of peripheral inputs drives the recurrent network towards its critical point and transition into a tinnitus-like state.

Deep Learning Based Speaker Recognition System with CNN and LSTM Techniques

This work focuses on using MFCC to transform audio to spectrograms without losing the essential features of the audio file in question, using a closed set and an open set implementation procedure on these datasets.

References

SHOWING 1-10 OF 23 REFERENCES

Temporal Low-Order Statistics of Natural Sounds

The results show that natural sounds are highly redundant, and have possible implications to the neural code used by the auditory system, particularly along the cochlear axis.

Modulation spectra of natural sounds and ethological theories of auditory processing.

It is shown how to create synthetic sounds with first and second order envelope statistics identical to those found in natural sounds, and postulated that the auditory system as well as engineering applications may exploit these statistical properties to obtain an efficient representation of behaviorally relevant sounds.

Efficient coding of natural sounds

It is reported that several basic auditory nerve fiber tuning properties can be accounted for by adapting a population of filter shapes to encode natural sounds efficiently.

Tuning to Natural Stimulus Dynamics in Primary Auditory Cortex

Coding of Naturalistic Stimuli by Auditory Midbrain Neurons

It is found that naturalistic inputs significantly enhance the rate of transmitted information, indicating that auditiory neural responses are matched to characteristics of natural auditory scenes.

Optimal neural population coding of an auditory spatial cue

Here it is shown that ITD coding strategies of different species can be explained by a unifying principle: that the ITDs an animal naturally encounters should be coded with maximal accuracy.

Emergence of Tuning to Natural Stimulus Statistics along the Central Auditory Pathway

It is shown that neurons in primary auditory cortex of anaesthetized ferrets respond more strongly and reliably to dynamic stimuli whose statistics follow “natural” 1/f dynamics than to stimuli exhibiting pitch and amplitude modulations that are faster or slower than 1/ f.

The Neural Code for Auditory Space Depends on Sound Frequency and Head Size in an Optimal Manner

The optimal-coding model makes key, yet unobserved, predictions: for many species, including humans, both forms of neural representation are employed, depending on sound frequency, and novel representations are predicted for intermediate frequencies.

Derivation of auditory filter shapes from notched-noise data