Music in Our Ears: The Biological Bases of Musical Timbre Perception

  title={Music in Our Ears: The Biological Bases of Musical Timbre Perception},
  author={Kailash Patil and Daniel Pressnitzer and Shihab A. Shamma and Mounya Elhilali},
  journal={PLoS Computational Biology},
Timbre is the attribute of sound that allows humans and other animals to distinguish among different sound sources. Studies based on psychophysical judgments of musical timbre, ecological analyses of sound's physical characteristics as well as machine learning approaches have all suggested that timbre is a multifaceted attribute that invokes both spectral and temporal sound features. Here, we explored the neural underpinnings of musical timbre. We used a neuro-computational framework based on… 

Figures and Tables from this paper

Neural and behavioral investigations into timbre perception
This work reviews human timbre perception and the spectral and temporal acoustic features that give rise to timbre in speech, musical and environmental sounds, and explores the neural representation of timbre, first within the peripheral auditory system and later at the level of the auditory cortex.
Learning metrics on spectrotemporal modulations reveals the perception of musical instrument timbre.
A broad overview of former studies on musical timbre is provided to identify its relevant acoustic substrates according to biologically inspired models and observe that timbre has both generic and experiment-specific acoustic correlates.
Biomimetic spectro-temporal features for music instrument recognition in isolated notes and solo phrases
The study presents an approach for parsing solo performances into their individual note constituents and adapting back-end classifiers using support vector machines to achieve a generalization of instrument recognition to off-the-shelf, commercially available solo music.
A Flexible Bio-Inspired Hierarchical Model for Analyzing Musical Timbre
A flexible and multipurpose bio-inspired hierarchical model for analyzing musical timbre that uses a cochlear filter bank to resolve the spectral components of a sound, lateral inhibition to enhance spectral resolution, and a modulation filterBank to extract the global temporal envelope and roughness of the sound from amplitude modulations.
Representation of music genres based on the spectro-temporal modulation responses of the human brain
Comprehensive cortical representations and functional organization of music genres are examined by building voxel-wise models of fMRI data collected while human subjects listened to 540 music clips to elucidate the quantitative representation ofMusic genres in the human cortex and indicate the possibility of modeling the authors' abstract categorization of complex auditory stimuli based on the brain activity.
Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable
This study confirms the potential of these new classes of sounds, acoustic and auditory sketches, to study sound recognition, and shows that, with the exception of voice sounds, very sparse representations of sounds could be recognized above chance.
The Perceptual Representation of Timbre
  • S. McAdams
  • Physics
    Timbre: Acoustics, Perception, and Cognition
  • 2019
One conception of timbre is as a spectromorphology encompassing time-varying frequency and amplitude behaviors, as well as spectral and temporal modulations.
Acoustic Correlates of Auditory Object and Event Perception: Speakers, Musical Timbres, and Environmental Sounds
Listeners’ dissimilarity ratings were associated with spectrotemporal variability and aperiodicity, and musical training was related to improved sound identification performance.


Fast recognition of musical sounds based on timbre.
The data suggest rapid and accurate neural mechanisms for musical-sound recognition based on selectivity to complex spectro-temporal signatures of sound sources.
Separate Neural Processing of Timbre Dimensions in Auditory Sensory Memory
Results expand to timbre dimensions a property of separation of the representation in sensory memory that has already been reported between basic perceptual attributes (pitch, loudness, duration, and location) of sound sources.
Cortical Representation of Natural Complex Sounds: Effects of Acoustic Features and Auditory Object Category
A hierarchical organization of the anteroventral auditory-processing stream is supported, with the most anterior regions representing the complete acoustic signature of auditory objects.
Voice-selective areas in human auditory cortex
It is shown, using functional magnetic resonance imaging in human volunteers, that voice-selective regions can be found bilaterally along the upper bank of the superior temporal sulcus (STS), and their existence sheds new light on the functional architecture of the human auditory cortex.
The Timbre Toolbox: extracting audio descriptors from musical signals.
This analysis suggests ten classes of relatively independent audio descriptors, showing that the Timbre Toolbox is a multidimensional instrument for the measurement of the acoustical structure of complex sound signals.
Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes
The model with latent classes and specificities gave a better fit to the data and made the acoustic correlates of the common dimensions more interpretable, suggesting that musical timbres possess specific attributes not accounted for by these shared perceptual dimensions.
Acoustic correlates of timbre space dimensions: a confirmatory study using synthetic tones.
Listeners presented with carefully controlled synthetic tones use attack time, spectral centroid, and spectrum fine structure in dissimilarity rating experiments, and spectral flux appears as a less salient timbre parameter, its salience depending on the number of other dimensions varying concurrently in the stimulus set.
Listening As Introduction to the Perception of Auditory Events
Listening combines broad coverage of acoustics, speech and music perception psychophysics, and auditory physiology with a coherent theoretical orientation in a lively and accessible introduction to