Exploring the anatomical encoding of voice with a mathematical model of the vocal system

M. Florencia Assaneo, Jacobo Diego Sitt, Gaël Varoquaux, Mariano Sigman, Laurent D. Cohen, Marcos Alberto Trevisan

Discrete anatomical coordinates for speech production and synthesis
This work used Hall-effect transducers and magnets (mounted on the tongue, lips and jaw) to track the kinematics of the oral tract during the vocalization of vowel-consonant-vowel structures, addressing a relevant question in the biology of language.
Vocal effort modulates the motor planning of short speech structures.
Results support that the effort required to produce the sequence of movements of a vocal gesture modulates the onset of the motor plan, and reflect that the motor planning of CCVs and transitions are proxy indicators of the vocal effort needed to produce them.
The morphometry of the laryngeal phonatory system - base of the anatomical study of the voice aptitudes.
This study dissected seven embalmed anatomical parts and measured the anatomical elements involved in phonation, organized on three levels: laryngeal, oral, palatinal, pharyngeal, epiglottal and nasal.
The audiovisual structure of onomatopoeias: An intrusion of real-world physics in lexical creation
A controlled experiment on the capacity of creative language to transport complex multisensory information, in which participants improvised onomatopoeias from noisy moving objects in audio, visual and audiovisual formats, found that consonants communicate movement types mainly through the manner of articulation in the vocal tract.
Motor representations underlie the reading of unfamiliar letter combinations
The results support that a speech motor code is used for the recognition of infrequent text strings during silent reading and show that transitions measure the articulatory effort required to produce the CCVs.
Ocular dynamics reveal articulatory processing at single-phoneme level during silent reading
The results demonstrate that silent reading is modulated by slight articulatory features such as the laryngeal abduction needed to devoice a single consonant or the reshaping of the vocal tract between successive consonants.
Mechanisms of voice processing: Evidence from autism spectrum disorder
The correct perception of information carried by the voice is a key requirement for successful human communication. Hearing another person’s voice provides information about who is speaking (voice
Cortical entrainment: what we can learn from studying naturalistic speech perception
The view that naturalistic experimental paradigms, utilising spontaneously produced speech as stimuli and suitable frequency-domain methodological tools, should be used to address an important question that remains open: whether cortical entrainment is observed during speech perception and comprehension in real-life communicative situations is advanced.


Voice-selective areas in human auditory cortex
It is shown, using functional magnetic resonance imaging in human volunteers, that voice-selective regions can be found bilaterally along the upper bank of the superior temporal sulcus (STS), and their existence sheds new light on the functional architecture of the human auditory cortex.
Decoding Articulatory Features from fMRI Responses in Dorsal Speech Regions
The role of articulatory representations during passive listening is examined using carefully controlled stimuli (spoken syllables) in combination with multivariate fMRI decoding, revealing articulatory-specific brain responses to speech at multiple cortical levels, including auditory, sensorimotor, and motor regions, and suggesting the representation of sensorimotor information during passive speech perception.
The motor theory of speech perception revised
Task-Dependent Decoding of Speaker and Vowel Identity from Auditory Cortical Response Patterns
The task dependency of speaker/vowel classification demonstrates that the informative fMRI response patterns reflect the top-down enhancement of behaviorally relevant sound representations and suggests that successful selection, processing, and retention of task-relevant sound properties relies on the joint encoding of information across early and higher-order regions of the auditory cortex.
Relation of vocal tract shape, formant transitions, and stop consonant identification.
  • B. Story, K. Bunton
  • Physics
    Journal of speech, language, and hearing research : JSLHR
  • 2010
It was demonstrated that regions of the vocal tract exist that, when constricted, shift the formant frequencies in a predictable direction and the boundaries of these acoustically defined regions were shown to coincide with phonetic categories for stop consonants.
Phonetic Feature Encoding in Human Superior Temporal Gyrus
High-density direct cortical surface recordings in humans while they listened to natural, continuous speech were used to reveal the STG representation of the entire English phonetic inventory, demonstrating the acoustic-phonetic representation of speech in human STG.
Norm-Based Coding of Voice Identity in Human Auditory Cortex
A parametric model of the vocal tract area function for vowel and consonant simulation.
  • B. Story
  • Physics
    The Journal of the Acoustical Society of America
  • 2005
A model of the vocal-tract area function is described that consists of four tiers and can be specified either as static or time varying, which allows for multiple levels of coarticulation or coproduction.
Identification of synthetic vowels based on selected vocal tract area functions.
The purpose of this study was to determine the degree to which synthetic vowel samples based on previously reported vocal tract area functions of eight speakers could be accurately identified by