Adverse conditions improve distinguishability of auditory, motor, and perceptuo-motor theories of speech perception: An exploratory Bayesian modelling study

Clément Moulin-Frier, Raphaël Laurent, Pierre Bessière, Jean-Luc Schwartz, Julien Diard
Language and Cognitive Processes, pp. 1240-1263

In this paper, we put forward a computational framework for the comparison between motor, auditory, and perceptuo-motor theories of speech communication. We first recall the basic arguments of these three sets of theories, either applied to speech perception or to speech production. Then we expose a unifying Bayesian model able to express each theory in a probabilistic way. Focusing on speech perception, we demonstrate that under two hypotheses, regarding communication noise and inter-speaker… 

The Complementary Roles of Auditory and Motor Information Evaluated in a Bayesian Perceptuo-Motor Model of Speech Perception

COSMO (Communicating Objects using Sensory-Motor Operations), an integrative model that allows principled comparisons of purely motor or purely auditory implementations of a speech perception task and tests the gain of efficiency provided by their Bayesian fusion, is developed.

Computer simulations of coupled idiosyncrasies in speech perception and speech production with COSMO, a perceptuo-motor Bayesian model of speech communication

This paper attempts to simulate one study on coupled idiosyncrasies in the perception and production of French oral vowels, within COSMO, a Bayesian computational model of speech communication, and proposes a perceptuo-motor model in which auditory processing would enable optimal processing of learned sounds and motor processing would be helpful in unlearned adverse conditions.

What drives the perceptual change resulting from speech motor adaptation? Evaluation of hypotheses in a Bayesian modeling framework

Simulations suggest that some hypotheses concerning the motor and auditory updates that could result from motor learning are compatible with a framework in which motor adaptation updates both the auditory-motor internal model and the auditory characterization of the perturbed phoneme, and where perception involves both auditory and somatosensory pathways.

The shadow of a doubt? Evidence for perceptuo-motor linkage during auditory and audiovisual close-shadowing

A test of whether the visual modality could speed motor responses in a close-shadowing task yielded results that are interpreted within a two-stage sensory-motor framework, in which the auditory and visual streams are integrated together and with internally generated motor representations before a final decision may be available.

COSMO SylPhon: A Bayesian Perceptuo-motor Model to Assess Phonological Learning

A Bayesian model of speech communication, named "COSMO SylPhon", is proposed, which shows that if agents are equipped with a bootstrap process inspired by the Frame-Content Theory of speech development, they learn to associate consonants to specific articulatory gestures, providing the basis for consonantal articulatory invariance.

Recognizing speech in a novel accent: the motor theory of speech perception reframed

A novel computational model of how a listener comes to understand the speech of someone speaking the listener’s native language with a foreign accent is presented; it serves as a reference point for the discussion in Part 3, which proposes a dual-stream neuro-linguistic architecture.

A computational model of perceptuo-motor processing in speech perception: learning to imitate and categorize synthetic CV syllables

This paper presents COSMO, a Bayesian computational model, which is expressive enough to carry out syllable production, perception and imitation tasks using motor, auditory or perceptuo-motor

Echoes on the motor network: how internal motor control structures afford sensory experience

Evidence of AMR development from a motor control perspective is reviewed, and it is recommended that activation of these types of internal models outside of action execution may provide an ecological advantage when encountering known stimuli in ambiguous conditions.



The motor theory of speech perception revised

Cortical interactions underlying the production of speech sounds.

F. Guenther. Biology, Psychology. Journal of Communication Disorders, 2006.

The Motor Somatotopy of Speech Perception

Speech listening specifically modulates the excitability of tongue muscles: a TMS study

It is demonstrated that, during speech listening, there is an increase of motor‐evoked potentials recorded from the listeners' tongue muscles when the presented words strongly involve, when pronounced, tongue movements.

A theoretical investigation of reference frames for the planning of speech movements.

A 4-part theoretical treatment favoring models whose only invariant targets are regions in auditory perceptual space over models that posit invariant constriction targets is presented, which poses several difficult challenges to proponents of constriction theories.

The Essential Role of Premotor Cortex in Speech Perception

Articulatory bias in speech categorization: Evidence from use-induced motor plasticity

Emergence of articulatory-acoustic systems from deictic interaction games in a “Vocalize to Localize” framework

A computational Bayesian model incorporating the Dispersion and Quantal Theories of speech sounds inside the Vocalize-to-Localize framework is presented, and it is shown how realistic simulations of vowel systems can emerge from this model.

Hearing lips and seeing voices: how cortical areas supporting speech production mediate audiovisual speech perception.

The results suggest that AV speech elicits in the listener a motor plan for the production of the phoneme that the speaker might have been attempting to produce, and that feedback in the form of efference copy from the motor system ultimately influences the phonetic interpretation.