Toward a model for lexical access based on acoustic landmarks and distinctive features.

  • Kenneth N. Stevens
  • Published 3 April 2002
  • Psychology
  • The Journal of the Acoustical Society of America, volume 111, issue 4
This article describes a model in which the acoustic speech signal is processed to yield a discrete representation of the speech stream in terms of a sequence of segments, each of which is described by a set (or bundle) of binary distinctive features. These distinctive features specify the phonemic contrasts that are used in the language, such that a change in the value of a feature can potentially generate a new word. This model is a part of a more general model that derives a word sequence… 
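The representation described in the abstract can be illustrated with a small sketch. It is not taken from the article: the feature names, the feature values, the three-word lexicon, and the helper names (`flip_feature`, `label_for`, `word_after_flip`) are all simplified assumptions chosen only to show how flipping one binary feature in a segment's bundle can yield a different word.

```python
from typing import Dict, Optional, Tuple

# A segment is modeled as a bundle of binary distinctive features:
# feature name -> True (+) or False (-).
FeatureBundle = Dict[str, bool]

# Hypothetical, heavily simplified feature values for three consonants.
SEGMENTS: Dict[str, FeatureBundle] = {
    "p": {"voiced": False, "nasal": False, "labial": True},
    "b": {"voiced": True, "nasal": False, "labial": True},
    "m": {"voiced": True, "nasal": True, "labial": True},
}

# Toy lexicon (an assumption): a word is a sequence of segment labels.
LEXICON: Dict[Tuple[str, ...], str] = {
    ("p", "ae", "t"): "pat",
    ("b", "ae", "t"): "bat",
    ("m", "ae", "t"): "mat",
}


def flip_feature(bundle: FeatureBundle, feature: str) -> FeatureBundle:
    """Return a copy of the bundle with one binary feature inverted."""
    changed = dict(bundle)
    changed[feature] = not changed[feature]
    return changed


def label_for(bundle: FeatureBundle) -> Optional[str]:
    """Find the segment label whose feature bundle matches exactly, if any."""
    for label, features in SEGMENTS.items():
        if features == bundle:
            return label
    return None


def word_after_flip(word: Tuple[str, ...], position: int, feature: str) -> Optional[str]:
    """Flip one feature of one segment and look the result up in the lexicon."""
    bundle = SEGMENTS.get(word[position])
    if bundle is None:
        return None  # this toy inventory has no features for the segment
    new_label = label_for(flip_feature(bundle, feature))
    if new_label is None:
        return None  # the new bundle is not a segment in this toy inventory
    candidate = word[:position] + (new_label,) + word[position + 1:]
    return LEXICON.get(candidate)


if __name__ == "__main__":
    print(word_after_flip(("p", "ae", "t"), 0, "voiced"))
    print(word_after_flip(("b", "ae", "t"), 0, "nasal"))
```

In this toy setup, changing the voicing feature of the first segment of "pat" gives "bat", while flipping a feature that produces a bundle outside the inventory returns `None`, which mirrors the wording that a feature change can "potentially" generate a new word.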

This paper discusses data and thought processes that prompted four significant changes in the formulation of the model over the past decade: rule-generated changes in segments versus modifications of cues for features, landmark detection, principles of cue selection, and the role of analysis-by-synthesis in verifying word hypotheses.

Exploring the connection of acoustic and distinctive features

The methods proposed in this study can be used to identify systematic speech signal correspondences for phonological models and can serve as a starting point for distinctive feature detection in speech recognition.

Classification of stop consonant place of articulation

The overall findings are that attributes relating to the burst spectrum in relation to the vowel contribute most effectively, while attributes relating to formant transitions are somewhat less effective.

Selective acoustic cues for French voiceless stop consonants.

The objective of this study is to define selective cues that identify only certain realizations of a feature, more precisely the place of articulation of French unvoiced stops, but have every

A Probabilistic Speech Recognition Framework Based on the Temporal Dynamics of Distinctive Feature Landmark Detectors

This paper elaborates on a computational model for speech recognition that is inspired by several different interrelated strands of research in phonology, acoustic phonetics, speech perception, and neuroscience, and constructs a hierarchically structured point process representation based on feature detectors.

Modeling the temporal dynamics of distinctive feature landmark detectors for speech recognition.

An approach that constructs a hierarchically structured point process representation based on distinctive feature landmark detectors and probabilistically integrates the firing patterns of these detectors to decode a phonological sequence is outlined.

Large vocabulary continuous speech recognition using linguistic features and constraints

A number of extensions to the original Huttenlocher-Zue lexical access model are made to take advantage of the existing facilities of a probabilistic, graph-based recognition framework and, more importantly, to model the broad linguistic features in a data-driven approach.

A hierarchical point process model for speech recognition

  • A. Jansen, P. Niyogi
  • Computer Science
    2008 IEEE International Conference on Acoustics, Speech and Signal Processing
  • 2008
In this paper, we present a computational framework to engage distinctive feature-based theories of speech perception. Our approach involves: (i) transforming the signal into a collection of marked

Robustness of Acoustic Landmarks in Spontaneously-Spoken American English

Preliminary results for one conversation show that 86% of landmarks were realized overall, with a sharply lower rate for coronal stops /t/ and /d/.



Finding acoustic regularities in speech: applications to phonetic recognition

This thesis presents an alternative approach whereby this phonetic-level description is bypassed in favor of directly relating the acoustic realizations to the underlying phonemic forms, and relies critically on the ability to detect important acoustic landmarks in the speech signal.

Landmark detection for distinctive feature-based speech recognition

An algorithm for automatically detecting landmarks associated with segments having abrupt acoustics, which provides hypotheses about the underlying broad phonetic class at each landmark as a consequence of landmark detection.
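As a rough illustration of what detecting landmarks at abrupt acoustic changes can look like, the sketch below marks frames where short-time energy rises or falls sharply. It is not the algorithm from the paper summarized above: the single full-band energy measure, the non-overlapping frames, the 9 dB threshold, and the function names are all assumptions made for the example, and it makes no attempt to hypothesize a broad phonetic class.

```python
import math
from typing import List, Tuple


def frame_energy_db(samples: List[float], frame_len: int) -> List[float]:
    """Energy in dB of consecutive non-overlapping frames."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        power = sum(x * x for x in frame) / frame_len
        # Small floor avoids log10(0) on silent frames.
        energies.append(10.0 * math.log10(power + 1e-10))
    return energies


def detect_abrupt_landmarks(
    energies_db: List[float], threshold_db: float = 9.0
) -> List[Tuple[int, str]]:
    """Return (frame index, sign) wherever energy jumps by at least threshold_db.

    "+" marks an abrupt rise and "-" an abrupt fall relative to the
    previous frame. The threshold is a hypothetical value.
    """
    landmarks = []
    for i in range(1, len(energies_db)):
        delta = energies_db[i] - energies_db[i - 1]
        if delta >= threshold_db:
            landmarks.append((i, "+"))
        elif delta <= -threshold_db:
            landmarks.append((i, "-"))
    return landmarks


if __name__ == "__main__":
    # Synthetic signal: near-silence, a loud stretch, then near-silence.
    signal = [0.001] * 100 + [0.5] * 100 + [0.001] * 100
    energies = frame_energy_db(signal, frame_len=50)
    print(detect_abrupt_landmarks(energies))
```

On the synthetic signal in the usage example, the detector reports one rise where the loud stretch begins and one fall where it ends. A practical detector would typically examine several frequency bands and smooth the energy contours before thresholding.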

Analysis and interpretation of glide characteristics in pursuit of an algorithm for recognition

The algorithm devised in this thesis determines whether or not a glide occurs between a consonant landmark and the following vowel by determining if the acoustic properties listed above fall into a certain range.

Nasal detection module for a knowledge-based speech recognition system

The nasality module that has been developed is a sonorant landmark detector that greatly reduces false landmark detection and distinguishes nasals from laterals by incorporating additional nasal manner cues.

Automatic syllable detection for vowel landmarks

The acoustic theory of speech production was used to predict characteristics of vowels, studies on a speech database tested the predictions, and the resulting data guided the development of an improved vowel landmark detector (VLD).

Detection of consonant voicing: a module for a hierarchical speech recognition system

The results in this study suggest that acoustic cues selected by considering the representation and production of speech may provide reliable criteria for determining consonant voicing.

How word onsets drive lexical access and segmentation: evidence from acoustics, phonology and processing

  • D. Gow, J. Melvold, S. Manuel
  • Linguistics
    Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
  • 1996
It is suggested that word onsets differ from other parts of words in that they offer more robust and redundant acoustic evidence about phonetic features and are generally protected from phonological assimilation, neutralization and deletion and therefore show less lawful variation in surface realization.

The geometry of phonological features

The apparently vast number of speech sounds found in the languages of the world turns out to be surface-level realisations of a limited number of combinations of a very small set of such features – some twenty or so, in current analyses.

The predominance of strong initial syllables in the English vocabulary