• Corpus ID: 9639362

Segmentation of a speech waveform according to glottal open and closed phases using an autoregressive-HMM

@inproceedings{Smith2000SegmentationOA,
  title={Segmentation of a speech waveform according to glottal open and closed phases using an autoregressive-HMM},
  author={Gavin Smith and Tony Robinson},
  booktitle={INTERSPEECH},
  year={2000}
}
This paper presents an algorithm to segment speech according to glottal open and closed phases using the time waveform alone. Based on this, pitch, jitter and closed to open glottal ratios can be computed. Segmentation is achieved by identifying spectral changepoints at the subpitch period timescale. Changepoints are identi ed using a 3-state autoregressive hidden Markov model (AR-HMM) operating on the time waveform, with the LiljencrantsFant (LF) glottal model as a theoretical basis. Model… 
Adaptive Algorithms for Pitch-synchronous Speech Signal Segmentation
Abstract The paper describes and compares two time domain algorithms for segmenting voiced speech into quasi-periodical units that correspond to pitch periods. The first algorithm uses the similarity
Pitch-Synchronous Speech Signal Segmentation and Its Applications
TLDR
A time domain algorithm for segmenting voiced speech into quasi-periodical units that correspond to pitch periods is described and its application to precise pitch estimation is discussed.

References

SHOWING 1-6 OF 6 REFERENCES
Isolated-utterance speech recognition using hidden Markov models with bounded state durations
TLDR
It was found that in the discrete case the recognition rate of HMM/BSD is 9.0%, 6.3%, and 1.9% higher than the conventional HMMs and HMMs with Poisson and gamma distribution state durations, respectively.
Precise glottal closure instant detector for voiced speech
An algorithm which can extract precise glottal closure instant (GCI) information directly from speech is described. By utilising the average pitch and the area informations, accurate GCI positions
Discrete-Time Processing of Speech Signals
TLDR
The preface to the IEEE Edition explains the background to speech production, coding, and quality assessment and introduces the Hidden Markov Model, the Artificial Neural Network, and Speech Enhancement.
Fundamentals of speech recognition
TLDR
This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.
Segmentation of a Speech Waveform According to Glottal Timing Information using an Explicit Duration AutoregressiveHMM
  • Technical Report CUED/F-INFENG/TR.390, Cambridge University Engineering Dept., UK,
  • 2000