Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index

  title={Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index},
  author={A. P. Prathosh and T. V. Ananthapadmanabha and A. G. Ramakrishnan},
  journal={IEEE Transactions on Audio, Speech, and Language Processing},
Epoch is defined as the instant of significant excitation within a pitch period of voiced speech. Epoch extraction continues to attract the interest of researchers because of its significance in speech analysis. Existing high performance epoch extraction algorithms require either dynamic programming techniques or a priori information of the average pitch period. An algorithm without such requirements is proposed based on integrated linear prediction residual (ILPR) which resembles the voice… 

Epoch Extraction Using Hilbert–Huang Transform for Identification of Closed Glottis Interval

This chapter aims at developing an extraction algorithm independent of the characteristics of vocal tract system that improves the accuracy of epochs extracted and pitch detected from speech signal.

Epoch Extraction from Speech Signals Using Temporal and Spectral Cues by Exploiting Harmonic Structure of Impulse-like Excitations

  • P. GangamohanS. Gangashetty
  • Computer Science
    ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2019
This paper develops an algorithm which does not require any apriori information of pitch period, and can be implemented in real-time applications, and exploits the harmonic property of sequence of impulse-like excitations to obtain the local pitch period.

Cumulative Impulse Strength for Epoch Extraction

A temporal measure termed the cumulative impulse strength is proposed for locating the impulses in a quasi-periodic impulse-sequence embedded in noise and applied for detecting the GCIs from the inverted integrated LPR using a recursive algorithm.

A fast algorithm for speech polarity detection using long-term linear prediction

An algorithm is proposed that improvises the existing technique using the skewness of the voice source (VS) signal using the integrated linear prediction residual (ILPR) as the VS estimate, which is obtained using linear prediction on long-term frames of the low-pass filtered speech signal.

Detection of Glottal Excitation Epochs in Speech Signal Using Hilbert Envelope

A technique, suitable for real-time processing, is presented that uses Hilbert envelope to enhance saliency of the glottal excitation epochs and to reduce the ripples due to the vocal tract filter and its robustness against highpass filtering.

CWT-Based Approach for Epoch Extraction From Telephone Quality Speech

Experimental results show that the epoch identification rate of proposed method is significantly better than the state-of-the-art methods for the telephone quality speech.

Expressive speech analysis for epoch extraction using zero frequency filtering approach

The present work discusses the issues of epoch extraction from expressive speech signals. Epochs represent the accurate glottal closure instants in voiced speech which in turn give the accurate

Epoch Extraction from Pathological Children Speech Using Single Pole Filtering Approach

The instant of significant excitation of the vocal tract system is referred to the epoch of the speech signal. The presence of high pitch and aperiodicity are the major challenges for the epoch

Analysis of singing voice for epoch extraction using Zero Frequency Filtering method

This paper analyzes singing voice for the estimation of epochs by studying the characteristics of the source-filter interaction and the effect of wider range of pitch using the Zero Frequency Filtering (ZFF) method.



Epoch extraction from linear prediction residual for identification of closed glottis interval

In voiced speech analysis epochal information is useful in accurate estimation of pitch periods and the frequency response of the vocal tract system. Ideally, linear prediction (LP) residual should

Epoch Extraction From Speech Signals

The interesting part of the results is that the epoch extraction by the proposed method seems to be robust against degradations like white noise, babble, high-frequency channel, and vehicle noise.

Epoch-based analysis of speech signals

Speech analysis is traditionally performed using short-time analysis to extract features in time and frequency domains. The window size for the analysis is fixed somewhat arbitrarily, mainly to

Determination of instants of significant excitation in speech using group delay function

A new method based on the global phase characteristics of minimum phase signals for determining the instants of significant excitation in speech signals is proposed, which works well for all types of voiced speech in male as well as female speech but, in all cases, under noise-free conditions only.

Estimation of Glottal Closing and Opening Instants in Voiced Speech Using the YAGA Algorithm

The Yet Another GCI/GOI Algorithm (YAGA) is proposed to detect GCIs from speech signals by employing multiscale analysis, the group delay function, and N-best dynamic programming and a novel GOI detector based upon the consistency of the candidates' closed quotients relative to the estimated GCIs is presented.

Detection of Glottal Closure Instants From Speech Signals: A Quantitative Review

It is shown that for clean speech, SEDREAMS and YAGA are the best performing techniques, both in terms of identification rate and accuracy, and ZFR and SEDreamS also show a superior robustness to additive noise and reverberation.

Determination of Instants of Significant Excitation in Speech Using Hilbert Envelope and Group Delay Function

The accuracy in determining the instants of significant excitation and the time complexity of the proposed method is compared with the group delay based approach.

Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm

The Dynamic Programming Projected Phase-Slope Algorithm (DYPSA) is automatic and operates using the speech signal alone without the need for an EGG signal for automatic estimation of glottal closure instants (GCIs) in voiced speech.

Detection of the closure-burst transitions of stops and affricates in continuous speech using the plosion index.

A rule-based algorithm is designed that aims at selecting only those events associated with the closure-burst transitions of stops and affricates and gives a performance comparable to or better than the state-of-the-art methods.