Physiologically-motivated feature extraction for speaker identification

@inproceedings{Wang2014PhysiologicallymotivatedFE,
  title={Physiologically-motivated feature extraction for speaker identification},
  author={Jianglin Wang and Michael T. Johnson},
  booktitle={2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2014},
  pages={1690--1694}
}
This paper introduces three physiologically-motivated features for speaker identification: Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC), and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information…
