Modeling Prosody for Speaker Recognition: Why Estimating Pitch May Be a Red Herring

@inproceedings{Laskowski2010ModelingPF,
  title={Modeling Prosody for Speaker Recognition: Why Estimating Pitch May Be a Red Herring},
  author={Kornel Laskowski and Qin Jin},
  booktitle={Odyssey},
  year={2010}
}
It has long been claimed that spectral envelope features outperform prosodic features on speaker recognition tasks. However, the reasons for such an arrangement are not entirely compelling. In the current work we present some evidence to challenge these claims. We propose that energy found at harmonically related frequencies encodes the acoustic correlates of variables which are typically referred to as prosodic, making harmonic energy summation highly relevant. Its frequent implementation for… CONTINUE READING

References

Publications referenced by this paper.
Showing 1-10 of 25 references

Harmonics-to-noise ratio as an index of the degree of hoarseness.

The Journal of the Acoustical Society of America • 1982
View 6 Excerpts
Highly Influenced

CSR-I (WSJ0) Complete

J. Garofolo, D. Graff, D. Paul, D. Pallett
Linguistic Data Consortium, vol. LDC93S6A, • 2007
View 2 Excerpts
Highly Influenced

Comparing the contributions of context and prosody in text-independent dialog act recognition

2010 IEEE International Conference on Acoustics, Speech and Signal Processing • 2010
View 1 Excerpt

Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum

2009 IEEE International Conference on Acoustics, Speech and Signal Processing • 2009
View 3 Excerpts

Using sets of combs to control pitch estimation errors

Li énard, J.-S, C. Barras, F. Signol
Proc. 155th Meeting Acoustical Society of America, Paris, France, • 2008
View 2 Excerpts

Similar Papers

Loading similar papers…