• Published 2009

Nonlinear translation-invariant transformations for speaker-independent speech recognition

@inproceedings{Mller2009NonlinearTT,
  title={Nonlinear translation-invariant transformations for speaker-independent speech recognition},
  author={Florian M{\"u}ller and Alfred Mertins},
  year={2009}
}
The spectral effects of vocal tract length (VTL) changes are one reason of why the recognition rate of today’s speaker-independent automatic speech recognition (ASR) systems is considerably lower than the one of speaker-dependent systems. By using certain types of filterbanks these effects can be described by a translation in subband-index space. In this paper, nonlinear translation-invariant transforms that originally have been proposed in the field of pattern recognition are investigated for… CONTINUE READING

Figures and Tables from this paper.

Citations

Publications citing this paper.

References

Publications referenced by this paper.
SHOWING 1-10 OF 24 REFERENCES

The HTK book version 3.4

VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

Frequency-Warping Invariant Features for Automatic Speech Recognition

VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

A class of translation invariant transforms

VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

Audio segmentation based on multi-scale audio classification

  • Yibin Zhang, Jie Zhou
  • Computer Science
  • 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 2004

Compression: From Cochlea to Cochlear Implants

VIEW 1 EXCERPT