Model composition by lagrange polynomial approximation for robust speech recognition in noisy environment

@inproceedings{Raut2004ModelCB,
  title={Model composition by lagrange polynomial approximation for robust speech recognition in noisy environment},
  author={Chandra Kant Raut and Takuya Nishimoto and Shigeki Sagayama},
  booktitle={INTERSPEECH},
  year={2004}
}
This paper presents a technique for estimating HMM model parameters for noisy speech from given clean speech HMM and noise HMM. The model parameters are estimated by approximating the non-linear function governing the relationship between speech and noise, by a Lagrange polynomial, and thus enabling the distribution of corrupted speech parameters to have a closed form. The method is computationally efficient, and the experimental results showed significant improvement in recognition performance… CONTINUE READING

References

Publications referenced by this paper.
SHOWING 1-10 OF 13 REFERENCES

Neural-network-based HMM adaptation for noisy speech

  • 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
  • 2001
VIEW 11 EXCERPTS
HIGHLY INFLUENTIAL

A vector Taylor series approach for environment-independent speech recognition

  • 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
  • 1996
VIEW 5 EXCERPTS
HIGHLY INFLUENTIAL

Robust continuous speech recognition using parallel model combination

  • IEEE Trans. Speech and Audio Processing
  • 1996
VIEW 9 EXCERPTS
HIGHLY INFLUENTIAL

A fast and flexible implementation of parallel model combination

  • 1995 International Conference on Acoustics, Speech, and Signal Processing
  • 1995
VIEW 9 EXCERPTS
HIGHLY INFLUENTIAL

A Comparative Study of Approximations for Parallel Model Combination of Static and Dynamic Parameters

Y. Gong
  • Proc. ICSLP02, pp. 1029-1032, 2002.
  • 2002
VIEW 1 EXCERPT

Jacobian approach to fast acoustic model adaptation

  • 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1997
VIEW 1 EXCERPT

Similar Papers