Vocal Tract Length Perturbation (VTLP) improves speech recognition

  title={Vocal Tract Length Perturbation (VTLP) improves speech recognition},
  author={Navdeep Jaitly},
Augmenting datasets by transforming inputs in a way that does not change the label is a crucial ingredient of the state of the art methods for object recognition using neural networks. However this approach has (to our knowledge) not been exploited successfully in speech recognition (with or without neural networks). In this paper we lay the foundation for this approach, and show one way of augmenting speech datasets by transforming spectrograms, using a random linear warping along the… CONTINUE READING
Highly Influential
This paper has highly influenced 10 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 152 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 87 citations

153 Citations

Citations per Year
Semantic Scholar estimates that this publication has 153 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 10 references