A hierarchical system for word discovery exploiting DTW-based initialization

@article{Walter2013AHS,
  title={A hierarchical system for word discovery exploiting DTW-based initialization},
  author={Oliver Walter and Timo Korthals and Reinhold H{\"a}b-Umbach and Bhiksha Raj},
  journal={2013 IEEE Workshop on Automatic Speech Recognition and Understanding},
  year={2013},
  pages={386-391}
}
Discovering the linguistic structure of a language solely from spoken input asks for two steps: phonetic and lexical discovery. The first is concerned with identifying the categorical subword unit inventory and relating it to the underlying acoustics, while the second aims at discovering words as repeated patterns of subword units. The hierarchical approach presented here accounts for classification errors in the first stage by modelling the pronunciation of a word in terms of subword units… CONTINUE READING
Highly Cited
This paper has 30 citations. REVIEW CITATIONS

From This Paper

Results and topics from this paper.

Key Quantitative Results

  • This improved initialization, using only weak supervision, has led to a 40% reduction in word error rate on a digit recognition task.

Citations

Publications citing this paper.
Showing 1-10 of 21 extracted citations

Unsupervised Word Segmentation and Lexicon Discovery Using Acoustic Word Embeddings

IEEE/ACM Transactions on Audio, Speech, and Language Processing • 2016
View 6 Excerpts
Highly Influenced

An embedded segmental K-means model for unsupervised segmentation and clustering of speech

2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) • 2017
View 1 Excerpt

Unsupervised learning for spoken word production based on simultaneous word and phoneme discovery without transcribed data

2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) • 2017

References

Publications referenced by this paper.
Showing 1-10 of 13 references

A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition

2013 IEEE International Conference on Acoustics, Speech and Signal Processing • 2013
View 2 Excerpts

Weak top-down constraints for unsupervised acoustic model training

2013 IEEE International Conference on Acoustics, Speech and Signal Processing • 2013

Partial sequence matching using an Unbounded Dynamic Time Warping algorithm

2010 IEEE International Conference on Acoustics, Speech and Signal Processing • 2010
View 1 Excerpt

Similar Papers

Loading similar papers…