Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search

@inproceedings{Yuan2018LearningAW,
  title={Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search},
  author={Yougen Yuan and Cheung-Chi Leung and Lei Xie and Hongjie Chen and Bin Ma and Haizhou Li},
  booktitle={Interspeech},
  year={2018}
}
We propose to learn acoustic word embeddings with temporal context for query-by-example (QbE) speech search. The temporal context includes the leading and trailing word sequences of a word. We assume that there exist spoken word pairs in the training database. We pad the word pairs with their original temporal context to form fixed-length speech segment pairs. We obtain the acoustic word embeddings through a deep convolutional neural network (CNN) which is trained on the speech segment pairs… CONTINUE READING

References

Publications referenced by this paper.
Showing 1-10 of 25 references

Similar Papers

Loading similar papers…