YLAB@RU at Spoken Term Detection Task in NTCIR-10 SpokenDoc-2

Abstract

This paper describes improvement of the STD method which is based on the vector quantization (VQ). Spoken documents are represented as sequences of VQ codes, and they are matched with a text query to be detected based on the V-P score which measures the relationship between a VQ code and a phoneme. The matching score between VQ codes and phonemes is calculated after normalization for each phoneme in a query term to avoid biased scoring particular phonemes. 1. Objectives -To improve the detection performance of unknown words for the spoken term detection (STD). 2. STD Method Based on Vector Quantization To represent spoken documents as sequences of VQ codes . > Conventional methods represent spoken documents as sequences of sub-words, such as phonemes, to detect unknown words.

Extracted Key Phrases

1 Figure or Table

Cite this paper

@inproceedings{Sakamoto2013YLABRUAS, title={YLAB@RU at Spoken Term Detection Task in NTCIR-10 SpokenDoc-2}, author={Iori Sakamoto and Kook Cho and Masanori Morise and Yoichi Yamashita}, booktitle={NTCIR}, year={2013} }