Cross-Language Information Retrieval by Domain Restriction Using Web Directory Structure

Abstract

In this paper, we propose a cross-language information retrieval (CLIR) method based on estimating for domains of the query using hierarchic structures of Web directories. To get the most appropriate translation of the queries, we utilize the Web directories written in many different languages as multilingual corpus for disambiguating translation of the query and for estimating the domain of search results using hierarchic structures of Web directories. From experimental evaluations, we found that there is an advantage in retrieval accuracy using our proposal for disambiguating translation in CLIR system. We found that it is effective to restrict to target fields of the query using lower level merged categories in order to acquire suited translation of the query.

DOI: 10.1109/HICSS.2008.108

Extracted Key Phrases

9 Figures and Tables

Cite this paper

@inproceedings{Kimura2008CrossLanguageIR, title={Cross-Language Information Retrieval by Domain Restriction Using Web Directory Structure}, author={Fuminori Kimura and Akira Maeda and Kenji Hatano and Jun Miyazaki and Shunsuke Uemura}, booktitle={HICSS}, year={2008} }