Automatic Morphological Query Expansion Using Analogy-Based Machine Learning


Information retrieval systems (IRSs) often fail to recognize the same idea when it is expressed in different forms. One way to improve these systems is to take morphological variants into account. We propose a simple yet effective method that recognizes these variants and then uses them to enrich queries. Unlike previously published methods, our system needs no external resources or a priori knowledge and can therefore be applied to many languages. The approach is evaluated on several collections in six languages and is compared with existing tools such as a stemmer and a lemmatizer. The reported results show a significant and systematic improvement in the efficiency of the whole IRS, in terms of both precision and recall, for every language.
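The abstract does not spell out the learning procedure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that an "analogy" can be approximated by a suffix rewrite shared by several word pairs (connect : connected :: walk : walked), learned from the collection vocabulary alone. All function names, the toy vocabulary, and the thresholds `min_stem` and `min_support` are hypothetical choices made for illustration.

```python
from collections import Counter


def split_affixes(a, b, min_stem=3):
    """Return the (suffix_a, suffix_b) rewrite linking a and b, or None.

    Assumption: two words are candidates only if they share a common
    prefix (used as a stem) of at least min_stem characters.
    """
    i = 0
    while i < min(len(a), len(b)) and a[i] == b[i]:
        i += 1
    if i < min_stem:
        return None
    return a[i:], b[i:]


def learn_rules(vocabulary, min_support=2):
    """Keep suffix rewrites observed for at least min_support word pairs.

    No external resource is used: the rules come from the vocabulary only.
    The pairwise loop is quadratic, which is fine for a toy example.
    """
    counts = Counter()
    words = sorted(vocabulary)
    for x in words:
        for y in words:
            if x != y:
                rule = split_affixes(x, y)
                if rule is not None:
                    counts[rule] += 1
    return {rule for rule, n in counts.items() if n >= min_support}


def expand_term(term, vocabulary, rules, min_stem=3):
    """Apply each learned rewrite to term; keep results found in the vocabulary."""
    variants = set()
    for old, new in rules:
        stem_len = len(term) - len(old)
        if term.endswith(old) and stem_len >= min_stem:
            candidate = term[:stem_len] + new
            if candidate != term and candidate in vocabulary:
                variants.add(candidate)
    return variants


def expand_query(query, vocabulary, rules):
    """Return the query terms, each followed by its attested variants."""
    expanded = []
    for term in query.lower().split():
        expanded.append(term)
        expanded.extend(sorted(expand_term(term, vocabulary, rules)))
    return expanded


# Toy vocabulary (hypothetical, for illustration only).
vocab = {"connect", "connected", "connecting",
         "walk", "walked", "walking", "retrieval"}
rules = learn_rules(vocab)
print(expand_query("walking retrieval", vocab, rules))
```

In this sketch a rewrite is kept only when at least two word pairs support it, and a generated variant is added to the query only when it actually occurs in the vocabulary. A real system would need stricter filtering and a more efficient pair search than this quadratic loop.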

DOI: 10.1007/978-3-540-71496-5_22

Cite this paper

@inproceedings{Moreau2007AutomaticMQ, title={Automatic Morphological Query Expansion Using Analogy-Based Machine Learning}, author={Fabienne Moreau and Vincent Claveau and Pascale S{\'e}billot}, booktitle={ECIR}, year={2007} }