• Corpus ID: 2904992

Using WordNet in a Knowledge-Based Approach to Information Retrieval

  title={Using WordNet in a Knowledge-Based Approach to Information Retrieval},
  author={Raymond Richardson and Alan F. Smeaton},
The application of natural language processing tools and techniques to information retrieval tasks has long since been identified as potentially useful for the quality of information retrieval. Traditionally, IR has been based on matching words or terms in a query with words or terms in a document. In this paper we introduce an approach to IR based on computing a semantic distance measurement between concepts or words and using this word distance to compute a similarity between a query and a… 

Figures and Tables from this paper

Experiments on using semantic distances between words in image caption retrieval
This paper describes the first successful application of semantically-based word-word similarities, work described elsewhere, to an information retrieval application, specifically the indexing and retrieval of captions describing the content of images.
Concept Based Query Expansion Using WordNet
This paper proposed a new query expansion technique using the comprehensive thesaurus WordNet and its semantic relatedness measure modules, demonstrating a 7% precision improvement over retrieval methods not employing query expansion techniques.
Improving Information Retrieval Performance by Combining Different Text-Mining Techniques
A method of making WordNet more useful in information retrieval applications by combining it with other knowledge resources is proposed and a simple word sense disambiguation is performed to avoid misleading expansion terms.
Combining General Hand-Made and Automatically Constructed Thesauri for Query Expansion in Information Retrieval
This paper investigates why the use of WordNet in information retrieval has not been successful and proposes a method to combine WordNet with predicateargument-based and co-occurrence-based automatically constructed thesauri.
An effective approach to document retrieval via utilizing WordNet and recognizing phrases
This work utilizes WordNet to disambiguate word senses of query terms and shows that its approach yields between 23% and 31% improvements over the best-known results on the TREC 9, 10 and 12 collections for short (title only) queries, without using Web data.
Textual Document Indexing and Retrieval via Knowledge Sources and Data Mining
A knowledge-based query expansion technique to improve document retrieval effectiveness and a new phrase-based indexing technique that combines concepts with word stems are developed.
Exploiting Thesauri and Hierarchical Categories in Cross-Language Information Retrieval
This paper proposes a model, which uses multiple sources for query reformulation and expansion to select expansion terms and retrieve information needed by a user, and evaluated the effectiveness of the proposed combined method using an application of a French-English information retrieval.
Cross-Language Information Retrieval Using Multiple Resources and Combinations for Query Expansion
A model using multiple sources for query reformulation and expansion to select expansion terms and retrieve information needed by a user is proposed, and a new feedback strategy based on the extraction of domain keywords to expand user's query is introduced and evaluated.
Exploiting salient semantic analysis for information retrieval
This paper studies how to efficiently use SSA to improve the information retrieval performance, and proposes a SSA-based retrieval method under the language model framework that consistently outperform the existing Wikipedia-based retrieved methods.
Semantic similarity methods in wordNet and their application to information retrieval on the web
A novel information retrieval method is proposed that is capable of detecting similarities between documents containing semantically similar but not necessarily lexicographically similar terms.


A semantic-based approach to information processing
Results indicate firstly that the proposed semantic information processing system is capable of being used as a controlled vocabulary and secondly that the approaches to estimating semantic similarity operate well at their intended concept level.
Query expansion using lexical-semantic relations
Examination of the utility of lexical query expansion in the large, diverse TREC collection shows this query expansion technique makes little difference in retrieval effectiveness if the original queries are relatively complete descriptions of the information being sought even when the concepts to be expanded are selected by hand.
Information Retrieval Based on Conceptual Distance in is-a Hierarchies
There have been several document ranking methods to calculate the conceptual distance or closeness between a Boolean query and a document. Though they provide good retrieval effectiveness in many
Word sense disambiguation for free-text indexing using a massive semantic network
This work investigates using the massive Word Net semantic network for disambiguation during document indexing to improve precision and improvement in disamblguation compared with chance.
A Model of Knowledge Based Information Retrieval with Hierarchical Concept Graph
The proposed model computes the conceptual distance between a query and an object and both are indexed with weighted terms from a hierarchical thesaurus by allowing the index term and the edge of the HCG to be weighted.
Word sense disambiguation and information retrieval
From these results it has become clear that more basic research is needed to investigate the relationship between sense ambiguity, disambiguation, and IR.
A unified approach to automatic indexing and information retrieval
WorldViews, an experimental system that unites as many aspects of information organization as possible around a simple, familiar, yet versatile knowledge representation framework, is discussed. This
Introduction to WordNet: An On-line Lexical Database
Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list.
Selection and information: a class-based approach to lexical relationships
A new, information-theoretic account of selectional constraints is proposed, which assumes that lexical items are organized in a conceptual taxonomy according to class membership, where classes are defined simply as sets rather than in terms of explicit features or properties.
Development and application of a metric on semantic nets
Experiments in which distance is applied to pairs of concepts and to sets of concepts in a hierarchical knowledge base show the power of hierarchical relations in representing information about the conceptual distance between concepts.