• Corpus ID: 16218354

Information Filtering and Retrieval : An Overview

Colm O'Riordan, Humphrey Sorensen
The areas of information retrieval (IR) and information filtering (IF) have become very active research domains. The problems created by the large increase in available online information, of which the vast majority is largely unstructured, have accentuated the need for effective mechanisms to separate the relevant information from the irrelevant. This paper reviews the main approaches and systems used in IR and in the newer field of IF. The paper also includes an overview of systems which utilise… 

A Tutorial on Information Filtering Concepts and Methods for Bio-medical Searching
Information retrieval and information filtering are two major information access techniques that help searchers efficiently find the information they really need and avoid irrelevant information that does not match their interests.
An approach for the capture of context-dependent document relationships extracted from Bayesian analysis of users' interactions with information
The results indicate that the approach provides a useful method for the establishment of identifiable relationships between documents based on the context of their usage, rather than their content.
Chapter 1: Introduction Chapter 2: Text Mining 2.1.1 Vector Space Model 2.1.2 More Sophisticated Representations
  • Computer Science
This thesis deals with one area in text mining, namely text categorization, which is the process of automatically labeling unlabeled text documents by their corresponding categories based entirely on their content.
Text Classification Algorithms: A Survey
An overview of text classification algorithms is presented, covering different text feature extraction techniques, dimensionality reduction methods, existing algorithms and techniques, and evaluation methods.
A user profile for information filtering using RFID-SIM card in pervasive network
  • A. Osman, T. Mantoro
  • Computer Science
    2011 International Conference on Multimedia Computing and Systems
  • 2011
A pervasive-network-based information filtering system is proposed that integrates user profile data, such as identity, preferences and other important data, embedded in an RFID-SIM card in order to guarantee its privacy, flexibility, mobility and confidentiality.
Self-learning Semantic-distance-based Answering System with Automatic Morpheme Recognition
A new definition of the meaning of a sentence for a human is offered, and a simple technique for searching for sentences having meanings close to the meaning of a given sentence is presented.
L'expression du problème dans la recherche d'informations : application à un contexte d'intermédiation territoriale
Territorial Intelligence is a concept that recently emerged in France. We have identified it as the combination of Economic Intelligence and Knowledge Management actions applied to a…
Product Recommendation System
This document summarizes current capabilities, research and operational priorities, and plans for further studies that were established at the 2015 USGS workshop on quantitative hazard assessments of earthquake-triggered landsliding and liquefaction in the Central American region.
Mineração de Textos: Detecção automática de sentimentos em comentários nas mídias sociais
Advances in automatic document analysis techniques have made it possible to recognize subjective aspects in texts extracted from social media. The objective of this work was to implement a…


I3R: A new approach to the design of document retrieval systems
The system described in this article, I3R, provides a number of facilities and search strategies based on a detailed specification of the user's information need and uses a novel architecture to allow more than one system facility to be used at a given stage of a search session.
I3R: a new approach to the design of document retrieval systems
A system that provides a number of facilities and search strategies based on an emphasis on domain knowledge used for refining the model of the information need, and the provision of a browsing mechanism that allows the user to navigate through the knowledge base.
The INQUERY Retrieval System
A retrieval system (INQUERY) that is based on a probabilistic retrieval model and provides support for sophisticated indexing and complex query formulation is described.
New methods for relevance feedback: improving information retrieval performance
It is shown how new computational models such as genetic algorithms and connectionist processing can be used to further improve relevance feedback techniques and, ultimately, retrieval performance.
Automatic Query Expansion Using SMART: TREC 3
This work continues the work in TREC 3, performing runs in the routing, ad-hoc, and foreign language environments, with a major focus on massive query expansion, adding from 300 to 530 terms to each query.
Evaluation of an inference network-based retrieval model
Network representations show promise as mechanisms for inferring probable relationships between documents and queries and have been used in information retrieval since at least the early 1960s.
An evaluation of retrieval effectiveness for a full-text document-retrieval system
An evaluation of a large, operational full-text document-retrieval system shows the system to be retrieving less than 20 percent of the documents relevant to a particular search.
The probability ranking principle in IR
It is shown that the principle that documents should be ranked in order of the probability of relevance or usefulness can be justified under certain assumptions, but that in cases where these assumptions do not hold, the principle is not valid.
SIFT - a Tool for Wide-Area Information Dissemination
SIFT's approach to user interest modeling and user-server communication is presented, along with an empirical study of SIFT's performance that examines its main memory requirement and its ability to scale with information volume and user population.
Document Retrieval and Routing Using the INQUERY System
In the TREC experiments this year, a number of new techniques were introduced for both the ad-hoc retrieval and routing runs, and experiments with Spanish retrieval were carried out.