On Relevance, Probabilistic Indexing and Information Retrieval

@article{Maron1960OnRP,
  title={On Relevance, Probabilistic Indexing and Information Retrieval},
  author={M. E. Maron and J. L. Kuhns},
  journal={J. ACM},
  year={1960},
  volume={7},
  pages={216--244}
}
This paper reports on a novel technique for literature indexing and searching in a mechanized library system. The notion of relevance is taken as the key concept in the theory of information retrieval and a comparative concept of relevance is explicated in terms of the theory of probability. The resulting technique, called “Probabilistic Indexing,” allows a computing machine, given a request for information, to make a statistical inference and derive a number (called the “relevance number”) for… 

A Survey on Important Aspects of Information Retrieval

A comprehensive survey that discusses not only the emergence and evolution of information retrieval but also different information retrieval models and some important aspects such as document representation, similarity measures and query expansion.

Statistical Association Methods for Mechanized Documentation: Symposium Proceedings, Washington 1964

This volume contains 22 of the papers included in the program, the abstracts of 4 additional papers that were presented, and the text of the talk given by R. M. Hayes at the banquet.

A Review on Important Aspects of Information Retrieval

This paper presents a comprehensive study that discusses not only the emergence and evolution of information retrieval but also different information retrieval models and some important aspects such as document representation, similarity measures and query expansion.

Contextualized access to distributed and heterogeneous multimedia data sources. (Accès contextualisé aux sources de données multimédias distribuées et hétérogènes)

The main contribution of this thesis is the construction of a network of Content Based Image Retrieval systems that are able to extract and exploit the information about an input image’s semantic concept.

Unsupervised Modeling of Multiple Data Sources : A Latent Shared Subspace Approach

This paper aims to provide a history of web exceptionalism from 1989 to 2002, a period chosen in order to explore its roots as well as specific cases up to and including the year in which descriptions of “Web 2.0” began to circulate.

A new document ranking model based on correlation between terms. (Um novo modelo de ordenação de documentos baseados em correlação entre termos)

A new approach to document ranking based on the vector space model provides a simple, effective, efficient and parameterized way to process disjunctive and conjunctive queries, and is quite effective and computationally feasible for generic collections.

A new unified probabilistic model

A new unified probabilistic model is introduced that does not require data to be available for the particular document or query in question, but can utilize such specific data if it is available; the expression of its probabilities is straightforward.

A probabilistic model of information and retrieval: development and status

The model is presented from its foundations through its logical development to cover more aspects of retrieval data and a wider range of system functions. Each step in the argument is matched by comparative retrieval tests, providing a single coherent account of a major line of research.

A network approach to probabilistic information retrieval

This paper shows how probabilistic information retrieval based on document components may be implemented as a feedforward (feedbackward) artificial neural network. Performance with feedback improves substantially over no feedback, and further gains are obtained when queries are expanded with terms from the feedback documents.
...