On Relevance, Probabilistic Indexing and Information Retrieval
@article{Maron1960OnRP, title={On Relevance, Probabilistic Indexing and Information Retrieval}, author={M. E. Maron and J. L. Kuhns}, journal={J. ACM}, year={1960}, volume={7}, pages={216-244} }
This paper reports on a novel technique for literature indexing and searching in a mechanized library system. The notion of relevance is taken as the key concept in the theory of information retrieval and a comparative concept of relevance is explicated in terms of the theory of probability. The resulting technique called “Probabilistic Indexing,” allows a computing machine, given a request for information, to make a statistical inference and derive a number (called the “relevance number”) for…
999 Citations
A Survey on Important Aspects of Information Retrieval
- Computer Science
- 2013
A comprehensive survey discussing not only the emergence and evolution of information retrieval but also include different information retrieval models and some important aspects such as document representation, similarity measure and query expansion.
Statistical Association Methods for Mechanized Documentation: Symposium Proceedings, Washington 1964
- Computer Science
- 2017
This volume contains 22 of the papers included in the program, the abstracts of 4 additional papers that were presented, and the text of the talk given by R. M. Hayes at the banquet.
A Review on Important Aspects of Information Retrieval
- Computer Science
- 2014
This paper presents a comprehensive study, which discusses not only emergence and evolution of information retrieval but also includes different information retrieval models and some important aspects such as document representation, similarity measure and query expansion.
Contextualized access to distributed and heterogeneous multimedia data sources. (Accès contextualisé aux sources de données multimédias distribuées et hétérogènes)
- Computer Science
- 2014
The main contribution of this thesis is the construction of a network of Content Based Image Retrieval systems that are able to extract and exploit the information about an input image’s semantic concept.
Unsupervised Modeling of Multiple Data Sources : A Latent Shared Subspace Approach
- Computer Science
- 2011
This paper aims to provide a history of web exceptionalism from 1989 to 2002, a period chosen in order to explore its roots as well as specific cases up to and including the year in which descriptions of “Web 2.0” began to circulate.
Um novo modelo de ordenação de documentos baseados em correlação entre termos
- Physics, Computer Science
- 2005
Uma nova abordagem para a ordenacao de documentos a partir do modelo de espaco vetorial prove uma forma simples, efetiva, eficiente e parametrizada para o processamento of consultas disjuntivas, conjuntivas e bastante efetivas e computacionalmente viavel para colecoes genericas.
Guest Editorial: Spaces, Logic, and Link Analysis in IR: Recent Advances From A Mathematical and Logical Perspective
- Computer ScienceInformation Retrieval
- 2005
A new unified probabilistic model
- Computer ScienceJ. Assoc. Inf. Sci. Technol.
- 2004
A new unified probabilistic model is introduced that does not require that data must be available for the particular document or query in question, but it can utilize such specific data if it is available and the expression of its probabilities is straightforward.
A probabilistic model of information and retrieval: development and status
- Computer Science
- 1998
The model from its foundations through its logical development to cover more aspects of retrieval data and a wider range of system functions is presented, and each step in the argument is matched by comparative retrieval tests, to provide a single coherent account of a major line of research.
A network approach to probabilistic information retrieval
- Computer ScienceTOIS
- 1995
How probabilistic information retrieval based on document components may be implemented as a feedforward (feedbackward) artificial neural network is shown and performance of feedback improves substantially over no feedback, and further gains are obtained when queries are expanded with terms from the feedback documents.