Integration of Collocation Statistics into the Probabilistic Retrieval Model

  title={Integration of Collocation Statistics into the Probabilistic Retrieval Model},
  author={Olga Vechtomova and S. D. T. Robertson},
The paper presents a method of combining corpus information on word collocations with the probabilistic model of information retrieval. Corpus term dependencies are used to modify the probabilistic retrieval based on the term independence assumption. Collocates are derived from windows around term occurrences in the corpus. Statistical measures of mutual information and Z score are applied to select significantly associated collocates which are later used in query expansion. The results of the… CONTINUE READING
8 Citations
15 References
Similar Papers


Publications referenced by this paper.
Showing 1-10 of 15 references

A Probabilistic Model of Information Retrieval: Development and Status

  • K. Sparck Jones, S. Walker, S. Robertson
  • University of Cambridge Computer Laboratory…
  • 1998
Highly Influential
7 Excerpts

Overview of the Okapi Projects

  • S. Robertson
  • Journal of Documentation,
  • 1997
1 Excerpt

A Method for Refining AutomaticallyDiscovered Lexical Relations : Combining Weak Techniques for Stronger Results

  • M. Hearst, G. Grefenstette
  • 1992

Similar Papers

Loading similar papers…