Interpreting TF-IDF term weights as making relevance decisions

@article{Wu2008InterpretingTT,
  title={Interpreting TF-IDF term weights as making relevance decisions},
  author={H. C. Wu and R. Luk and K. Wong and K. Kwok},
  journal={ACM Trans. Inf. Syst.},
  year={2008},
  volume={26},
  pages={13:1-13:37}
}
  • H. C. Wu, R. Luk, +1 author K. Kwok
  • Published 2008
  • Computer Science
  • ACM Trans. Inf. Syst.
  • A novel probabilistic retrieval model is presented. [...] Key Method Our novel retrieval model is simplified to a basic ranking formula that directly corresponds to the TF-IDF term weights. In general, we show that the term-frequency factor of the ranking formula can be rendered into different term-frequency factors of existing retrieval systems. In the basic ranking formula, the remaining quantity - log p(&rmacr;|t ∈ d) is interpreted as the probability of randomly picking a nonrelevant usage (denoted by…Expand Abstract
    558 Citations
    Evaluating a Novel Kind of Retrieval Models Based on Relevance Decision Making in a Relevance Feedback Environment
    New document-context term weights and clustering for information retrieval
    A context‐dependent relevance model
    • 5
    A retrospective study of a hybrid document-context based retrieval model
    • 25
    • PDF
    Investigating Passage-level Relevance and Its Role in Document-level Relevance Judgment
    • 10
    • PDF
    Semantic Sensitive TF-IDF to Determine Word Relevance in Documents
    • 2
    • PDF
    A split-list approach for relevance feedback in information retrieval
    • 3
    A Match-Transformer Framework for Modeling Diverse Relevance Patterns in Ad-hoc Retrieval
    Analysis of TF-IDF Model and its Variant for Document Retrieval
    • 22

    References

    SHOWING 1-4 OF 4 REFERENCES
    The probability ranking principle in IR
    • 967
    • Highly Influential
    Semantic Information Retrieval
    • 19
    • Highly Influential
    Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters
    • C. Zahn
    • Computer Science
    • IEEE Transactions on Computers
    • 1971
    • 1,720
    • Highly Influential
    • PDF
    Modern Information Retrieval
    • 1999