Interpreting TF-IDF term weights as making relevance decisions

@article{Wu2008InterpretingTT,
title={Interpreting TF-IDF term weights as making relevance decisions},
author={H. C. Wu and R. Luk and K. Wong and K. Kwok},
journal={ACM Trans. Inf. Syst.},
year={2008},
volume={26},
pages={13:1-13:37}
}

A novel probabilistic retrieval model is presented. [...] Key Method Our novel retrieval model is simplified to a basic ranking formula that directly corresponds to the TF-IDF term weights. In general, we show that the term-frequency factor of the ranking formula can be rendered into different term-frequency factors of existing retrieval systems. In the basic ranking formula, the remaining quantity - log p(&rmacr;|t ∈ d) is interpreted as the probability of randomly picking a nonrelevant usage (denoted by…Expand Abstract