An Investigation of Dirichlet Prior Smoothing’s Performance Advantage

@inproceedings{Smucker2005AnIO,
  title={An Investigation of Dirichlet Prior Smoothing's Performance Advantage},
  author={Mark D. Smucker and James Allan},
  year={2005}
}
In the language modeling approach to information retrieval, Dirichlet prior smoothing frequently outperforms fixed linear interpolated (also known as Jelinek-Mercer) smoothing. The only difference between the two methods is that Dirichlet prior smoothing determines the amount of smoothing based on a document’s length. Our hypothesis was that Dirichlet prior smoothing has an implicit document prior that favors longer documents. We tested our hypothesis by first calculating a…
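
The difference between the two smoothing methods described in the abstract can be illustrated with a minimal sketch. It uses the standard textbook forms of both estimators and is not taken from the paper's own code. The function names, the parameter names (`tf`, `doc_len`, `p_coll`, `lam`, `mu`), and the default values are assumptions chosen for illustration only.

```python
def jelinek_mercer(tf, doc_len, p_coll, lam=0.5):
    """Fixed linear interpolation: the collection weight `lam` is the
    same for every document, regardless of its length."""
    p_ml = tf / doc_len if doc_len > 0 else 0.0  # maximum-likelihood estimate
    return (1.0 - lam) * p_ml + lam * p_coll


def dirichlet_prior(tf, doc_len, p_coll, mu=1000.0):
    """Dirichlet prior smoothing: `mu` pseudo-counts drawn from the
    collection model are added to the document's term counts."""
    return (tf + mu * p_coll) / (doc_len + mu)


def dirichlet_lambda(doc_len, mu=1000.0):
    """Collection weight that Dirichlet prior smoothing implies for a
    document of length `doc_len`; it shrinks as the document grows."""
    return mu / (doc_len + mu)


if __name__ == "__main__":
    # Illustrative values (assumed): a term seen 3 times, collection prob. 0.001.
    for doc_len in (100, 1000, 10000):
        lam = dirichlet_lambda(doc_len)
        print(doc_len, round(lam, 4),
              dirichlet_prior(3, doc_len, 0.001),
              jelinek_mercer(3, doc_len, 0.001, lam=lam))
```

Under these assumptions, Dirichlet prior smoothing gives the same estimate as linear interpolation with a per-document weight of `mu / (doc_len + mu)`, so longer documents are smoothed less. That length dependence is the single difference the abstract refers to.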
This paper has 37 citations.