Improving Term Frequency Normalization for Multi-topical Documents, and Application to Language Modeling Approaches

@inproceedings{Na2008ImprovingTF,
  title={Improving Term Frequency Normalization for Multi-topical Documents, and Application to Language Modeling Approaches},
  author={Seung-Hoon Na and In-Su Kang and Jong-Hyeok Lee},
  booktitle={ECIR},
  year={2008}
}
Term frequency normalization is a serious issue since lengt hs of documents are various. Generally, documents become long due to two different reasons verbosity and multi-topicality. First, verbosity me ans that the same topic is repeatedly mentioned by terms related to the topic, so that t erm frequency is more increased than the well-summarized one. Second, multi-top icality indicates that a document has a broad discussion of multi-topics, rather th an single topic. Although these document… CONTINUE READING
Highly Cited
This paper has 22 citations. REVIEW CITATIONS
Related Discussions
This paper has been referenced on Twitter 3 times. VIEW TWEETS

From This Paper

Figures, tables, and topics from this paper.

Similar Papers

Loading similar papers…