Effect of term distributions on centroid-based text categorization


Most of traditional text categorization approaches utilize term frequency (tf) and inverse document frequency (idf) for representing importance of words and/or terms in classifying a text document. This paper describes an approach to apply term distributions, in addition to tf and idf, to improve performance of centroid-based text categorization. Three… (More)
DOI: 10.1016/j.ins.2003.07.007


8 Figures and Tables


Citations per Year

55 Citations

Semantic Scholar estimates that this publication has 55 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Lertnattee2004EffectOT, title={Effect of term distributions on centroid-based text categorization}, author={Verayuth Lertnattee and Thanaruk Theeramunkong}, journal={Inf. Sci.}, year={2004}, volume={158}, pages={89-115} }