Effect of term distributions on centroid-based text categorization

Abstract

Most of traditional text categorization approaches utilize term frequency (tf) and inverse document frequency (idf) for representing importance of words and/or terms in classifying a text document. This paper describes an approach to apply term distributions, in addition to tf and idf, to improve performance of centroid-based text categorization. Three… (More)
DOI: 10.1016/j.ins.2003.07.007

Topics

8 Figures and Tables

Statistics

0510'05'06'07'08'09'10'11'12'13'14'15'16'17'18
Citations per Year

55 Citations

Semantic Scholar estimates that this publication has 55 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Lertnattee2004EffectOT, title={Effect of term distributions on centroid-based text categorization}, author={Verayuth Lertnattee and Thanaruk Theeramunkong}, journal={Inf. Sci.}, year={2004}, volume={158}, pages={89-115} }