Using DragPushing to Refine Concept Index for Text Categorization

Xueqi Cheng, Songbo Tan, Lilian Tang
Journal of Computer Science and Technology
Concept index (CI) is a fast and efficient feature extraction (FE) algorithm for text classification. The key idea of the CI scheme is to express each document as a function of the concepts (centroids) present in the collection. However, the ability of the centroids to represent a corpus for categorization is often degraded by so-called model misfit, which is caused by a number of factors in the FE process, ranging from feature selection to the similarity measure. In order to address this issue, this work…
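
To make the idea of expressing a document as a function of centroids concrete, here is a minimal sketch in Python. It rests on assumptions that the abstract does not state: each centroid is taken to be the normalized mean of the unit-length document vectors of one class, and the reduced representation is taken to be the cosine similarity of a document to each centroid. The function names (`normalize`, `class_centroids`, `concept_index`) and the toy data are hypothetical, and the sketch covers only the plain concept index, not the DragPushing refinement.

```python
import math
from collections import defaultdict


def normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def class_centroids(docs, labels):
    """Assumed centroid definition: normalized mean of each class's unit vectors."""
    groups = defaultdict(list)
    for vec, label in zip(docs, labels):
        groups[label].append(normalize(vec))
    centroids = {}
    for label, vecs in groups.items():
        mean = [sum(col) / len(vecs) for col in zip(*vecs)]
        centroids[label] = normalize(mean)
    return centroids


def concept_index(doc, centroids):
    """Map a document to one feature per concept: its cosine similarity to that centroid."""
    unit = normalize(doc)
    return {
        label: sum(a * b for a, b in zip(unit, centroid))
        for label, centroid in centroids.items()
    }


# Toy term-frequency vectors over a four-word vocabulary (illustrative only).
docs = [[2, 1, 0, 0], [1, 2, 0, 0], [0, 0, 1, 2], [0, 0, 2, 1]]
labels = ["sports", "sports", "finance", "finance"]

centroids = class_centroids(docs, labels)
reduced = concept_index([3, 1, 0, 1], centroids)
predicted = max(reduced, key=reduced.get)
```

Under these assumptions a document over four terms is reduced to two features, one per class, and the largest feature can serve directly as a centroid-based class prediction.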