SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering

@article{Ahmed2009SISCAT,
  title={SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering},
  author={Mohammad Salim Ahmed and Latifur Khan},
  journal={2009 IEEE International Conference on Data Mining Workshops},
  year={2009},
  pages={1-6}
}
Text classification poses some specific challenges. One such challenge is its high dimensionality where each document (data point) contains only a small subset of them. In this paper, we propose Semi-supervised Impurity based Subspace Clustering (SISC) in conjunction with k-Nearest Neighbor approach, based on semi-supervised subspace clustering that considers the high dimensionality as well as the sparse nature of them in text data. SISC finds clusters in the subspaces of the high dimensional… CONTINUE READING