Representing document as dependency graph for document clustering

Yujing Wang, Xiaochuan Ni, Jian-Tao Sun, Yunhai Tong, Zheng Chen
In traditional clustering methods, a document is often represented as a "bag of words" (in the BOW model) or as n-grams (in the suffix tree document model), without considering the natural language relationships between the words. In this paper, we propose a novel approach, DGDC (Dependency Graph-based Document Clustering), to address this issue. In our algorithm, each document is represented as a dependency graph in which the nodes correspond to words, which can be seen as meta-descriptions of the…
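To make the contrast between the two representations concrete, here is a minimal sketch in Python. It is not the DGDC algorithm itself: the abstract only states that nodes correspond to words, so the labeled edges, the hand-written (head, relation, dependent) triples, and the function names `bag_of_words` and `dependency_graph` are all assumptions made for illustration. In practice the triples would come from a dependency parser.

```python
from collections import Counter


def bag_of_words(tokens):
    """Word counts only; relationships between words are discarded."""
    return Counter(token.lower() for token in tokens)


def dependency_graph(triples):
    """Map each word (node) to its outgoing labeled edges.

    `triples` is an iterable of (head, relation, dependent) tuples.
    This input format is an assumption of the sketch, not the paper's.
    """
    graph = {}
    for head, relation, dependent in triples:
        head, dependent = head.lower(), dependent.lower()
        graph.setdefault(head, set()).add((relation, dependent))
        graph.setdefault(dependent, set())  # make sure the dependent is a node
    return graph


# Two toy "documents" with the same words but different relationships.
tokens_a = ["dog", "bites", "man"]
triples_a = [("bites", "nsubj", "dog"), ("bites", "obj", "man")]

tokens_b = ["man", "bites", "dog"]
triples_b = [("bites", "nsubj", "man"), ("bites", "obj", "dog")]

same_bow = bag_of_words(tokens_a) == bag_of_words(tokens_b)
same_graph = dependency_graph(triples_a) == dependency_graph(triples_b)
print(same_bow, same_graph)
```

The two toy documents have identical bags of words, yet their graphs differ because the subject and object roles are swapped. That difference is the kind of information a relationship-aware representation can keep and a bag of words cannot.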

