Representing document as dependency graph for document clustering

@inproceedings{Wang2011RepresentingDA,
  title={Representing document as dependency graph for document clustering},
  author={Yujing Wang and Xiaochuan Ni and Jian-Tao Sun and Yunhai Tong and Zheng Chen},
  booktitle={CIKM},
  year={2011}
}
In traditional clustering methods, a document is often represented as "bag of words" (in BOW model) or n-grams (in suffix tree document model) without considering the natural language relationships between the words. In this paper, we propose a novel approach DGDC (Dependency Graph-based Document Clustering algorithm) to address this issue. In our algorithm, each document is represented as a dependency graph where the nodes correspond to words which can be seen as meta-descriptions of the… CONTINUE READING

References

Publications referenced by this paper.

Similar Papers

Loading similar papers…