Enhancing text clustering by leveraging Wikipedia semantics

@inproceedings{Hu2008EnhancingTC,
  title={Enhancing text clustering by leveraging Wikipedia semantics},
  author={Jian Hu and Lujun Fang and Yang Cao and Hua-Jun Zeng and Hua Li and Qiang Yang and Zheng Chen},
  booktitle={SIGIR},
  year={2008}
}
Most traditional text clustering methods are based on "bag of words" (BOW) representation based on frequency statistics in a set of documents. BOW, however, ignores the important information on the semantic relationships between key terms. To overcome this problem, several methods have been proposed to enrich text representation with external resource in the past, such as WordNet. However, many of these approaches suffer from some limitations: 1) WordNet has limited coverage and has a lack of… CONTINUE READING
Highly Influential
This paper has highly influenced a number of papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 224 citations. REVIEW CITATIONS

6 Figures & Tables

Topics

Statistics

0204020082009201020112012201320142015201620172018
Citations per Year

224 Citations

Semantic Scholar estimates that this publication has 224 citations based on the available data.

See our FAQ for additional information.