Improving Web clustering by cluster selection

  title={Improving Web clustering by cluster selection},
  author={Daniel Crabtree and Xiaoying Gao and Peter Andreae},
  journal={The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)},
Web page clustering is a technology that puts semantically related web pages into groups and is useful for categorizing, organizing, and refining search results. When clustering using only textual information, Suffix Tree Clustering (STC) outperforms other clustering algorithms by making use of phrases and allowing clusters to overlap. One problem of STC and other similar algorithms is how to select a small set of clusters to display to the user from a very large set of generated clusters. The… CONTINUE READING
Highly Cited
This paper has 67 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 42 extracted citations

68 Citations

Citations per Year
Semantic Scholar estimates that this publication has 68 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.

Similar Papers

Loading similar papers…