Enhanced Topic Distillation Using Text, Markup Tags, and Hyperlinks

Soumen Chakrabarti, Mukul Joshi, Vivek Tawde
Topic distillation is the analysis of hyperlink graph structure to identify mutually reinforcing authorities (popular pages) and hubs (comprehensive lists of links to authorities). Topic distillation is becoming common in Web search engines, but the best-known algorithms model the Web graph at a coarse grain, with whole pages as single nodes. Such models may lose vital details in the markup tag structure of the pages, and thus lead to a tightly linked irrelevant subgraph winning over a…
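
To make the "mutually reinforcing" hub and authority idea concrete, here is a minimal sketch of the coarse-grained, page-level iteration that the abstract contrasts itself with (a HITS-style update in which each whole page is a single node). It is not the fine-grained method this paper proposes. The function name `hits`, the `links` dictionary format, and the toy page names are assumptions made for illustration only.

```python
from math import sqrt


def hits(links, iterations=50):
    """Page-level hub/authority scores (illustrative sketch only).

    links: dict mapping each page to an iterable of pages it links to.
    Returns (hub, auth), two dicts of L2-normalized scores.
    """
    # Collect every page that appears as a source or a target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)

    hub = {p: 1.0 for p in pages}
    auth = {p: 1.0 for p in pages}

    for _ in range(iterations):
        # A page's authority is the sum of hub scores of pages linking to it.
        new_auth = {p: 0.0 for p in pages}
        for src, targets in links.items():
            for dst in targets:
                new_auth[dst] += hub[src]
        norm = sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {p: v / norm for p, v in new_auth.items()}

        # A page's hub score is the sum of authority scores it links to.
        new_hub = {p: sum(auth[d] for d in links.get(p, ())) for p in pages}
        norm = sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {p: v / norm for p, v in new_hub.items()}

    return hub, auth


# Toy graph (hypothetical page names): two link lists and one stray page.
toy_links = {"h1": ["a1", "a2"], "h2": ["a1", "a2"], "x": ["a1"]}
hub_scores, auth_scores = hits(toy_links)
```

Because every page is a single node here, a link-dense but off-topic cluster can accumulate high scores, which is the weakness the abstract attributes to coarse-grained models.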