Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction

@inproceedings{Chakrabarti2001IntegratingTD,
  title={Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction},
  author={Soumen Chakrabarti},
  booktitle={WWW},
  year={2001}
}
Topic distillation is the process of finding authoritative Web pages and comprehensive “hubs” that reciprocally endorse each other and are relevant to a given query. Hyperlink-based topic distillation has traditionally been applied to a macroscopic Web model in which documents are nodes in a directed graph and hyperlinks are edges. Macroscopic models miss valuable clues such as banners, navigation panels, and template-based inclusions, which are embedded in HTML pages using markup tags…
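
The macroscopic model described in the abstract, with pages as nodes and hyperlinks as directed edges, can be illustrated with a hub/authority iteration in the style of HITS. This is a minimal sketch and not the paper's method: the abstract does not name an algorithm, and the function names `hits` and `normalize`, the iteration count, and the toy page names are all assumptions made for illustration.

```python
from math import sqrt


def normalize(scores):
    """Scale scores to unit L2 norm; leave an all-zero vector unchanged."""
    norm = sqrt(sum(v * v for v in scores.values()))
    if norm == 0.0:
        return dict(scores)
    return {node: v / norm for node, v in scores.items()}


def hits(edges, iterations=50):
    """HITS-style mutual reinforcement on a directed link graph.

    edges: iterable of (source_page, target_page) hyperlinks.
    Returns (hub_scores, authority_scores) as dictionaries.
    """
    edges = list(edges)
    nodes = sorted({page for edge in edges for page in edge})
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}

    for _ in range(iterations):
        # A page's authority is the sum of the hub scores of pages linking to it.
        new_auth = {n: 0.0 for n in nodes}
        for src, dst in edges:
            new_auth[dst] += hub[src]

        # A page's hub score is the sum of the authority scores of pages it links to.
        new_hub = {n: 0.0 for n in nodes}
        for src, dst in edges:
            new_hub[src] += new_auth[dst]

        auth = normalize(new_auth)
        hub = normalize(new_hub)

    return hub, auth


# Hypothetical toy graph: three hub-like pages linking to two candidate authorities.
toy_edges = [
    ("h1", "a1"), ("h1", "a2"),
    ("h2", "a1"), ("h2", "a2"),
    ("h3", "a2"),
]
hub_scores, auth_scores = hits(toy_edges)
```

In this toy graph `a2` receives links from all three hub-like pages, so it ends up with a higher authority score than `a1`. Because every page is a single node, the sketch cannot tell a navigation panel or banner link from a content link, which is the kind of clue the abstract says macroscopic models miss.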
This paper has 182 citations.