Focused Crawling Using Context Graphs

@inproceedings{Diligenti2000FocusedCU,
  title={Focused Crawling Using Context Graphs},
  author={Michelangelo Diligenti and Frans Coetzee and Steve Lawrence and C. Lee Giles and Marco Gori},
  booktitle={VLDB},
  year={2000}
}
Maintaining currency of search engine indices by exhaustive crawling is rapidly becoming impossible due to the increasing size and dynamic content of the web. Focused crawlers aim to search only the subset of the web related to a specific category, and offer a potential solution to the currency problem. The major problem in focused crawling is performing appropriate credit assignment to different documents along a crawl path, such that short-term gains are not pursued at the expense of less…
This paper has highly influenced 43 other papers.
This paper has 639 citations.

Citations

Publications citing this paper.

Empirical evaluation of the link and content-based focused Treasure-Crawler

Computer Standards & Interfaces • 2016

Focused Crawling: algorithm survey and new approaches with a manual analysis

Ignacio García Dorado
2008

xCrawl: a high-recall crawling method for Web mining

Knowledge and Information Systems • 2008

