Focused Crawls, Tunneling, and Digital Libraries

  title={Focused Crawls, Tunneling, and Digital Libraries},
  author={Donna Bergmark and Carl Lagoze and Alex Sbityakov},
Crawling the Web to build collections of documents related to pre-specified topics became an active area of research during the late 1990’s, crawler technology having been developed for use by search engines. Now, Web crawling is being seriously considered as an important strategy for building large scale digital libraries. This paper covers some of the crawl technologies that might be exploited for collection building. For example, to make such collection-building crawls more effective… CONTINUE READING
Highly Cited
This paper has 203 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 72 extracted citations

Crawling Credible Online Medical Sentiments for Social Intelligence

2013 International Conference on Social Computing • 2013
View 5 Excerpts
Method Support
Highly Influenced

Topic-specific Web Searching based on a Real-text Dictionary

WEBIST • 2012
View 7 Excerpts
Method Support
Highly Influenced

Building domain-specific Web collections for scientific digital libraries: a meta-search enhanced focused crawling method

Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004. • 2004
View 7 Excerpts
Highly Influenced

Relevant Sources of Information Are Not Necessarily Popular Ones

2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) • 2014
View 5 Excerpts
Highly Influenced

Towards extracting event-centric collections from Web archives

International Journal on Digital Libraries • 2018
View 1 Excerpt

203 Citations

Citations per Year
Semantic Scholar estimates that this publication has 203 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 26 references

A simple, prima facie argument in favor of the semantic web

B. Parsia
MonkyFist (April • 2002
View 1 Excerpt

A support vector method for learning ranking functions in information retrieval, March 8 2002. Cornell University Colloqium

T. Joachimes
View 1 Excerpt

Collection synthesis

JCDL • 2002
View 2 Excerpts

Finding Out About

R. K. Belew
View 1 Excerpt