Skip to search formSkip to main contentSkip to account menu

Distributed web crawling

Known as: Distributed crawling, Distributed search, Distributed web crawler 
Distributed web crawling is a distributed computing technique whereby Internet search engines employ many computers to index the Internet via web… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2016
2016
Web crawlers are program, designed to fetch web pages for information retrieval system. Crawlers facilitate this process by… 
Review
2010
Review
2010
There are three core issues recognized for WAN-based distributed Web crawling systems: Web Partition, Agent collaboration and… 
Highly Cited
2009
Highly Cited
2009
The P2P-IPTV is an emerging class of Internet applications that is becoming very popular. The growing popularity of these rather… 
2008
2008
We identify the issues that are important in design of a geographically distributed Web crawler. The identified issues are… 
2004
2004
Distributed crawling has shown that it can overcome important limitations of the centralized crawling paradigm. However, the… 
2004
2004
Distributed crawling has shown that it can overcome important limitations of the today's crawling paradigm. However, the optimal… 
2003
2003
In business, the retrieval of up-to-date, or fresh, information is very important. It is difficult for conventional search… 
2002
2002
In web page retrievals, search engines are usually used. However, conventional search engines have a problem in that their update… 
2001
2001
A web crawling system employing a parallel and distributed architecture needs to have a mechanism to bring the whole system in a… 
1995
1995
We propose a distributed deadlock detection algorithm which detects cycles made of back edges in a distributed search tree. Our…