Semantic Scholar
Web crawler
Known as: Webcrawler, Crawl site, RBSE
A Web crawler is an Internet bot which systematically browses the World Wide Web, typically for the purpose of Web indexing (web spidering). Web…
Wikipedia
Related topics
50 relations
Ajax (programming)
Apache Hadoop
Apache Nutch
Apache Storm
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Design and Implementation of Scalable, Fully Distributed Web Crawler for a Web Search Engine
M. S. Kumar, P. Neelima
2011
Corpus ID: 13014198
Web is a context in which traditional Information Retrieval methods are challenged. Given the volume of the Web and its speed of…
Implementation of Web Crawler
Pooja Gupta, Kalpana Johari
Second International Conference on Emerging…, 2009
Corpus ID: 23083489
The World Wide Web is an interlinked collection of billions of documents formatted using HTML. Ironically the very size of this…
Highly Cited
DistanceRank: An intelligent ranking algorithm for web pages
Ali Mohammad Zareh Bidoki, N. Yazdani
Information Processing & Management, 2008
Corpus ID: 9725263
Highly Cited
Mining templates from search result records of search engines
Hongkun Zhao, W. Meng, Clement T. Yu
Knowledge Discovery and Data Mining, 2007
Corpus ID: 7856427
Metasearch engine, Comparison-shopping and Deep Web crawling applications need to extract search result records enwrapped in…
Highly Cited
A Crawler-based Study of Spyware in the Web
Alexander Moshchuk, Tanya Bragin, S. Gribble, H. Levy
Network and Distributed System Security Symposium, 2006
Corpus ID: 14839645
Malicious spyware poses a significant threat to desktop security and integrity. This paper examines that threat from an Internet…
Highly Cited
Development of a biomimetic miniature robotic crawler
A. Menciassi, D. Accoto, S. Gorini, P. Dario
Auton. Robots, 2006
Corpus ID: 27351903
The paper presents the development of segmented artificial crawlers endowed with passive hook-shaped frictional microstructures…
Highly Cited
DR-NEGOTIATE - a system for automated agent negotiation with defeasible logic-based strategies
Thomas Skylogiannis, G. Antoniou, Nick Bassiliades, Guido Governatori, Antonis Bikakis
International Conference on E-Learning, E…, 2005
Corpus ID: 1077980
Highly Cited
The use of guidelines to automatically verify Web accessibility
J. Abascal, Myriam Arrue, I. Fajardo, Nestor Garay-Vitoria, Jorge Tomás
Universal Access in the Information Society, 2004
Corpus ID: 16060853
Accessibility is one of the key challenges that the Internet must currently face to guarantee universal inclusion. Accessible Web…
Highly Cited
Development of "Souryu I & II" -Connected Crawler Vehicle for Inspection of Narrow and Winding Space
T. Takayama, S. Hirose
J. Robotics Mechatronics, 2003
Corpus ID: 30830639
Highly Cited
MODELING PEER-TO-PEER NETWORK TOPOLOGIES THROUGH “SMALL-WORLD” MODELS AND POWER LAWS
M. Jovanovi
2001
Corpus ID: 15104260
The recent emergence of novel network applications such as Gnutella, Freenet, and Napster has reincarnated the…