PUBCRAWL: protecting users and businesses from CRAWLers

Grégoire Jacob, Engin Kirda, Christopher Krügel, Giovanni Vigna
USENIX Security Symposium
Web crawlers are automated tools that browse the web to retrieve and analyze information. Although crawlers are useful tools that help users to find content on the web, they may also be malicious. Unfortunately, unauthorized (malicious) crawlers are increasingly becoming a threat for service providers because they typically collect information that attackers can abuse for spamming, phishing, or targeted attacks. In particular, social networking sites are frequent targets of malicious crawling…
This paper has 27 citations.


Publications citing this paper.

Profiling Users by Modeling Web Transactions
2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS) • 2017

Protecting web contents against persistent distributed crawlers
2017 IEEE International Conference on Communications (ICC) • 2017

Can web pages be classified using anonymized TCP/IP headers?
2015 IEEE Conference on Computer Communications (INFOCOM) • 2015

Detection of Internet robots using a Bayesian approach
2015 IEEE 2nd International Conference on Cybernetics (CYBCONF) • 2015

