Large-Scale Automatic Classification of Phishing Pages

  title={Large-Scale Automatic Classification of Phishing Pages},
  author={Colin Whittaker and Brian Ryner and Marria Nazif},
Phishing websites, fraudulent sites that impersonate a trusted third party to gain access to private data, continue to cost Internet users over a billion dollars each year. In this paper, we describe the design and performance characteristics of a scalable machine learning classifier we developed to detect phishing websites. We use this classifier to maintain Google’s phishing blacklist automatically. Our classifier analyzes millions of pages a day, examining the URL and the contents of a page… CONTINUE READING
Highly Influential
This paper has highly influenced 25 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 253 citations. REVIEW CITATIONS
162 Citations
28 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 162 extracted citations

253 Citations

Citations per Year
Semantic Scholar estimates that this publication has 253 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 28 references

An Empirical Analysis of Phishing Blacklists

  • S. Sheng, B. Wardman, G. Warner, L. F. Cranor, J. Hong, C. Zhang
  • CEAS
  • 2009
Highly Influential
4 Excerpts

Large scale machine learning and methods

  • J. Bem, G. Harik, J. Levenberg, N. Shazeer, S. Tong
  • US Patent 7222127
  • 2007
Highly Influential
4 Excerpts

Similar Papers

Loading similar papers…