Email-Spam Filtering with the Hashing-Trick

  title={Email-Spam Filtering with the Hashing-Trick},
  author={Joshua Attenberg and Kilian Q. Weinberger and Anirban Dasgupta and Alex Smola},
This paper delves into a recently proposed technique for collaborative spam filtering [7] that facilitates personalization with finite-sized memory guarantees. In large scale open membership email systems most users do not label enough messages for an individual local classifier to be effective, while the data is too noisy to be used for a global filter across all users. Our hybrid global/individual classifier is particularly effective at absorbing the influence of users who label emails very… CONTINUE READING
Highly Cited
This paper has 30 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 22 extracted citations

Malware classification using static analysis based features

2017 IEEE Symposium Series on Computational Intelligence (SSCI) • 2017
View 1 Excerpt

Robust Online Multi-Task Learning with Correlative and Personalized Structures

IEEE Transactions on Knowledge and Data Engineering • 2017
View 1 Excerpt

Using big data to enhance the bosch production line performance: A Kaggle challenge

2016 IEEE International Conference on Big Data (Big Data) • 2016
View 2 Excerpts

Analysis and detection of low quality information in social networks

2014 IEEE 30th International Conference on Data Engineering Workshops • 2014
View 1 Excerpt


Publications referenced by this paper.
Showing 1-9 of 9 references

Vowpal wabbit online learning project

J. Langford, L. Li, A. Strehl
Technical report, • 2007
View 3 Excerpts
Highly Influenced

Algorithm alley

R. Jenkins
Dr. Dobb’s Journal, September • 1997
View 3 Excerpts
Highly Influenced

Sparse binary polynomial hashing and the CRM114 Discriminator

W. S. Yerazunis
In MIT Spam Conference, • 2003
View 3 Excerpts

Vowpal wabbit online learning project Signal detection theory : Valuable tools for evaluating inductive learning

L. Li J. Langford, A. Strehl
The elements of statistical learning • 2001

Similar Papers

Loading similar papers…