A case for unsupervised-learning-based spam filtering

Authors: Feng Qian, Abhinav Pathak, Y. Charlie Hu, Zhuoqing Morley Mao, Yinglian Xie

Traditional content-based spam filtering systems rely on supervised machine learning techniques. In the training phase, labeled email instances are used to build a learning model (e.g., a Naive Bayes classifier or support vector machine), which is then applied to future incoming emails in the detection phase. However, the critical reliance on the training data becomes one of the major limitations of supervised spam filters. Preparing labeled training data is often labor-intensive and can delay…
This paper has 20 citations.


Publications referenced by this paper.

Thomas K. Landauer, Peter W. Foltz, Darrell Laham. An introduction to latent semantic analysis. Discourse Processes, 1998.
