Spam filtering for short messages

@inproceedings{Cormack2007SpamFF,
  title={Spam filtering for short messages},
  author={Gordon V. Cormack and Jos{\'e} Mar{\'i}a G{\'o}mez Hidalgo and Enrique Puertas Sanz},
  booktitle={CIKM},
  year={2007}
}
We consider the problem of content-based spam filtering for short text messages that arise in three contexts: mobile (SMS) communication, blog comments, and email summary information such as might be displayed by a low-bandwidth client. Short messages often consist of only a few words, and therefore present a challenge to traditional bag-of-words based spam filters. Using three corpora of short messages and message fields derived from real SMS, blog, and spam messages, we evaluate feature-based… CONTINUE READING
Highly Cited
This paper has 260 citations. REVIEW CITATIONS

3 Figures & Tables

Topics

Statistics

020406020082009201020112012201320142015201620172018
Citations per Year

261 Citations

Semantic Scholar estimates that this publication has 261 citations based on the available data.

See our FAQ for additional information.