Spam filtering for short messages

Authors: Gordon V. Cormack, José María Gómez Hidalgo, and Enrique Puertas Sanz

We consider the problem of content-based spam filtering for short text messages that arise in three contexts: mobile (SMS) communication, blog comments, and email summary information such as might be displayed by a low-bandwidth client. Short messages often consist of only a few words, and therefore present a challenge to traditional bag-of-words based spam filters. Using three corpora of short messages and message fields derived from real SMS, blog, and spam messages, we evaluate feature-based…
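
To illustrate why a message of only a few words gives a bag-of-words filter little to work with, here is a minimal sketch. The sample message, the tokenization rule, and the function names are hypothetical and are not taken from the paper. The character trigram function is included only as a point of comparison and is an assumption, not a description of the features the authors evaluate.

```python
import re
from collections import Counter


def word_features(text):
    """Bag-of-words: lowercase word tokens mapped to their counts."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def char_ngram_features(text, n=3):
    """Overlapping character n-grams mapped to their counts.

    Illustrative comparison only; not claimed to be the paper's method.
    """
    s = text.lower()
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))


# Hypothetical short message, made up for this example.
sms = "Free entry! Txt WIN to claim"

words = word_features(sms)
trigrams = char_ngram_features(sms)

print(len(words), "distinct word features")
print(len(trigrams), "distinct character trigram features")
```

A message this short yields only a handful of word features, so a classifier that relies on word counts alone sees very little evidence per message.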