Blocking Blog Spam with Language Model Disagreement

Abstract

We present an approach for detecting link spam common in blog comments by comparing the language models used in the blog post, the comment, and pages linked by the comments. In contrast to other link spam filtering approaches, our method requires no training, no hard-coded rule sets, and no knowledge of complete-web connectivity. Preliminary experiments…
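The idea of comparing language models can be illustrated with a small sketch. The abstract does not say how the models are built or compared, so everything here is an assumption made for illustration: unigram models over whitespace tokens, linear interpolation with a background model built from the two texts themselves, KL divergence as the disagreement measure, and the hypothetical names `unigram_model`, `smoothed_prob`, and `kl_divergence`. It is not the authors' implementation.

```python
import math
from collections import Counter


def unigram_model(text):
    """Count lowercase whitespace tokens (an assumed, very simple tokenizer)."""
    return Counter(text.lower().split())


def smoothed_prob(word, counts, total, background, bg_total, lam):
    """Interpolate the document estimate with a background estimate."""
    p_doc = counts[word] / total if total else 0.0
    p_bg = background[word] / bg_total
    return lam * p_doc + (1 - lam) * p_bg


def kl_divergence(text_a, text_b, lam=0.9):
    """KL(A || B) between smoothed unigram models of two texts.

    The background model is the pooled counts of both texts (an assumption),
    which keeps every probability strictly positive for lam < 1.
    """
    a, b = unigram_model(text_a), unigram_model(text_b)
    background = a + b
    bg_total = sum(background.values())
    total_a, total_b = sum(a.values()), sum(b.values())
    kl = 0.0
    for word in background:
        p = smoothed_prob(word, a, total_a, background, bg_total, lam)
        q = smoothed_prob(word, b, total_b, background, bg_total, lam)
        kl += p * math.log(p / q)
    return kl


# Made-up example texts, not data from the paper.
post = "training a language model for text retrieval"
on_topic = "a language model helps text retrieval"
spam = "cheap pills buy cheap pills online now"

print(kl_divergence(post, on_topic))
print(kl_divergence(post, spam))
```

In this toy setup the spam-like comment diverges from the post more than the on-topic comment does. The same function could be applied to the post and a page linked from the comment. How to turn such a score into a spam decision is not covered by the abstract, so no threshold is suggested here.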

3 Figures and Tables

Statistics

Citations per Year (chart omitted; x-axis 2005 to 2018, y-axis 0 to 30)

278 Citations

Semantic Scholar estimates that this publication has 278 citations based on the available data.
