Web Spam Detection via Commercial Intent Analysis

  title={Web Spam Detection via Commercial Intent Analysis},
  author={Andr{\'a}s A. Bencz{\'u}r and Istv{\'a}n B{\'i}r{\'o} and K{\'a}roly Csalog{\'a}ny and Tam{\'a}s Sarl{\'o}s},
We propose a number of features for Web spam filtering based on the occurrence of keywords that are either of high advertisement value or highly spammed. Our features include popular words from search engine query logs as well as high cost or volume words according to Google AdWords. We also demonstrate the spam filtering power of the Online Commercial Intention (OCI) value assigned to an URL in a Microsoft adCenter Labs Demonstration and the Yahoo! Mindset classification of Web pages as either… CONTINUE READING

From This Paper

Figures, tables, results, connections, and topics extracted from this paper.
17 Extracted Citations
0 Extracted References
Similar Papers