A Comparative Impact Study of Attribute Selection Techniques on Naïve Bayes Spam Filters

Abstract

The main problem of the Internet e-mail service is the massive spam message delivery. Everyday, hundreds of unwanted and unhelpful messages are received by Internet users flooding their mailboxes. Fortunately, nowadays there are different kinds of filters able to identify and automatically delete most of these messages. In order to reduce the problem dimensionality only representative attributes are selected from each e-mail using feature selection techniques. This work presents a comparison among five well-known feature selection strategies when they are applied in conjunction with four different types of Naïve Bayes classifiers. The results obtained from the experiments carried out show the relevance of choosing an appropriate feature selection technique in order to obtain accurate results.

DOI: 10.1007/978-3-540-70720-2_17

Extracted Key Phrases

3 Figures and Tables

Cite this paper

@inproceedings{Mndez2008ACI, title={A Comparative Impact Study of Attribute Selection Techniques on Na{\"{i}ve Bayes Spam Filters}, author={Jos{\'e} Ramon M{\'e}ndez and I. Cid and Daniel Glez-Pe{\~n}a and Miguel Rocha and Florentino Fern{\'a}ndez Riverola}, booktitle={ICDM}, year={2008} }