Spam Filtering using K mean Clustering with Local Feature Selection Classifier

  title={Spam Filtering using K mean Clustering with Local Feature Selection Classifier},
  author={Anand Sharma and Vedant Rastogi},
  journal={International Journal of Computer Applications},
this paper, we present a comprehensive review of recent developments in the application of machine learning algorithms to Spam filtering, focusing on textual approaches. We are trying to introduce various spam filtering methods from Naive Bias to Hybrid methods for spam filtering, we are also introducing types of filters recently used for spam filtering along with architecture of spam filter and its types .In this paper we are proposing a technique using Local feature classification methods… 

Tables from this paper

Supervised Machine Learning Classifier for Email Spam Filtering
An SVM handling separation of nonlinear data using a Kernel function, which is an advanced machine learning technique in R to improve the accuracy of the model and present a generic template for a working of kernel function in SVM that can be built in R.
Machine Learning Techniques for Spam Detection in Email and IoT Platforms: Analysis and Research Challenges
The machine learning techniques used for spam filtering techniques used in email and IoT platforms are surveyed by classifying them into suitable categories and a comprehensive comparison of these techniques is made based on accuracy, precision, recall, etc.
Comparative Analysis Review of Pioneering DBSCAN and Successive Density-Based Clustering Algorithms
The implementation, features, strengths, and drawbacks of the DBSCAN are thoroughly examined, and the successive algorithms proposed to provide improvement on the original DBS CAN are classified based on their motivations and are discussed.
Detecting Intrusion with High Accuracy: Using Hybrid K-Multi Layer Perceptron
  • A. Dogra, Taqdir
  • Computer Science
    International Journal of Recent Technology and Engineering
  • 2019
The proposed mechanism of Self organizing KMLP technique to discover abnormal patterns from the dataset to find abnormality from the state driven dataset shows significant improvement by the margin of 10%.
A Web Extraction Browsing Scheme for Time-Critical Specific URLs Fetching
This paper works towards creating an efficient search engine that removes stop words, extract meaningful words and form clusters of highest frequency words and final stage present the result in terms of URLs.
Web URLs retrieval with least execution time using MPV clustering approach
A novel regulated most probable learning component utilized for exhibitions appeared regarding execution time, data extraction, number of URLs got in web search, and the consequences of the proposed framework are contrasted and approved.


A Local-Concentration-Based Feature Extraction Approach for Spam Filtering
  • Y. Zhu, Y. Tan
  • Computer Science
    IEEE Transactions on Information Forensics and Security
  • 2011
It is shown that the LC approach cooperates well with three term selection methods, which endows it with flexible applicability in the real world and has better performance in terms of both accuracy and F1 measure.
Applications of Support Vector Machine Based on Boolean Kernel to Spam Filtering
The experiment results show that this spam filtering arithmetic of support vector machine based on Boolean kernel has high separating accuracy, high recall ratio and precision ratio and has its value in theory and application.
Spam Classification using new kernel function in Support Vector Machine
A new kernel function is developed called Cauchy kernel, and performance is evaluated over ECML-PKDD dataset and compared with the predefined kernel functions, and shows that the results are better than predefinedkernel functions.
The main idea behind this project is to design a spam detection system that will be enabled to analyze the content of emails, in particular the artificially generated image sent as attachment in an email.
Survey on Spam Filtering Techniques
The overview of existing e-mail spam filtering methods is given, the classification, evaluation, and comparison of traditional and learning-based methods are provided and the statement for new approach in spam filtering technique is considered.
Learning to Filter Unsolicited Commercial E-Mail
The architecture of a fully implemented learning-based anti-spam filter is described, and an analysis of its behavior in real use over a period of seven months is presented.
AISEC: an artificial immune system for e-mail classification
An immune-inspired algorithm called AISEC is presented that is capable of continuously classifying electronic mail as interesting and non-interesting without the need for re-training and has a great potential for augmentation.
Fault diagnosis of plant systems using immune networks
  • A. Ishiguro, Y. Watanabe, Y. Uchikawa
  • Computer Science
    Proceedings of 1994 IEEE International Conference on MFI '94. Multisensor Fusion and Integration for Intelligent Systems
  • 1994
This study tries to apply immune networks, which work as online fault diagnosis systems by constructing large-scale networks, called immune networks (idiotypic networks), to a fault diagnosis of plant systems.