Application of sim-hash algorithm and big data analysis in spam email detection system

  title={Application of sim-hash algorithm and big data analysis in spam email detection system},
  author={Phuc-Tran Ho and Hee Sun Kim and Sung-Ryul Kim},
Currently, there are many effective techniques that are used for filtering spam emails. However, spammers have mostly identified the weakness of those methods in order to bypass current detection systems. In this paper, we propose a novel similarity-based method that implements the fingerprinting technique on parallel processing framework. Furthermore, meet-in-the-middle approach is used in our method to achieve a higher accuracy in the spam email detection system. Our experimental result… CONTINUE READING
Highly Cited
This paper has 26 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-8 of 8 extracted citations

Detecting spam and phishing mails using SVM and obfuscation URL detection algorithm

2017 International Conference on Inventive Systems and Control (ICISC) • 2017
View 2 Excerpts

Improve the Prediction Accuracy of Naïve Bayes Classifier with Association Rule Mining

2016 IEEE 2nd International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE International Conference on High Performance and Smart Computing (HPSC), and IEEE International Conference on Intelligent Data and Security (IDS) • 2016
View 1 Excerpt

Spam filtering using Association Rules and Naïve Bayes Classifier

2015 IEEE International Conference on Progress in Informatics and Computing (PIC) • 2015
View 1 Excerpt

Web Service-Enabled Spam Filtering with Naïve Bayes Classification

2015 IEEE First International Conference on Big Data Computing Service and Applications • 2015
View 1 Excerpt