A Novel Approach for Spam Email Detection Based On Shifted Binary Patterns, Security and Communication Networks, DOI: 10.1002/sec.1412
[ X ]
Tarih
2016
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Erişim Hakkı
info:eu-repo/semantics/openAccess
Özet
Advances in communication allow people flexibility to communicate in various ways. Electronic mail (email) is one of the most used communication methods for personal or business purposes. However, it brings one of the most tackling issues, called spam email, which also raises concerns about data safety. Thus, the requirement of detecting spams is crucial for keeping the users safe and saving them from the waste of time while tackling those issues. In this study, an effective approach based on the probability of the usage of the characters that has similar orders with respect to their UTF-8 value by employing shifted one-dimensional local binary pattern (shifted-1D-LBP) was used to extract quantitative features from emails for spam email detection. Shifted-1D-LBP, which can be described as an ordered set of binary comparisons of the center value with its neighboring values, is a content-based approach to spam detection with low-level information. To validate the performance of the proposed approach, three benchmark corpora, Spamassasian, Ling-Spam, and TREC email corpuses, were used. The average classification accuracies of the proposed approach were 92.34%, 92.57%, and 95.15%, respectively. Analysis and promising experimental results indicated that the proposed approach was a very competitive feature extraction method in spam email filtering.