Title: An ensemble design approach based on bagging technique for filtering e-mail spam

Authors: Sanjiban Sekhar Roy; V. Madhu Viswanatham; P. Venkata Krishna

Addresses: School of Computing Science and Engineering, VIT University, Pin: 632014 Vellore, Tamil Nadu, India ' School of Computing Science and Engineering, VIT University, Pin: 632014 Vellore, Tamil Nadu, India ' School of Computing Science and Engineering, VIT University, Pin: 632014 Vellore, Tamil Nadu, India

Abstract: Spam is a severe widespread predicament which causes troubles for almost every computer user and as well as for companies and institutions. Detection of spam e-mail is a difficult task due to several complications occurring in the process of exchanging information over a network. In this paper, we have demonstrated an approach for designing a spam filtering technique using ensemble-based bagging technique. The bagging technique classifies the e-mail message into either spam or not spam (i.e., ham). Bagging being an effective classifier is a powerful pattern recognition and machine learning methodology that is widely used in the literature. Our approach is tested for performance with other existing methods. It is observed that our method, i.e., bagging gives the better accuracy while classifying e-mail instances. Experimental outcome also exhibits that the framework using bagging technique, we attain better precision and recall. Choosing the optimal value of the parameters is a crucial criterion, and this was achieved by performing ten fold cross-validations.

Keywords: e-mail; spam filtering; bagging; classifying; cross-validation.

DOI: 10.1504/IJAACS.2017.086648

International Journal of Autonomous and Adaptive Communications Systems, 2017 Vol.10 No.3, pp.247 - 260

Received: 13 Jul 2014
Accepted: 22 Oct 2014

Published online: 18 Sep 2017 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article