You can view the full text of this article for free using the link below.

Title: Email spam detection using bagging and boosting of machine learning classifiers

Authors: Uma Bhardwaj; Priti Sharma

Addresses: Department of Computer Science and Applications, Maharshi Dayanand University, Rohtak, Haryana – 124001, India ' Department of Computer Science and Applications, Maharshi Dayanand University, Rohtak, Haryana – 124001, India

Abstract: The increase in the popularity, utility, and significance of electronic mails has also raised the exposure of spam emails. This paper endeavours to detect email spam by constructing an ensemble system using bagging and boosting of machine learning techniques. The dataset used for the experimentation is Ling-Spam Corpus. The system detects spam email by bagging the machine learning-based multinomial Naïve Bayes (MNB) and J48 decision tree classifiers followed by the boosting technique of converting weak classifiers into strong by implementing the Adaboost algorithm. The experimentation includes three different experiments and the results attained are compared with each other. Experiment 1 employs the individual classifiers, experiment 2 ensembles the classifiers with bagging approach, and experiment 3 ensembles the classifiers by implementing the boosting approach for the email spam detection. The effectiveness of the ensemble methods is manifested by comparing the evaluated results with individual classifiers in terms of evaluation metrics.

Keywords: email spam; text mining; Naïve Bayes; J48 algorithm; spam filtering; correlation based feature selection; bagging; boosting.

DOI: 10.1504/IJAIP.2023.128084

International Journal of Advanced Intelligence Paradigms, 2023 Vol.24 No.1/2, pp.229 - 253

Received: 15 Jun 2019
Accepted: 14 Sep 2019

Published online: 05 Jan 2023 *

Full-text access for editors Full-text access for subscribers Free access Comment on this article