Title: Ensemble model with improved DCNN for big data classification by handling class imbalance problem

Authors: Shini Lawrance; J.R. Jeba

Addresses: Computer Science, Noorul Islam Center for Higher Education, Kumaracoil, 629180, India ' Computer Applications, Noorul Islam Center for Higher Education, Kumaracoil, 629180, India

Abstract: This research suggests a big data classification model that uses an improved deep convolutional neural network (IDCNN) and has five phases. In the first stage, Z-score normalisation is employed for preprocessing the input data. The second phase involves processing the preprocessed data for improved class imbalance using SMOTE-ENC. Then, the subsequent phase involves extracting the collection of features, which also includes raw data and features based on correlation, entropy, and MI. Then, in the fourth phase, to guarantee appropriate feature selection, an improved recursive feature elimination (IRFE) approach is employed for the selection of features is performed using the extracted features. Finally, ensemble classification using a collection of classifiers like Bi-LSTM, SVM, RNN and IDCNN is performed depending on the features that have been chosen. The IDCNN classifier is used in this case to categorise the final result by taking Bi-LSTM, SVM and RNN output scores as input.

Keywords: data; classification; class imbalance; deep convolutional neural network; DCNN; improved recursive feature elimination; IRFE.

DOI: 10.1504/IJDMMM.2025.148853

International Journal of Data Mining, Modelling and Management, 2025 Vol.17 No.3, pp.272 - 295

Received: 08 Dec 2023
Accepted: 26 May 2024

Published online: 29 Sep 2025 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article