Title: Ensemble model with improved DCNN for big data classification by handling class imbalance problem
Authors: Shini Lawrance; J.R. Jeba
Addresses: Computer Science, Noorul Islam Center for Higher Education, Kumaracoil, 629180, India; Computer Applications, Noorul Islam Center for Higher Education, Kumaracoil, 629180, India
Abstract: This research proposes a five-phase big data classification model built around an improved deep convolutional neural network (IDCNN). In the first phase, Z-score normalisation is applied to preprocess the input data. In the second phase, the preprocessed data are rebalanced using SMOTE-ENC to address class imbalance. The third phase extracts the feature set, which comprises the raw data together with features based on correlation, entropy, and mutual information (MI). In the fourth phase, an improved recursive feature elimination (IRFE) approach selects the most appropriate features from those extracted. Finally, ensemble classification is performed on the selected features using Bi-LSTM, SVM, RNN and IDCNN classifiers: the output scores of the Bi-LSTM, SVM and RNN are fed as input to the IDCNN, which produces the final classification.
Keywords: data; classification; class imbalance; deep convolutional neural network; DCNN; improved recursive feature elimination; IRFE.
DOI: 10.1504/IJDMMM.2025.148853
International Journal of Data Mining, Modelling and Management, 2025 Vol.17 No.3, pp.272 - 295
Received: 08 Dec 2023
Accepted: 26 May 2024
Published online: 29 Sep 2025
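
Illustrative note: the first phase named in the abstract, Z-score normalisation, rescales each feature to zero mean and unit standard deviation. The following is a minimal sketch of that general technique only, not the authors' implementation; the function name `zscore_columns`, the use of the population standard deviation, and the handling of constant columns are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def zscore_columns(rows):
    """Standardise each column of a row-major table to zero mean and unit spread.

    Hypothetical helper for illustration; it uses the population standard
    deviation, which is an assumption rather than a detail from the paper.
    """
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    stds = [pstdev(col) for col in columns]
    # A constant column has zero spread, so map it to 0.0 instead of dividing by zero.
    return [
        [(x - m) / s if s > 0 else 0.0 for x, m, s in zip(row, means, stds)]
        for row in rows
    ]


# Toy data: the first column varies, the second is constant.
sample = [[1.0, 10.0], [2.0, 10.0], [3.0, 10.0]]
normalised = zscore_columns(sample)
```

In this sketch the varying column ends up centred on zero, while the constant column is mapped to zeros; a real pipeline would typically fit the means and standard deviations on training data only and reuse them for test data.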