Title: Random forest and genetic algorithm united with hyperparameter for diabetes prediction by using WBSMOTE, wrapper approach

Authors: A. Usha Nandhini; K. Dharmarajan

Addresses: School of Computing, Vels Institute of Science, Technology & Advanced Studies (VISTAS), Pallavaram, Chennai, 600117, Tamil Nadu, India ' Department of Information Technology, Vels Institute of Science, Technology & Advanced Studies (VISTAS), Pallavaram, Chennai, 600117, Tamil Nadu, India

Abstract: Food is converted into energy by the human body, but diabetes develops when insulin stops working properly and glucose remains in the bloodstream. Heart disease, stroke, renal failure, blindness, nerve damage, gum disease, and even amputations can all be caused by hyperglycemia, or high blood sugar. In recent years, machine learning has made great strides, and its usage has improved numerous areas of healthcare. This research aimed to construct a model that could accurately predict a person's likelihood of developing diabetes. In this paper, we focus on preprocessing techniques and the problem of data imbalance. In this research, diabetes diagnosis was accomplished using the random forest classifier (RFC), WBSMOTE, and the wrapping method. Accuracy in the RFC was improved when evolutionary algorithms were used with the hyperparameter optimisation technique. The UCI machine learning repository's PIMA Indian Diabetes (PIDD) dataset was used for the tests. The outcomes demonstrated that the suggested method outperformed with a maximum accuracy of 93%.

Keywords: diabetes; machine learning; random forest; EFS; exhaustive feature selection; WBSMOTE; genetic algorithm; health industry.

DOI: 10.1504/IJSSE.2023.131226

International Journal of System of Systems Engineering, 2023 Vol.13 No.2, pp.207 - 227

Received: 29 Jun 2022
Accepted: 23 Aug 2022

Published online: 01 Jun 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article