Title: Early prediction of heart disease risk using extreme gradient boosting: a data-driven analysis
Authors: Hamdi A. Al-Jamimi
Addresses: Information and Computer Science, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia; Research Excellence, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia
Abstract: Heart disease is a leading cause of morbidity and mortality worldwide. Early identification of heart disease risk is critical for timely treatment and prevention of further complications. This study provides a detailed examination of a novel heart disease dataset encompassing 333 cases and 21 features. The study employed the eXtreme gradient boosting (XGBoost) algorithm to develop an intelligent predictive model to detect the likelihood of heart disease at an early stage. The choice of the XGBoost model for this study was apt, considering its strengths in managing structured medical datasets with multiple features, resistance to overfitting, and interpretability for insights into feature importance. Feature selection was utilised to identify the most important predictors for prediction. The findings demonstrate that the Gradient Boosting classifier outperforms other machine learning (ML) techniques with a 99% accuracy rate. The results highlight the capability of ML in aiding the early detection of heart diseases.
Keywords: heart disease; healthcare; artificial intelligence; AI; machine learning; ML; early prediction.
DOI: 10.1504/IJBET.2024.140563
International Journal of Biomedical Engineering and Technology, 2024 Vol.45 No.4, pp.296 - 313
Received: 05 Aug 2023
Accepted: 03 Dec 2023
Published online: 23 Aug 2024 *