Title: Early prediction of heart disease risk using extreme gradient boosting: a data-driven analysis

Authors: Hamdi A. Al-Jamimi

Addresses: Information and Computer Science, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia; Research Excellence, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia

Abstract: Heart disease is a leading cause of morbidity and mortality worldwide. Early identification of heart disease risk is critical for timely treatment and prevention of further complications. This study provides a detailed examination of a novel heart disease dataset encompassing 333 cases and 21 features. The study employed the eXtreme gradient boosting (XGBoost) algorithm to develop an intelligent predictive model to detect the likelihood of heart disease at an early stage. The choice of the XGBoost model for this study was apt, considering its strengths in managing structured medical datasets with multiple features, resistance to overfitting, and interpretability for insights into feature importance. Feature selection was utilised to identify the most important predictors for prediction. The findings demonstrate that the Gradient Boosting classifier outperforms other machine learning (ML) techniques with a 99% accuracy rate. The results highlight the capability of ML in aiding the early detection of heart diseases.

Keywords: heart disease; healthcare; artificial intelligence; AI; machine learning; ML; early prediction.

DOI: 10.1504/IJBET.2024.140563

International Journal of Biomedical Engineering and Technology, 2024 Vol.45 No.4, pp.296 - 313

Received: 05 Aug 2023
Accepted: 03 Dec 2023

Published online: 23 Aug 2024 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article