Title: Colorectal cancer risk factor assessment in Palestine using machine learning models

Authors: Mohammad A.Z. Abu Zuhri; Mohammed Awad; Shahnaz Najjar; Nuha El Sharif; Issa Ghrouz

Addresses: Department of Nursing, Arab American University, Palestine ' Department of Computer Systems Engineering, Arab American University, Palestine ' Department of Health Informatics, Arab American University, Palestine ' Faculty of Public Health, AlQuds University, Palestine ' Ministry of Health, Ramallah, Palestine

Abstract: Colorectal cancer (CRC) has a prevalence of 15% among men and 14.6% among women of all cancers. This research was carried out to assess behavioural risk factors that affected Palestinian reported CRC cases, and to make use of machine learning (ML) tools that might be used in CRC prediction, where the use of a public CRC classification and prediction tool based on accurate ML tools might help individuals in addressing their behavioural CRC risk factors and enhancing their engagement with their health. In this research, Palestinian dataset that consists of 57 predictors was collected, and the dataset consists of 216 instances. Statistical models were used to determine the important features. The study found that the most important risk factors to consider are age, past medical history, diet behaviours, physical activity, and obesity. Consequently, ML models were applied to classify and predict CRC risk factors. Results showed that ANNs outperformed all models.

Keywords: colorectal cancer; CRC; data mining; risk factors; machine learning; classification; Palestine.

DOI: 10.1504/IJMEI.2024.136963

International Journal of Medical Engineering and Informatics, 2024 Vol.16 No.2, pp.126 - 138

Received: 19 Jul 2021
Accepted: 07 Nov 2021

Published online: 01 Mar 2024 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article