Title: A comparative analysis of classifiers in cancer prediction using multiple data mining techniques

Authors: Seyed Mohammad Jafar Jalali; Sérgio Moro; Mohammad Reza Mahmoudi; Keramat Allah Ghaffary; Mohsen Maleki; Aref Alidoostan

Addresses: Department of Information Technology, Allameh Tabataba'i University, Tehran, Iran ' Instituto Universitário de Lisboa (ISCTE-IUL), ISTAR-IUL, Lisboa, Portugal; ALGORITMI Research Centre, University of Minho, Guimarães, Portugal ' Department of Statistics, Fasa University, Fasa, Iran ' Department of Statistics, Fasa University, Fasa, Iran ' Department of Statistics, Shiraz University, Shiraz, Iran ' Department of Information Technology, Islamic Azad University of Science and Research of Fars, Shiraz, Iran

Abstract: In recent years, application of data mining methods in health industry has received increased attention from both health professionals and scholars. This paper presents a data mining framework for detecting breast cancer based on real data from one of the Iran hospitals by applying association rules and the most commonly used classifiers. The former were adopted for reducing the size of datasets, while the latter were chosen for cancer prediction. A k-fold cross-validation procedure was included for evaluating the performance of the proposed classifiers. Among the six classifiers used in this paper, support vector machine achieved the best results, with an accuracy of 93%. It is worth mentioning that the approach proposed can be applied for detecting other diseases as well.

Keywords: cancer prediction; data mining; classifiers; association rules.

DOI: 10.1504/IJBISE.2017.088687

International Journal of Business Intelligence and Systems Engineering, 2017 Vol.1 No.2, pp.166 - 178

Received: 29 Dec 2016
Accepted: 12 Jun 2017

Published online: 14 Dec 2017 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article