Title: Gene selection and decision tree based classification for cancerous sample detection

Authors: Sunanda Das; Asit Kumar Das

Addresses: Department of Computer Science and Engineering, Neotia Institute of Technology, Management and Science, Diamond Harbour, South 24-Pargana 743368, West Bengal, India ' Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah 711103, West Bengal, India

Abstract: Generally, gene expression data are of high-dimensional which cause degradation of the performance of gene data analysis for disease prediction. Therefore, it is a big issue for the traditional classifiers to perform well on high-dimensional microarray data where the number of genes far exceeds the number of samples. In the proposed work, initially, Pearson's correlation coefficient is computed between every pair of genes and based on these coefficients gene dependency set is formed. From every pair of gene dependencies in the gene dependency set, similarity coefficient is measured between two genes using Jaccard Coefficient and thus a gene similarity matrix is computed and a rank is set for each gene indicating its importance. The highest rank gene is considered as the core or the most important gene of the gene set. Next, a rough set theory-based quick reduct algorithm is applied to select only the most informative genes, called reduct, which are sufficient to fully characterise the overall class structure of the gene dataset for disease analysis. Finally, from the reduced gene set of all samples, a rule-based classifier, namely, decision tree is constructed which is applied to unknown samples to predict if it is a diseased or normal sample. Experimental results show the effectiveness of the algorithm.

Keywords: gene dependency; similarity measures; reduct generation; decision tree classifier; gene selection; cancerous samples; gene expression data; gene similarity matrix; rough set theory; classification; leukaemia; colon cancer; prostate cancer; lung cancer.

DOI: 10.1504/IJBET.2016.076729

International Journal of Biomedical Engineering and Technology, 2016 Vol.21 No.1, pp.1 - 14

Received: 15 Aug 2015
Accepted: 23 Sep 2015

Published online: 24 May 2016 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article