Title: Improving robustness of gene ranking by multi-criterion combination with novel gene importance transformation

Authors: Feng Yang; K.Z. Mao

Addresses: School of Electrical & Electronic Engineering, Nanyang Technological University, 50 Nanyang Avenue, 639798 Singapore ' School of Electrical & Electronic Engineering, Nanyang Technological University, 50 Nanyang Avenue, 639798 Singapore

Abstract: Feature ranking, which ranks features via their individual importance, is one of the frequently used feature selection techniques. Traditional feature ranking criteria are apt to produce inconsistent ranking results even with light perturbations in training samples when applied to high dimensional and small-sized gene expression data, which brings troubles for further studies such as biomarker identification. A widely used strategy for solving the inconsistencies is the multicriterion combination, where score normalisation is crucial. In this paper, three problems in existing methods are first analyzed, and then a new feature importance transformation algorithm based on resampling and permutation is proposed for score normalisation. Experimental studies on four popular gene expression data sets show that the multi-criterion combination based on the proposed score normalisation produces gene rankings with improved robustness.

Keywords: feature ranking; multi-criterion combination; score normalisation; robustness; gene ranking; gene importance transformation; feature selection; bioinformatics; gene expression data; biomarker identification; biomarkers.

DOI: 10.1504/IJDMB.2013.050978

International Journal of Data Mining and Bioinformatics, 2013 Vol.7 No.1, pp.22 - 37

Received: 08 Mar 2011
Accepted: 08 Mar 2011

Published online: 20 Oct 2014 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article