Title: Sample-space-based feature extraction and class preserving projection for gene expression data

Authors: Wenjun Wang

Addresses: School of Computer Science and Engineering, Xidian University, Post Box 161, #2 TaiBai Road, Xi'an, 710071, China

Abstract: To overcome the high computational complexity and severe matrix singularity that arise when Principal Component Analysis (PCA) and Fisher's Linear Discriminant Analysis (LDA) are used for feature extraction on high-dimensional data, this paper presents sample-space-based feature extraction. The approach moves the feature-extraction computation from gene space to sample space by representing the optimal transformation vector as a weighted sum of the samples. The technique is used to implement PCA, LDA, and Class Preserving Projection (CPP), a newly proposed method for discriminant feature extraction. Experimental results on gene expression data demonstrate the effectiveness of the method.
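The idea of writing the transformation vector as a weighted sum of samples can be illustrated for the PCA case with the standard Gram-matrix (dual) formulation. The sketch below is an assumption-laden illustration, not the paper's own algorithm: the function name `sample_space_pca`, the rows-as-samples layout of `X`, and the use of NumPy are all choices made here for the example. With n samples and d genes (n much smaller than d), it solves an n x n eigenproblem instead of a d x d one, then maps each eigenvector of sample weights back to gene space.

```python
import numpy as np


def sample_space_pca(X, k):
    """Illustrative dual PCA (hypothetical helper, not from the paper).

    X : array of shape (n_samples, n_genes), rows are samples.
    k : number of components, assumed <= rank of the centered data.
    Returns (W, variances): W has shape (n_genes, k) with unit-norm columns.
    """
    Xc = X - X.mean(axis=0)              # center each gene
    G = Xc @ Xc.T                        # n x n Gram matrix in sample space
    evals, evecs = np.linalg.eigh(G)     # eigenvalues in ascending order
    order = np.argsort(evals)[::-1][:k]  # indices of the k largest
    evals, A = evals[order], evecs[:, order]
    # Each direction is a weighted sum of centered samples: w = Xc^T a.
    # Its squared norm is a^T G a = eigenvalue, so divide by the square root.
    W = Xc.T @ A / np.sqrt(evals)
    variances = evals / (X.shape[0] - 1)
    return W, variances
```

As a usage example, `W, var = sample_space_pca(X, 3)` followed by `(X - X.mean(axis=0)) @ W` gives the three-dimensional projected samples. The d x d covariance matrix is never formed, which is the source of the savings when genes greatly outnumber samples.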

Keywords: high-dimensional data; feature extraction; gene expression data; principal component analysis; PCA; Fisher; linear discriminant analysis; LDA; class preserving projection; CPP; bioinformatics.

DOI: 10.1504/IJDMB.2013.055498

International Journal of Data Mining and Bioinformatics, 2013 Vol.8 No.2, pp.224 - 246

Received: 19 Mar 2011
Accepted: 28 Nov 2011

Published online: 20 Oct 2014
