Comparison of feature selection and classification combinations for cancer classification using microarray data
by Vijayan Vinaya, Nadeem Bulsara, Chetan J. Gadgil, Mugdha Gadgil
International Journal of Bioinformatics Research and Applications (IJBRA), Vol. 5, No. 4, 2009

Abstract: High throughput gene expression data can be used to identify biomarker profiles for classification. The accuracy of microarray based sample classification depends on the algorithm employed for selecting the features (genes) used for classification, and the classification algorithm. We have evaluated the performance of over 2000 combinations of feature selection and classification algorithms in classifying cancer datasets. One of these combinations (SVM for ranking genes + SMO) shows excellent classification accuracy using a small number of genes across three cancer datasets tested. Notably, classification using 15 selected genes yields 96% accuracy for a dataset obtained on an independent microarray platform.

Online publication date: Tue, 28-Jul-2009

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Bioinformatics Research and Applications (IJBRA):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com