Spark based classification of microarray data using scalable artificial neural network
by Mukesh Kumar; Ransingh B. Ray; Santanu K. Rath
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 19, No. 4, 2017

Abstract: Microarray data suffers from the curse of dimensionality: the number of features is huge compared with the number of samples. Data retrieved from microarrays varies in nature and changes over time. The vast amount of raw gene expression data often leads to computational and analytical challenges, including classifying the dataset into the correct groups or classes. In this paper, various feature selection techniques based on statistical tests are proposed using the Spark framework. After the relevant features are selected with these statistical tests, an Artificial Neural Network (ANN) based on the Spark framework (sf-ANN) is proposed, which runs on a scalable cluster with multiple nodes. The performance of sf-ANN is tested on microarray datasets of various dimensions. To examine its performance, a detailed comparative analysis of execution time is presented for the sf-ANN classifier on the Spark framework and on a conventional system (where the data is stored on a standalone machine).
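
As a rough illustration of statistical-test-based feature selection of the kind the abstract describes, the sketch below ranks genes by the absolute value of Welch's t-statistic between two classes and keeps the top k. The abstract does not name the specific tests used, so the choice of Welch's t-test is an assumption, as are the function names welch_t and select_top_k and the toy data. The sketch runs on a single machine in plain Python and does not reproduce the paper's Spark implementation.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t-statistic for two samples (unequal variances allowed)."""
    diff = mean(a) - mean(b)
    denom = sqrt(variance(a) / len(a) + variance(b) / len(b))
    if denom == 0.0:
        # Both classes are constant for this feature: uninformative if the
        # means match, perfectly separating otherwise.
        return 0.0 if diff == 0.0 else float("inf") * (1 if diff > 0 else -1)
    return diff / denom


def select_top_k(samples, labels, k):
    """Return the indices of the k features with the largest |t|.

    samples: list of rows (one per sample), each a list of expression values
    labels:  list of 0/1 class labels aligned with samples
    """
    n_features = len(samples[0])
    scores = []
    for j in range(n_features):
        class0 = [row[j] for row, y in zip(samples, labels) if y == 0]
        class1 = [row[j] for row, y in zip(samples, labels) if y == 1]
        scores.append((abs(welch_t(class0, class1)), j))
    # Sort by descending score; break ties by feature index for determinism.
    scores.sort(key=lambda s: (-s[0], s[1]))
    return [j for _, j in scores[:k]]


# Hypothetical toy data: 6 samples x 3 genes; only gene 0 separates the classes.
samples = [
    [1.0, 5.0, 2.0],
    [1.2, 5.1, 9.0],
    [0.9, 4.9, 4.0],
    [3.0, 5.0, 3.0],
    [3.1, 5.2, 8.0],
    [2.9, 4.8, 5.0],
]
labels = [0, 0, 0, 1, 1, 1]
selected = select_top_k(samples, labels, k=1)
```

Because each gene is scored independently of the others, this kind of filter parallelises naturally across features, which is one plausible reason it suits a cluster framework; the reduced feature set would then be passed to the classifier.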

Online publication date: Fri, 27-Apr-2018
