Title: Drawing inferences from clinical studies with missing values using genetic algorithm

Authors: R. Devi Priya; S. Kuppuswami

Addresses: Kongu Engineering College, Erode 638 052, Tamil Nadu, India ' Kongu Engineering College, Erode 638 052, Tamil Nadu, India

Abstract: Missing data problem degrades the statistical power of any analysis made in clinical studies. To infer valid results from such studies, suitable method is required to replace the missing values. There is no method which can be universally applicable for handling missing values and the main objective of this paper is to introduce a common method applicable in all cases of missing data. In this paper, Bayesian Genetic Algorithm (BGA) is proposed to effectively impute both missing continuous and discrete values using heuristic search algorithm called genetic algorithm and Bayesian rule. BGA is applied to impute missing values in a real cancer dataset under Missing At Random (MAR) and Missing Completely At Random (MCAR) conditions. For both discrete and continuous attributes, the results show better classification accuracy and RMSE% than many existing methods.

Keywords: missing values; BGA; Bayesian genetic algorithms; MAR; missing at random; MCAR; missing completely at random; continuous attributes; discrete attributes; clinical studies; bioinformatics.

DOI: 10.1504/IJBRA.2014.065245

International Journal of Bioinformatics Research and Applications, 2014 Vol.10 No.6, pp.613 - 627

Received: 09 Oct 2012
Accepted: 21 Dec 2012

Published online: 29 Apr 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article