Authors: Sherief Abdallah; Rasha Abdelsalam; Rania Seliem
Addresses: Faculty of Engineering and IT, British University in Dubai, 345015, UAE ' Faculty of Engineering and IT, British University in Dubai, 345015, UAE ' Haematology Unit Rashid Hospital, P.O. Box 4545, Dubai, UAE
Abstract: Flow cytometry (FCM) is a widely used technique in health-related fields, including cancer diagnosis and HIV monitoring. Measuring and quantifying the proximity between two patients based on the FCM data is challenging, yet crucial in most data mining tasks. Not only does each file contain thousands of features (representing different cells), but also the features are unordered. Furthermore, the data of a single patient can be divided over multiple FCS files due to technical limitations of FCM machines. We propose in this paper the use of Bayesian inference, along with Binning, to represent and measure the proximity between two patients using FCM data. We verify the effectiveness of our approach by comparing the performance of several classification algorithms in predicting leukaemia cases.
Keywords: FCM; flow cytometry; data mining; leukaemia; Bayesian inference.
International Journal of Data Science, 2018 Vol.3 No.2, pp.188 - 201
Available online: 27 May 2018 *Full-text access for editors Access for subscribers Purchase this article Comment on this article