Title: Message Passing Clustering (MPC): a knowledge-based framework for clustering under biological constraints

Authors: Huimin Geng, Xutao Deng, Hesham H. Ali

Addresses: Department of Computer Science, University of Nebraska at Omaha, Omaha, NE 68182, USA. ' Department of Computer Science, University of Nebraska at Omaha, Omaha, NE 68182, USA. ' Department of Computer Science, University of Nebraska at Omaha, Omaha, NE 68182, USA

Abstract: A new clustering algorithm, Message Passing Clustering (MPC), is proposed. MPC employs the concept of message passing to describe parallel and spontaneous clustering process by allowing data objects to communicate with each other. MPC also provides an extensible framework to accommodate additional features into clustering, such as adaptive feature weights scaling, stochastic cluster merging, and semi-supervised constraints guiding. Extensive experiments were performed using both simulation and real microarray gene expression and phylogenetic data. The results showed that MPC performed favourably to other popular clustering algorithms and MPC with the integration of additional features gave even higher accuracy rate than MPC.

Keywords: phylogenetics; microarray gene expression; feature scaling; stochastic process; semisupervised; message passing clustering; MPC; clustering algorithms; data mining; bioinformatics; biological constraints; knowledge-based clustering.

DOI: 10.1504/IJDMB.2008.019092

International Journal of Data Mining and Bioinformatics, 2008 Vol.2 No.2, pp.95 - 120

Published online: 28 Jun 2008 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article