Title: Bi-k-bi clustering: mining large scale gene expression data using two-level biclustering

Authors: Levent Carkacioglu, Rengul Cetin Atalay, Ozlen Konu, Volkan Atalay, Tolga Can

Addresses: Department of Computer Engineering, Middle East Technical University, Ankara, Turkey. ' Department of Molecular Biology and Genetics, Bilkent University, Ankara, Turkey. ' Department of Molecular Biology and Genetics, Bilkent University, Ankara, Turkey. ' Department of Computer Engineering, Middle East Technical University, Ankara, Turkey. ' Department of Computer Engineering, Middle East Technical University, Ankara, Turkey

Abstract: Due to the increase in gene expression data sets in recent years, various data mining techniques have been proposed for mining gene expression profiles. However, most of these methods target single gene expression data sets and cannot handle all the available gene expression data in public databases in reasonable amount of time and space. In this paper, we propose a novel framework, bi-k-bi clustering, for finding association rules of gene pairs that can easily operate on large scale and multiple heterogeneous data sets. We applied our proposed framework on the available NCBI GEO Homo sapiens data sets. Our results show consistency and relatedness with the available literature and also provides novel associations.

Keywords: biclustering; APD; association pattern discovery; Spearman rank correlation; gene expression analysis; data mining; bioinformatics; association rules; gene pairs.

DOI: 10.1504/IJDMB.2010.037548

International Journal of Data Mining and Bioinformatics, 2010 Vol.4 No.6, pp.701 - 721

Received: 05 Jul 2008
Accepted: 05 Jan 2009

Published online: 16 Dec 2010 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article