Title: A new clustering approach for learning transcriptional modules

Authors: Francesco Archetti; Ilaria Giordani; Giancarlo Mauri; Enza Messina

Addresses: DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy ' DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy ' DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy ' DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy

Abstract: In modern biology, we had an explosion of genomic data from multiple sources, like measurements of RNA levels, gene sequences, annotations or interaction data. These heterogeneous data provide important information that should be integrated through suitable learning methods aimed at elucidating regulatory networks. We propose an iterative relational clustering procedure for finding modules of co-regulated genes. This approach integrates information concerning known Transcription Factors (TFs)–gene interactions with gene expression data to find clusters of genes that share a common regulatory program. The results obtained on two well-known gene expression data sets from Saccharomyces cerevisiae are shown.

Keywords: gene transcriptional modules; gene clusters; relational clustering; regulatory networks; data mining; bioinformatics; transcription factors; gene expression data; Saccharomyces cerevisiae.

DOI: 10.1504/IJDMB.2012.049248

International Journal of Data Mining and Bioinformatics, 2012 Vol.6 No.3, pp.304 - 323

Accepted: 02 Oct 2010
Published online: 17 Dec 2014 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article