Title: Scalable mining, analysis and visualisation of protein-protein interaction networks

Authors: Shaikh Arifuzzaman; Bikesh Pandey

Addresses: Department of Computer Science, University of New Orleans, New Orleans, LA 70148, USA; Department of Computer Science, University of New Orleans, New Orleans, LA 70148, USA

Abstract: Proteins are linear-chain biomolecules that form the basis of functional networks in all organisms. Protein-protein interaction (PPI) networks are networks of protein complexes formed by biochemical events and electrostatic forces. PPI networks can be used to study diseases and discover drugs, because the causes of diseases are evident at the protein-interaction level. For instance, an elevation of the interaction edge weights of oncogenes is manifested in cancers. The availability of large datasets and the need for efficient analysis necessitate the design of scalable methods that leverage modern high-performance computing (HPC) platforms. In this paper, we design a lightweight framework on a distributed-memory parallel system to study PPI networks. Our framework supports automated analytics based on methods for extracting signed motifs, computing centrality, and finding functional units. We design Message Passing Interface (MPI)-based parallel methods and a workflow that scale to large networks. To the best of our knowledge, these capabilities collectively make our tool novel.

Keywords: protein interaction; biological networks; network visualisation; massive networks; HPC systems; network mining.

DOI: 10.1504/IJBDI.2019.100884

International Journal of Big Data Intelligence, 2019 Vol.6 No.3/4, pp.176-187

Received: 08 Mar 2018
Accepted: 16 May 2018

Published online: 19 Jul 2019
