Title: A network-based maximum link approach towards MS identifies potentially important roles for undetected ARRB1/2 and ACTB in liver cancer progression

Authors: Wilson Wen Bin Goh; Yie Hou Lee; Zubaidah M. Ramdzan; Maxey C.M. Chung; Limsoon Wong; Marek J. Sergot

Addresses: Department of Computing, Imperial College London, UK ' Singapore-MIT Alliance for Research and Technology, Singapore ' Rosalind and Morris Goodman Cancer Centre, McGill University, Canada ' Department of Biological Sciences and Department of Biochemistry, National University of Singapore, Singapore ' Department of Computer Science and Department of Pathology, National University of Singapore, Singapore ' Department of Computing, Imperial College London, UK

Abstract: Hepatocellular Carcinoma (HCC) ranks among the deadliest of cancers and has a complex etiology. Proteomics analysis using iTRAQ provides a direct way to analyse perturbations in protein expression during HCC progression from early- to late-stage but suffers from consistency and coverage issues. Appropriate use of network-based analytical methods can help to overcome these issues. We built an integrated and comprehensive Protein-Protein Interaction Network (PPIN) by merging several major databases. Additionally, the network was filtered for GO coherent edges. Significantly differential genes (seeds) were selected from iTRAQ data and mapped onto this network. Undetected proteins linked to seeds (linked proteins) were identified and functionally characterised. The process of network cleaning provides a list of higher quality linked proteins, which are highly enriched for similar biological process gene ontology terms. Linked proteins are also enriched for known cancer genes and are linked to many well-established cancer processes such as apoptosis and immune response. We found that there is an increased propensity for known cancer genes to be found in highly linked proteins. Three highly-linked proteins were identified that may play an important role in driving HCC progression - the G-protein coupled receptor signalling proteins, ARRB1/2 and the structural protein beta-actin, ACTB. Interestingly, both ARRB proteins evaded detection in the iTRAQ screen. ACTB was not detected in the original dataset derived from Mascot but was found to be strongly supported when we re-ran analysis using another protein detection database (Paragon). Identification of linked proteins helps to partially overcome the coverage issue in shotgun proteomics analysis. The set of linked proteins are found to be enriched for cancer-specific processes, and more likely so if they are more highly linked. Additionally, a higher quality linked set is derived if network-cleaning is performed prior. This form of network-based analysis complements the cluster-based approach, and can provide a larger list of proteins on which to perform functional analysis, as well as for biomarker identification.

Keywords: biological networks; PPINs; MaxLink; liver cancer; HCC; hepatocellular carcinoma; hepatitis B; proteomics expansion pipeline; protein-protein interaction networks; PPI networks; gene ontology; cancer genes; functional analysis; biomarkers.

DOI: 10.1504/IJBRA.2012.048967

International Journal of Bioinformatics Research and Applications, 2012 Vol.8 No.3/4, pp.155 - 170

Published online: 05 Dec 2014 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article