Title: Inference of domain-domain interactions by matrix factorisation and domain-level features

Authors: Tu Kien T. Le; Osamu Hirose; Lan Anh T. Nguyen; Thammakorn Saethang; Vu Anh Tran; Xuan Tho Dang; Duc Luu Ngo; Mamoru Kubo; Yoichi Yamada; Kenji Satou

Addresses: Graduate School of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Institute of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Graduate School of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Graduate School of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Graduate School of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Graduate School of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Graduate School of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Institute of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Institute of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan ' Institute of Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan

Abstract: In the development of new drugs and improved treatment of diseases, it is essential to understand molecular networks in living organism. Especially, it is important to identify interacting domains among proteins to elucidate hidden functions for protein-protein interactions (PPIs). To date, a number of computational methods have been developed for predicting domain-domain interactions (DDIs) from known PPIs. However, they often contain a large number of false positives while the number of known structures of protein complexes is limited. In this study, we aim to develop a new method of predicting DDIs by a link prediction approach. By using a learning model including low rank matrices as latent features in combination with biological features and topological features of the domain network, the experimental results showed that our method achieved a good performance and the predicted DDIs have high fraction sharing rate with the ones known as true in gold-standard databases.

Keywords: PPIs; protein-protein interactions; DDIs; domain-domain interactions; DDI inference; latent learning model; matrix factorisation; link prediction; low rank matrix; domain-level features; conditional sampling; non-DDIs; imbalanced training dataset; biological features; topological features; domain networks.

DOI: 10.1504/IJFIPM.2014.068174

International Journal of Functional Informatics and Personalised Medicine, 2014 Vol.4 No.3/4, pp.259 - 273

Available online: 19 Mar 2015 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article