Title: An ensemble learning approach for prediction of phosphorylation sites

Authors: Jinyan Huang

Addresses: School of Life Sciences and Technology, Tongji University, Shanghai 200092, China

Abstract: Protein phosphorylation plays a fundamental role in most of the cellular regulatory pathways. Experimental identification of phosphorylation sites is labour-intensive and often limited by the availability and optimisation of enzymatic reaction. An ensemble learning approach that combines different encodings using a meta-learner was developed which was catalyzed by four protein kinase families and three residues. A predictor is constructed to predict the true and false phosphorylation sites based on Support Vector Machines (SVM), and knowledge based encoding method is used for amino sequences. Different encoding methods catch different aspects of amino sequences feature. The stacking SVM approach was applied to combine these aspects and improved both sensitivity and specificity.

Keywords: knowledge based encoding; SVM; support vector machines; phosphorylation sites; ensemble learning; phosphorylation site prediction; amino sequences; protein kinase families; bioinformatics.

DOI: 10.1504/IJBRA.2013.053608

International Journal of Bioinformatics Research and Applications, 2013 Vol.9 No.3, pp.271 - 284

Received: 26 Sep 2011
Accepted: 05 Oct 2011

Published online: 06 Sep 2014 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article