TISRover: ConvNets learn biologically relevant features for effective translation initiation site prediction Online publication date: Thu, 13-Sep-2018
by Jasper Zuallaert; Mijung Kim; Arne Soete; Yvan Saeys; Wesley De Neve
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 20, No. 3, 2018
Abstract: Being a key component in gene regulation, translation initiation is a well-studied topic. However, recent findings have shown translation initiation to be more complex than initially thought, urging for more effective prediction methods. In this paper, we present TISRover, a multi-layered convolutional neural network architecture for translation initiation site prediction. We achieve state-of-the-art results, outperforming a previous deep learning approach by 4% to 23% in terms of auPRC, and other approaches by at least 68% in terms of error rate. Furthermore, we present a methodology to analyse the decision-making process of our network models, revealing various biologically relevant features for translation initiation site prediction that are automatically learnt from scratch, without any prior knowledge. The most notable features found are the Kozak consensus sequence, the reading frame characteristics, the influence of stop and start codons in the sequence, and the presence of donor splice site patterns.
Online publication date: Thu, 13-Sep-2018
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Data Mining and Bioinformatics (IJDMB):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email firstname.lastname@example.org