A hybrid method for splice site prediction based on Markov model and codon information
by Dan Wei; Yin Peng; Yanjie Wei; Qingshan Jiang; Jinglong Fang
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 16, No. 4, 2016

Abstract: Predicting splice sites is very important for gene identification. In this paper, we propose a hybrid splice site prediction method, SVM with Markov model and Codon usage (MC-SVM). The sequence features used for MC-SVM contain the codon bias information and the Markov probabilistic dependence information between adjacent nucleotides. Feature selection is performed using an F-score-based method, and then MC-SVM employs SVM to predict splice sites for both the acceptor and the donor sites. The test on the HS3D data set shows MC-SVM performs well for human gene sequences. The prediction accuracy of MC-SVM is 94.0% for donor splice sites, and 91.5% for acceptor splice sites on the data set with an equal amount of true and false splice site sequences. Compared with many other methods, MC-SVM achieved an improved prediction performance.

Online publication date: Sun, 12-Feb-2017

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Data Mining and Bioinformatics (IJDMB):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com