An amino acid property-based method for identifying solenoid proteins
by Senthilnathan Rajendran; Arunachalam Jothi
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 24, No. 3, 2020

Abstract: Solenoid proteins are proteins that contain repeating structural units. They are associated with many important biological functions and are also key factors in the onset of many human diseases, such as Huntington's disease, mental retardation and inherited ataxias. Detecting solenoid proteins from sequence information alone is a challenging problem. Current methods for identifying solenoid proteins from sequence rely heavily on homology-based approaches. In this work, we propose an alternative method that uses only the amino acid composition and a set of biophysical descriptors to identify solenoid proteins. Four machine learning approaches were used for classification: Naive Bayes (NB), Support Vector Machine (SVM), Bayesian Generalised Linear Models (BGLM) and Random Forest (RF). The four classification models were validated using cross-validation, and the Area Under the Curve (AUC) was above 0.9 for all of them. The entire procedure was performed using the R programming language.
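
To make the two quantities named in the abstract concrete, the following is a minimal sketch of how an amino acid composition vector and an AUC value could be computed. It is written in Python for illustration only; the authors report using R, and this is not their code. The function names `aa_composition` and `auc` are hypothetical, the biophysical descriptors are omitted because the abstract does not list them, and the AUC is computed with the standard pairwise ranking definition rather than any specific routine from the paper.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; this fixes the feature order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(sequence):
    """Return the fraction of each standard amino acid in `sequence`.

    Hypothetical helper: non-standard characters (e.g. X, gaps) are ignored.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive (label 1)
    receives a higher score than a randomly chosen negative (label 0),
    with ties counted as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))
```

A composition vector of this kind would serve as one part of the input to a classifier such as NB, SVM, BGLM or RF, and an AUC above 0.9 would mean that a positive example outscores a negative one in more than 90% of such pairs.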

Online publication date: Sun, 07-Feb-2021
