Title: Mining the Arabidopsis and rice genomes for cyclophilin protein families

Authors: S.O. Opiyo, E.N. Moriyama

Addresses: Department of Agronomy and Horticulture, University of Nebraska-Lincoln 68583, USA. ' School of Biological Sciences and Center for Plant Science Innovation, University of Nebraska-Lincoln, 403 Manter Hall, Lincoln, NE 68588-0118, USA

Abstract: Cyclophilins, which possess peptidyl-prolyl isomerase activity, are cellular targets of immunosuppressant drugs and involved in a wide variety of functions. While the Arabidopsis thaliana genome contains the largest number of cyclophilins, the number of plant cyclophilins available in databases is small compared to that of other organisms. It implies that many cyclophilins are yet to be identified in plants. In order to identify cyclophilin candidates from available plant sequence data, we examined alignment-free methods based on Partial Least Squares (PLS). PLS classifier performed better than profile hidden Markov models and PSI-BLAST in identifying cyclophilins from the Arabidopsis and rice genomes.

Keywords: plant cyclophilins; PLS classifier; partial least squares; profile hidden Markov models; HMM; bioinformatics; plant sequences; Arabidopsis genome; rice genome.

DOI: 10.1504/IJBRA.2009.026421

International Journal of Bioinformatics Research and Applications, 2009 Vol.5 No.3, pp.295 - 309

Published online: 11 Jun 2009 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article