Title: A novel multi-objective genetic algorithm for multiple sequence alignment

Authors: Mehmet Kaya; Buket Kaya; Reda Alhajj

Addresses: Department of Computer Engineering, Fırat University, Elazig, Turkey ' Department of Electrical and Electronics Engineering, Fırat University, Elazig, Turkey ' Department of Computer Sciences, University of Calgary, Calgary, AB, Canada

Abstract: There is no well-accepted theoretical model for multiple sequence alignment. An algorithm is accepted as a good method for multiple sequence alignment if it produces better fitness scores with respect to the benchmark datasets. For this purpose, we propose an efficient method using multi-objective genetic algorithm to discover optimal alignments in multiple sequence data. The main advantage of our approach is that a large number of trade-off alignments can be obtained by a single run with respect to conflicting objectives: alignment length minimisation and similarity and support maximisation. We compare our method with the four well-known multiple sequence alignment methods, MUSCLE, ClustalW, SAGA and MSA-GA. The first two of them are progressive methods, and the other two are based on evolutionary algorithms. Experimental results on the BAliBASE 2.0 database demonstrate that our method found better solutions than the others for most of the cases in terms of accuracy.

Keywords: multiple sequences; multiple sequence alignment; multi-objective genetic algorithms; bioinformatics.

DOI: 10.1504/IJDMB.2016.074684

International Journal of Data Mining and Bioinformatics, 2016 Vol.14 No.2, pp.139 - 158

Received: 20 Jul 2014
Accepted: 28 Apr 2015

Published online: 13 Feb 2016 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article