Feature prioritisation on big genomic data for analysing gene-gene interactions Online publication date: Wed, 21-Apr-2021
by Ahmad A. Aloqaily; Siamak Tafavogh; Bronwyn L. Harvey; Daniel R. Catchpoole; Paul J. Kennedy
International Journal of Bioinformatics Research and Applications (IJBRA), Vol. 17, No. 2, 2021
Abstract: Complex diseases are not caused by single genes but result from intricate non-linear interactions among them. There is a critical need to implement approaches that take into account non-linear gene-gene interactions in searching for markers that jointly cause diseases. Determining the interaction between more than two single nucleotide polymorphisms (SNP) within the whole genome data is computationally expensive and often infeasible. In this paper, we develop an approach to classify patients with Acute Lymphoblastic Leukaemia by analysing multiple SNP interactions. A novel feature prioritisation algorithm called interaction effect quantity (IEQ) selects SNPs with high potential of interaction by analysing their distribution throughout the genomic data and enables deeper analysis of non-linear interactions within large datasets. We show that IEQ enables analyses of interactions between up to four SNPs, with F-measure for classification greater than 89% obtained. Such an analysis is typically much more computationally challenging if IEQ is not implemented.
Online publication date: Wed, 21-Apr-2021
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Bioinformatics Research and Applications (IJBRA):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email firstname.lastname@example.org