Title: Longitudinal analysis for predicting amino acid changes in HIV-1 using association rule mining

Authors: Mounira Lakab; Abdelouahab Moussaoui

Addresses: Department of Computer Science, University of Abderrahmane Mira, Bejaia, 06000, Algeria ' Department of Computer Science, University of Feraht Abas Sétif, Sétif, 19000, Algeria

Abstract: The human immunodeficiency virus (HIV) remains a great challenge for humanity. HIV is characterised by high mutational rate, resulting in pathogenic variants that promotes the escape of immune response. In order to understand the correlations between amino acid mutations of the virus and quantify the evolutionary in HIV, we present a novel approach based on association rule mining (ARM) from protein sequence data taken at different time points. In this study, a longitudinal association rule mining (LARM) algorithm has been proposed. We collected the entire genome of 100 untreated HIV-1 infected patients over 3-5 years of infection, with 6-10 longitudinal samples per patient. We used the Los Alamos intra-patient search interface. Our experiments show the effectiveness of the proposed method in discovering major amino acid changes in comparison with the temporal analysis.

Keywords: association rule mining; longitudinal data; HIV-1; mutation; amino acid; data mining.

DOI: 10.1504/IJDMB.2025.142992

International Journal of Data Mining and Bioinformatics, 2025 Vol.29 No.1/2, pp.87 - 101

Received: 17 Jul 2023
Accepted: 08 Nov 2023

Published online: 02 Dec 2024 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article