Title: A new hybrid system combining active learning and particle swarm optimisation for medical data classification

Authors: Nawel Zemmal; Nabiha Azizi; Mokhtar Sellami; Soraya Cheriguene; Amel Ziani

Addresses: Department of Mathematics and Computer Science, Mohamed Cherif Messaadia University, Souk-Ahras, 41000, Algeria; Labged Laboratory, Computer Science Department, Badji Mokhtar University, Annaba, 23000, Algeria ' Labged Laboratory, Computer Science Department, Badji Mokhtar University, Annaba, 23000, Algeria ' Labged Laboratory, Computer Science Department, Badji Mokhtar University, Annaba, 23000, Algeria ' Labged Laboratory, Computer Science Department, Badji Mokhtar University, Annaba, 23000, Algeria; Computer Science Department, Saad Dahlab University, Blida, 9000, Algeria ' Lri laboratory, Information Research Laboratory, Badji Mokhtar University, Annaba, 23000, Algeria; Computer Science Department, Youcef Ben Khada University, Algiers, 16000, Algeria

Abstract: With the increase of unlabeled data in medical datasets, the labelling process becomes a more costly task. Therefore, active learning provides a framework to reduce the amount the manual labour process by querying an expert for just the labels of particular instances, the choice of these instances to annotate is paramount. However, the traditional active learning techniques can be computationally expensive as they require to analyse, at each iteration, all unlabeled instances including those that are redundant and uninformative, thereby decreasing the system performance. To handle this issue, it is necessary to have a global optimisation algorithm that allows finding the best solution in a reasonable time. This paper proposes a novel framework combining active learning and particle swarm optimisation algorithm. A novel uncertainty-based strategy was designed and integrated into the PSO as an objective function. This new strategy allows finding the most informative instances by calculating an uncertainty score using instance weighting method. Experiments were performed on binary and multi-class classification problems using both balanced and unbalanced medical datasets. Experimental results show that the proposed uncertainty strategy outperforms its existing counterparts. It achieves performances comparable to supervised methods.

Keywords: active learning; uncertainty sampling strategy; particle swarm optimisation; PSO; global optimisation problem; instance weighting; informativeness; unlabeled data; medical data.

DOI: 10.1504/IJBIC.2021.117427

International Journal of Bio-Inspired Computation, 2021 Vol.18 No.1, pp.59 - 68

Accepted: 21 Aug 2020
Published online: 06 Sep 2021 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article