Title: Semi-active learning to rank algorithms for document retrieval

Authors: Faiza Dammak; Hager Kammoun; Sawssen Ben Hmid; Abdelmajid Ben Hamadou

Addresses: MIRACL Laboratory, Institute of Computer Science and Multimedia of Sfax, Technology Center of Sfax, Sfax University, Tunis Road Km 10, B.P. 242, Sfax 3021, Tunisia (all four authors)

Abstract: Several search engine applications now use learning to rank techniques to train their ranking models, whose performance is strongly affected by the number of labelled examples in the training set. Because labels are scarce and expensive to acquire, active learning and semi-supervised learning aim to reduce the manual labelling workload. In this paper, we propose two alternative inductive learning to rank strategies that combine active and semi-supervised learning to assign relevance scores to an unlabelled set of document-query pairs, using selectively sampled and automatically labelled data. These proposals make it possible to exploit all collected data and to avoid some of the problems caused by using only active or only semi-supervised learning. We show through different ranking measures that the proposed algorithms yield competitive results compared with some other semi-supervised and active ranking algorithms on collections from the standard benchmark LETOR.

Keywords: learning to rank; active learning; semi-supervised learning; supervised learning; document retrieval.

DOI: 10.1504/IJIIDS.2017.087252

International Journal of Intelligent Information and Database Systems, 2017 Vol.10 No.3/4, pp.289 - 313

Received: 04 Apr 2016
Accepted: 15 Dec 2016

Published online: 11 Oct 2017
