Title: MOSSA: a morpho-semantic knowledge extraction system for Arabic information retrieval

Authors: Nadia Soudani; Ibrahim Bounhas; Yahya Slimani

Addresses: LISI Laboratory of Computer Science for Industrial Systems, Carthage University, Tunisia; National school of Computer Sciences, University of Manouba, Tunisia; Joint group for Artificial Reasoning and Information Retrieval, Manouba University, Tunisia ' LISI Laboratory of Computer Science for Industrial Systems, Carthage University, Tunisia; Higher Institute of Documentation, University of Manouba, Tunisia; Joint group for Artificial Reasoning and Information Retrieval, Manouba University, Tunisia ' LISI Laboratory of Computer Science for Industrial Systems, Carthage University, Tunisia; Higher Institute of Multimedia Arts of Manouba, University of Manouba, Tunisia; Joint group for Artificial Reasoning and Information Retrieval, Manouba University, Tunisia

Abstract: In this paper, we propose to exploit different morpho-semantic resources to enhance Arabic information retrieval (IR). We use standardised LMF Arabic dictionaries and Arabic corpora. Our goal by this communication is to take advantage of the different existing resources to extract useful knowledge for Arabic IR. We equally study the impact of the Arabic morphology on IR effectiveness. Several query expansion strategies are carried based on morphological, semantic and morpho-semantic relations. In addition, combining such knowledge is also studied and evaluated. We experiment the effect of short diacritics and part of speech (POS) disambiguation and tagging in the indexing step. A graph-based representation is used to formalise knowledge resources graph-based representation. This latter represents a powerful formalism to express semantics of texts and to support NLP tools and applications as IR. Several experimental comparisons are handled between the different used knowledge resources and the different carried IR approaches.

Keywords: Arabic information retrieval; morpho-semantic knowledge; query expansion; graph-based; knowledge representation.

DOI: 10.1504/IJKWI.2019.103622

International Journal of Knowledge and Web Intelligence, 2019 Vol.6 No.2, pp.106 - 141

Received: 03 May 2018
Accepted: 07 Feb 2019

Published online: 15 Nov 2019 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article