MOSSA: a morpho-semantic knowledge extraction system for Arabic information retrieval Online publication date: Fri, 15-Nov-2019
by Nadia Soudani; Ibrahim Bounhas; Yahya Slimani
International Journal of Knowledge and Web Intelligence (IJKWI), Vol. 6, No. 2, 2019
Abstract: In this paper, we propose to exploit different morpho-semantic resources to enhance Arabic information retrieval (IR). We use standardised LMF Arabic dictionaries and Arabic corpora. Our goal by this communication is to take advantage of the different existing resources to extract useful knowledge for Arabic IR. We equally study the impact of the Arabic morphology on IR effectiveness. Several query expansion strategies are carried based on morphological, semantic and morpho-semantic relations. In addition, combining such knowledge is also studied and evaluated. We experiment the effect of short diacritics and part of speech (POS) disambiguation and tagging in the indexing step. A graph-based representation is used to formalise knowledge resources graph-based representation. This latter represents a powerful formalism to express semantics of texts and to support NLP tools and applications as IR. Several experimental comparisons are handled between the different used knowledge resources and the different carried IR approaches.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Knowledge and Web Intelligence (IJKWI):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com