Title: Arabic text semantic-based query expansion

Authors: Nuhu Yusuf; Mohd Amin Mohd Yunus; Norfaradilla Wahid; Aida Mustapha; Nazri Mohd Nawi; Noor Azah Samsudin

Addresses: Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia; Management and Information Technology Department, Abubakar Tafawa Balewa University Bauchi, Bauchi State, Nigeria ' Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia ' Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia ' Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia ' Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia ' Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia

Abstract: Query expansions are being used in many search applications for retrieving relevant documents. Although retrieving the relevant documents are important for search users, the complexity of Arabic morphology remains a challenge. As such, many irrelevant documents were still retrieved from the ranked results. To address this challenge, this paper proposes a new searching method for Arabic text semantic-based query expansion. The proposed method combines Arabic word synonyms and ontology to expand the query with additional terms. Specifically, the proposed method combined lexical words within the ranking algorithm and then improved with ontology links to expand query. The performance of Arabic text semantic-based query expansion was evaluated in terms of average precision, means average precision and means reciprocal rank. Experiments on Quran datasets show that the proposed method using Arabic text semantic-based query expansion approach outperforms the previous methods using other dataset which is called Tafsir dataset. The proposed method achieved 15.44% mean average precision.

Keywords: Arabic text; semantic search; query expansion; lexical words; ontology; ranking algorithms.

DOI: 10.1504/IJDMMM.2022.122037

International Journal of Data Mining, Modelling and Management, 2022 Vol.14 No.1, pp.30 - 40

Received: 30 Nov 2019
Accepted: 26 Jun 2020

Published online: 08 Apr 2022 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article