Title: ArabOnto: experimenting a new distributional approach for building Arabic ontological resources

Authors: Ibrahim Bounhas; Bilel Elayeb; Fabrice Evrard; Yahya Slimani

Addresses: Faculty of Sciences of Tunis, Department of Computer Science, University of Tunis, 1060 Tunis, Tunisia; RIADI-GDL Research Laboratory, The National School of Computer Sciences (ENSI), 2010 Manouba, Tunisia; The Computer Science Research Institute of Toulouse (IRIT), 02 Rue Camichel, 31071 Toulouse, France; Faculty of Sciences of Tunis, Department of Computer Science, University of Tunis, 1060 Tunis, Tunisia

Abstract: Ontologies are useful for modelling and retrieving knowledge in complex information systems. Ontology construction environments use statistical and linguistic information to extract knowledge from corpora. Given the considerable progress in this field, there is a need to support the Arabic language in these environments. We present the ArabOnto architecture, which models the process of extracting Arabic ontologies from corpora. ArabOnto focuses on linguistic issues related to Arabic term extraction and linking (i.e. from morphosyntactic parsing to clustering). We evaluate our system by testing several alternatives on three domains. In addition, our ontologies are validated in the context of an information retrieval system.

Keywords: Arabic language; ontology development; distributional analysis; terminology organisation; semantic similarity; information retrieval; ontology evaluation; ontological resources; modelling; ontology extraction; linguistics; term extraction; linking.

DOI: 10.1504/IJMSO.2011.046578

International Journal of Metadata, Semantics and Ontologies, 2011 Vol.6 No.2, pp.81-95

Received: 03 Jan 2011
Accepted: 09 Jun 2011

Published online: 12 Feb 2015
