Title: An implementation method for Arabic keyword tendency using decision tree
Authors: Hasan Hashim; El-Sayed Atlam; Ahmad Reda Alzighaibi; Malik Almaliki
Addresses: College of Computer Science and Engineering, Taibah University (KSA), Yanbu, Madina, Saudi Arabia ' College of Computer Science and Engineering, Taibah University (KSA), Yanbu, Madina, Saudi Arabia; Faculty of Engineering, Tokushima University, Tokushima, Japan ' College of Computer Science and Engineering, Taibah University (KSA), Yanbu, Madina, Saudi Arabia ' College of Computer Science and Engineering, Taibah University (KSA), Yanbu, Madina, Saudi Arabia
Abstract: Generally, keyword recurrences change over certain periods of time. Traditional approaches estimated classes (increasing, relatively constant and decreasing) that indicate keywords' attribute changes in a document over certain periods of time using a decision tree. Furthermore, all earlier approaches are based on keywords in English and French languages. Therefore, the extension of keywords to other languages such as Arabic could strengthen further researches in this domain. This paper introduces new method to extract Arabic keywords from corpora based on their recurrences changes in a document over given periods of time using a decision tree. The new approach is applied on new data set field (computer science) which makes it different to traditionally used methods. The comparison between the manually classified results and the evaluation of the decision tree results reveals that F-measures of decreasing, relatively constant and increasing classes were 0.811, 0.917, and 0.897 which indicates effectiveness of our method.
Keywords: Arabic Wikipedia dumps; Alhayah newspaper; time-changing; keywords attribute; decision tree.
DOI: 10.1504/IJCAT.2020.107911
International Journal of Computer Applications in Technology, 2020 Vol.63 No.1/2, pp.55 - 63
Received: 21 Aug 2019
Accepted: 25 Dec 2019
Published online: 30 Jun 2020 *