Title: A hybrid optimal weighting scheme and machine learning for rendering sentiments in tweets
Authors: Walid Cherif; Abdellah Madani; Mohamed Kissi
Laboratory LIMA, Department of Computer Science, University Chouaib Doukkali, Faculty of Sciences, B.P. 20, 24000, El Jadida, Morocco
Laboratory LAROSERI, Department of Computer Science, University Chouaib Doukkali, Faculty of Sciences, B.P. 20, 24000, El Jadida, Morocco
Laboratory LIM, Department of Computer Science, University Hassan II Casablanca, Faculty of Sciences and Technology, B.P. 146, 20650, Mohammedia, Morocco
Abstract: Over recent years, the world has experienced an explosive growth in the volume of shared web texts. Every day, a huge volume of opinions is generated in various forms such as articles, reviews and tweets. In general, opinion mining refers to the task of extracting opinions, and sentiment analysis is the technique that extracts subjectivity and polarity; in other words, it determines whether a text is positive or negative (Taboada et al., 2011). Arabic sentiment analysis is conducted in this study using a publicly available data set written in both Modern Standard Arabic and the Jordanian dialect. A new mathematical approach is introduced to determine the polarity of a tweet by using four functions whose parameters are the solutions of a linear program. The outputs of these functions are then classified using support vector machines and k-nearest neighbours. The results show that the proposed approach is considerably reliable in Arabic sentiment analysis.
Keywords: automatic language processing; low level light stemming; sentiment analysis; support vector machines; SVM; k-nearest neighbour; KNN; hybrid weighting; optimal weighting; machine learning; sentiments; tweets; Twitter; Arabic.
Int. J. of Intelligent Engineering Informatics, 2016 Vol.4, No.3/4, pp.322 - 339
Submission date: 25 Sep 2015
Date of acceptance: 13 May 2016
Available online: 16 Nov 2016