Title: Grammar rule-based sentiment categorisation model for classification of Tamil tweets

Authors: Nadana Ravishankar; R. Shriram

Addresses: Department of Computer Science and Engineering, B.S. Abdur Rahman University, Chennai – 600048, India ' Department of Computer Science and Engineering, B.S. Abdur Rahman University, Chennai – 600048, India

Abstract: The advent of social media has enabled people to easily and publicly express their ideas on a movie/product in such a way that it reaches millions of people within no time. This research aims to implement a tool that would be helpful in predicting the genre of the movies as perceived by the audience through linguistic rules and natural language processing (NLP) tool kit. This paper focuses on development of rule-based sentiment categorising tool for Tamil tweets and a tool has been developed using Python and NLP tool kit. Furthermore, a model is designed to determine the opinion along with genre classification of Tamil movies. For this work, a set of genres are selected from Tamil movies with public tweets based on sentiment analysis. We find that the tool classifies the genre of a particular movie provided by user tweets and validated our approach with domain experts and baseline models.

Keywords: Tamil tweets; sentiment analysis; natural language processing; NLP; grammar rules; data mining.

DOI: 10.1504/IJISTA.2018.091589

International Journal of Intelligent Systems Technologies and Applications, 2018 Vol.17 No.1/2, pp.89 - 97

Received: 18 Feb 2017
Accepted: 31 Mar 2017

Published online: 03 May 2018 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article