Authors: Satish M. Srinivasan; Ruchika Chari; Abhishek Tripathi
Addresses: School of Graduate Professional Studies, Penn State Great Valley, Malvern, PA 19355, USA ' School of Graduate Professional Studies, Penn State Great Valley, Malvern, PA 19355, USA ' School of Business, The College of New Jersey, Ewing, NJ 08628, USA
Abstract: Predictive analytics on Twitter feeds is becoming a popular field for research. A tweet holds wealth of information on how an individual express and communicates their feelings and emotions within their social network. Large-scale mining of tweets will not only help in capturing an individual's emotion but also the emotions of a larger group. In this study, an emotion-based classification scheme has been proposed. By training the naïve Bayes multinomial and the random forest classifiers on different training datasets, emotion classification was performed on the test dataset containing tweets related to the 2016 US presidential election. Upon classifying the tweets in the test dataset to one of the four basic emotion types: anger, happy, sadness and surprise, and by determining the sentiments of the people, we have tried to portray the flux in the emotional landscape of the people towards the presidential candidates in the 2016 US election.
Keywords: emotion classification; Twitter data analysis; US presidential election; supervised classifier; random forest; naïve Bayes multinomial; NBM.
International Journal of Data Mining, Modelling and Management, 2021 Vol.13 No.4, pp.337 - 350
Received: 27 Apr 2020
Accepted: 12 Jun 2020
Published online: 07 Dec 2021 *