Title: Comparing different classifiers and feature selection techniques for emotion classification
Authors: Satish Mahadevan Srinivasan; Prashanth Ramesh
Addresses: Penn State Great Valley, School of Graduate Professional Studies, 30 East Swedesford Road, Malvern, PA 19355-1443, USA ' Penn State Great Valley, School of Graduate Professional Studies, 30 East Swedesford Road, Malvern, PA 19355-1443, USA
Abstract: In this study, we have explored the potentiality of six different supervised machine learning approaches and feature selection filters to recognize four basic emotions (anger, happy, sadness and surprise) using three different heterogeneous emotion-annotated dataset which combines sentences from news headlines, fairy tales and blogs. For classification purpose, we have chosen the feature set to include the bag-of-words. Our study reveals the fact that the use of the resampling filter and the feature selection filters together contribute towards boosting the prediction accuracies of the classifiers. However, the boosting capabilities are more profound in the resampling filter in comparison to the use of the different feature selection filters.
Keywords: text mining; feature selection techniques; supervised classifiers; emotion classification; machine learning; WEKA; synthetic dataset; resampling; attribute selection; society systems science.
DOI: 10.1504/IJSSS.2018.095595
International Journal of Society Systems Science, 2018 Vol.10 No.4, pp.259 - 284
Received: 21 Mar 2018
Accepted: 25 Apr 2018
Published online: 11 Oct 2018 *