Title: CASD on enhancing sentiment analysis using context-aware sarcasm detection on social media
Authors: G. Paul Davidson; D. Ravindran; R. Anne Pratheeba
Addresses: Department of Computer Science, St. Joseph's College (Autonomous), Trichy, Tamil Nadu, India; Affiliated to: Bharathidasan University, India ' Department of Computer Science, St. Joseph's College (Autonomous), Trichy, Tamil Nadu, India; Affiliated to: Bharathidasan University, India ' Department of Computer Science and Engineering, CARE College of Engineering, Trichy, Tamil Nadu, India; Affiliated to: Anna University, Chennai, India
Abstract: Effective sentiment analysis is crucial for understanding human language, especially when dealing with sarcasm. This study enhances sentiment analysis by integrating BERT contextual embeddings with an ensemble learning classifier to recognise sarcasm. A custom-labelled dataset rich in sarcastic elements was created to train and refine the model, utilising a random forest classifier for its robustness with complex datasets. Comparative analysis against standard models, including BOW-LR, VADER, SVM-TF-IDF, and LSTM, showed that the CASD model significantly improved performance metrics, achieving an accuracy of 0.85 and consistently outperforming baseline models across all sentiment classes. Notably, CASD achieved a precision score of 0.86 for neutral sentiment detection, illustrating its sensitivity to linguistic subtleties. This research introduces a novel framework that effectively accounts for sarcasm, leveraging BERT's contextual understanding and random forest's ensemble classification to advance sentiment analysis accuracy. This improvement is vital for applications requiring fine-grained sentiment detection, showcasing the potential for sophisticated natural language processing technologies to reflect the complexities of human communication more accurately.
Keywords: support vector machine with term frequency-inverse document frequency; SVM-TF-IDF; valence aware dictionary and sentiment reasoner; VADER; context-aware sarcasm detection; CASD; bidirectional encoder representations from transformers; BERT; natural language processing; NLP; long short-term memory; LSTM; bag-of-words with logistic regression; BOW-LR.
DOI: 10.1504/IJIEI.2025.148583
International Journal of Intelligent Engineering Informatics, 2025 Vol.13 No.3, pp.267 - 296
Received: 01 Feb 2024
Accepted: 20 Jul 2024
Published online: 14 Sep 2025 *