Title: Combination of domain knowledge and deep learning for sentiment analysis of short and informal messages on social media

Authors: Khuong Vo; Tri Nguyen; Dang Pham; Mao Nguyen; Minh Truong; Trung Mai; Tho T. Quan

Addresses: YouNet Group, Ho Chi Minh City, Vietnam ' YouNet Group, Ho Chi Minh City, Vietnam ' YouNet Group, Ho Chi Minh City, Vietnam ' YouNet Group, Ho Chi Minh City, Vietnam ' YouNet Group, Ho Chi Minh City, Vietnam ' Ho Chi Minh City University of Technology, Vietnam National University – HCMC, Ho Chi Minh City, Vietnam ' Ho Chi Minh City University of Technology, Vietnam National University – HCMC, Ho Chi Minh City, Vietnam

Abstract: Sentiment analysis has been emerging recently as one of major natural language processing (NLP) tasks, with the increasing significance of social media channels for brands to observe user opinions about their products. In the previous work, we proposed to combine the typical deep learning techniques with domain knowledge. The combination is used for acquiring additional training data augmentation and a more reasonable loss function. However, there is a high volume of short and informal messages posted by users which makes the existing works suffer from many difficulties. In this work, we further improve our architecture, aiming to handle those problems, by various substantial enhancements, including negation-based data augmentation, transfer learning for word embeddings, combination of word-level embeddings and character-level embeddings, and using multitask learning technique for attaching domain knowledge rules in the learning process. Those enhancements help us to enjoy significant improvement in performance once experimenting on real datasets.

Keywords: sentiment analysis; deep learning; domain knowledge; recurrent neural networks; transfer learning; multi-task learning; data augmentation; informal messages.

DOI: 10.1504/IJCVR.2019.102286

International Journal of Computational Vision and Robotics, 2019 Vol.9 No.5, pp.458 - 485

Available online: 11 Sep 2019 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article