Authors: Nidhi Chandra; Sunil Kumar Khatri; Subhranil Som
Addresses: Amity School of Engineering and Technology, Amity University, Noida, Uttar Pradesh, India ' Amity School of Engineering and Technology, Amity University, Noida, Uttar Pradesh, India ' Amity School of Engineering and Technology, Amity University, Noida, Uttar Pradesh, India
Abstract: Social Networking and micro-blogging applications provide active platforms for communications, sharing thoughts and ideas. Processing natural text coming from varied social platforms possess many technical challenges such as processing messages written in slang, informal short messages, classifying messages into different labels and category based on the meaning. Maximum natural text processing and interpretation systems use n-gram language models, which can be simple and powerful most of the time. Random forest ensemble-based classifier has the potential to generalise the unseen data as compared to n-gram language models. Anti-social messages are a significant problem in social media. In this paper we present an approach to classify the natural language text as anti-social text using Random Forest classifier. In this paper we are addressing the challenge to identify anti-social messages using this algorithm using vector ensemble technique to classify anti-social text in offline mode. Word to vector approach has been used for word embeddings to train the model. This paper combines word to vector approach with random forest classifier using a multilayer network.
Keywords: natural language processing; random forest; ensemble classifier; anti-social behaviour analysis; word to vector.
International Journal of Applied Management Science, 2022 Vol.14 No.1, pp.38 - 56
Received: 17 Apr 2019
Accepted: 28 Sep 2019
Published online: 23 Feb 2022 *