Title: Predicting movie genre from plot summaries using Bi-LSTM network

Authors: Prakhar Srivastava; Pankaj Srivastava

Addresses: Department of Information Technology, ABV – Indian Institute of Information Technology and Management, Gwalior, Madhya Pradesh, India; Department of Applied Physics, ABV – Indian Institute of Information Technology and Management, Gwalior, Madhya Pradesh, India

Abstract: Movie plot summaries are highly indicative of the genre to which a movie belongs. The words present in a plot summary often indicate which emotion the movie portrays. In this paper, we predict movie genres by feeding plot summaries to our proposed model. To build word representations that the model can process, we use Facebook's fastText library. The model uses a Bi-LSTM network and a ranking system based on posterior probability scores to determine the movie genre. We split each plot summary into sentences and predict the genre associated with each sentence. We then fuse the per-sentence decisions into a collective decision for the plot summary using the majority voting algorithm. We evaluate both document-level and sentence-level approaches to genre prediction. Comparing the results, we find that the sentence-level approach using the Bi-LSTM network performs better than the document-level approach using the same network. As baselines, we use recurrent neural networks (RNN) and logistic regression (LR) and compare their results with those of our proposed model.
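
The sentence-level fusion step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`split_sentences`, `majority_vote`, `predict_genre`), the regex-based sentence splitter, the tie-breaking rule, and the keyword-based `toy_sentence_classifier` are all assumptions introduced for illustration. In particular, the toy classifier merely stands in for the trained Bi-LSTM, and the posterior-probability ranking mentioned in the abstract is not modelled here.

```python
import re
from collections import Counter
from typing import Callable, List


def split_sentences(summary: str) -> List[str]:
    """Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", summary.strip())
    return [p for p in parts if p]


def majority_vote(labels: List[str]) -> str:
    """Return the most frequent label.

    Tie-break (assumption): among equally frequent labels, the one that
    appears earliest in sentence order wins.
    """
    if not labels:
        raise ValueError("cannot vote on an empty list of labels")
    counts = Counter(labels)
    top = max(counts.values())
    for label in labels:
        if counts[label] == top:
            return label
    raise AssertionError("unreachable: some label always reaches the top count")


def predict_genre(summary: str, sentence_classifier: Callable[[str], str]) -> str:
    """Classify each sentence, then fuse the per-sentence decisions by majority vote."""
    labels = [sentence_classifier(s) for s in split_sentences(summary)]
    return majority_vote(labels)


def toy_sentence_classifier(sentence: str) -> str:
    """Hypothetical stand-in for a trained sentence-level model (keyword rules only)."""
    text = sentence.lower()
    if "laugh" in text or "joke" in text:
        return "comedy"
    if "murder" in text or "detective" in text:
        return "thriller"
    return "drama"


if __name__ == "__main__":
    plot = (
        "A detective finds a body. The murder shocks the town. "
        "Everyone laughs at a joke."
    )
    print(predict_genre(plot, toy_sentence_classifier))
```

Any callable that maps a sentence to a genre label can be passed as `sentence_classifier`, so a trained model could replace the toy rules without changing the voting logic.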

Keywords: NLP; movie genre prediction; Bi-LSTM; word embeddings; recurrent neural networks; fasttext; majority voting algorithm.

DOI: 10.1504/IJSI.2021.118610

International Journal of Swarm Intelligence, 2021 Vol.6 No.2, pp.168 - 177

Received: 29 Jun 2020
Accepted: 27 Nov 2020

Published online: 29 Oct 2021
