Title: Analysis of drug reviews using data analytics and sentiment analysis
Authors: Bijayalaxmi Panda; Chhabi Rani Panigrahi; Bibudhendu Pati
Addresses: Department of Computer Science and Technology, Gita Autonomous College, Bhubaneswar, India ' Department of Computer Science, Rama Devi Women's University, Bhubaneswar, India ' Department of Computer Science, Rama Devi Women's University, Bhubaneswar, India
Abstract: This study endeavours to manage and analyse the volume of data in our daily lives by employing a drug review dataset from the UCI machine learning library. Focusing on six key parameters - drug-id, name, condition, review, rating, and usefulness count - we meticulously curated a subset of dataset, targeting the top ten conditions. Our approach integrates sentiment analysis (SA) and exploratory data analysis (EDA) to glean insights. The EDA provides an overview, including details on the top ten drugs for each condition, their respective value counts, mean ratings, mean polarity, the number of reviews, and quantity of medications. Our experimentation reveals phentermine as the most popular and efficacious drug for obesity, based on mean rating and polarity, while ethinyl estradiol or norethindrone emerges as the least favoured drug by rating, and Skyla by mean polarity. This work aims to contribute valuable insights into drug efficacy and popularity, facilitating informed decision-making in healthcare.
Keywords: drug review; exploratory data analysis; EDA; sentiment analysis; polarity; rating; machine learning.
DOI: 10.1504/IJCSE.2025.149767
International Journal of Computational Science and Engineering, 2025 Vol.28 No.6, pp.619 - 627
Received: 26 Oct 2023
Accepted: 16 Feb 2024
Published online: 12 Nov 2025 *