Title: A random forest algorithm based customer demand forecasting model for sports enterprises in the real economy

Authors: Yingge Feng; Xiaowei Xu; Rongna Wang

Addresses: Sports Art and Labor Education Center, Zhejiang Shuren University, Hangzhou, 310015, China ' Sports Art and Labor Education Center, Zhejiang Shuren University, Hangzhou, 310015, China ' School of Humanities and foreign languages, Zhejiang Shuren University, Hangzhou, 310015, China

Abstract: The study proposes two strategies to optimise the random forest algorithm (RF) by fine-tuning the data distribution and introducing customer life values, constructing the improved random forest algorithm (IRF), and building a customer churn prediction model based on the IRF algorithm. A churn segmentation model is constructed based on the k-means algorithm to classify customers according to their characteristics in order to predict their needs and thus develop differentiated strategies to retain them. The experimental results show that the IRF prediction model has an accuracy of 99.84% and an AUC value of 0.932. The above results show that the accuracy of the IRF algorithm can meet the actual demand, which will contribute to the long-term development of sports enterprises. In the future, it is necessary to consider the relationship and interaction between customers to further improve the prediction accuracy of customer demand.

Keywords: random forest algorithm; customer churn; data mining; K-means algorithm; customer lifetime value.

DOI: 10.1504/IJKBD.2023.133337

International Journal of Knowledge-Based Development, 2023 Vol.13 No.2/3/4, pp.363 - 378

Received: 09 Aug 2022
Accepted: 10 Apr 2023

Published online: 12 Sep 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article