Title: Evaluation of cigarette market state based on multi-source data modelling

Authors: Taicheng Wei; Hao Chen; Yuting Ou; Chen Zhang; Haiying Li; Yue Huang; Yanbing Liu

Addresses: China Tobacco Guangxi Industrial Co., Ltd., Nanning, Guangxi, 530001, China ' China Tobacco Guangxi Industrial Co., Ltd., Nanning, Guangxi, 530001, China ' Jiangsu Lianyungang Tobacco Co., Ltd., Lianyungang, Jiangsu, 222000, China ' China Sciences Group Known (Beijing) Technology Co., Ltd., Bejing, 100190, China ' China Sciences Group Known (Beijing) Technology Co., Ltd., Bejing, 100190, China ' China Sciences Group Known (Beijing) Technology Co., Ltd., Bejing, 100190, China ' China Tobacco Guangxi Industrial Co., Ltd., Nanning, Guangxi, 530001, China

Abstract: Traditional cigarette market forecasting model usually has a low accuracy since it did not take the external data into account. Thus, a random forest was firstly used to extract features of data and rank the importance of influencing factors. Then, different external factors were eliminated, the percentage of reduced model interpretation was demonstrated, and expert feedback was introduced to input evaluation values. After optimising the training RF-LSTM model, the prediction of the whole market sales status were finally constructed, and the historical week cigarette market status evaluation model was also established. The proposed machine learning model had a high prediction accuracy and generalisation based on the local market data in province Guangxi of China. Overall results demonstrated that it can accurately and conveniently evaluate the market status of cigarettes.

Keywords: multi-source data; cigarette market; evaluation; deep learning; machine learning.

DOI: 10.1504/IJDS.2023.132295

International Journal of Data Science, 2023 Vol.8 No.3, pp.258 - 273

Received: 19 Sep 2022
Accepted: 08 Dec 2022

Published online: 17 Jul 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article