Title: Web data mining algorithm based on cloud computing environment

Authors: Yunpeng Liu; Xiaolong Gu; Jie Zhang

Addresses: College of Information Engineering, Jiaozuo University, Jiaozuo 454003, Henan, China ' National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Nan'an, Chongqing 400067, China; College of Computer Science and Technology, Chongqing University of posts and Telecommunications, Nan'an, Chongqing 400065, China ' National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Nan'an, Chongqing 400065, China

Abstract: The purpose of this article is to study web data mining algorithms in the cloud. In order to quickly extract valuable rules and patterns from massive and noisy data, and make them easy to understand and directly apply, we use data mining technology. On the other hand, based on the low cost of cloud computing, large throughput, good fault tolerance, and strong stability, the web chose the cloud computing method. This paper studies and analyses the K-Means clustering algorithm, and uses the web data mining algorithm based on the cloud computing environment to improve the K-Means algorithm, overcomes the shortcomings of the K-Means algorithm itself, and builds a good cloud computing environment. The research results show that the improved and optimised algorithm in this paper solves the problem of insufficient speed and efficiency in the clustering process.

Keywords: cloud computing; data mining; clustering algorithm; k-means algorithm.

DOI: 10.1504/IJGUC.2021.119552

International Journal of Grid and Utility Computing, 2021 Vol.12 No.4, pp.359 - 368

Received: 30 Jun 2020
Accepted: 15 Aug 2020

Published online: 09 Dec 2021 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article