Authors: Li Liu
Addresses: Yunnan Power Grid Co., Ltd., Information Center, Kunming, Yunnan, 650000, China
Abstract: The traditional method of information management and storage in power grid enterprises has some problems, such as time-consuming uploading and downloading of files and unclear classification of information. Therefore, this paper proposes a digital information storage method based on random forest for power grid enterprises. After information was collected, Canopy clustering technology was used to clean the data to avoid the interference of repeated data in the classification process of information. Then the random forest algorithm is used to divide the information categories, and then the digital information storage management module is constructed based on the relational database, and the information storage management is completed in a modular way. Experimental results show that the method takes less time to upload and download files, and the highest accuracy of information classification can reach 93.1%, indicating that the method effectively improves the effect of digital information storage.
Keywords: random forest; digital information; data mining; canopy clustering technology.
International Journal of Internet Manufacturing and Services, 2022 Vol.8 No.3, pp.243 - 253
Received: 30 Dec 2021
Accepted: 28 Mar 2022
Published online: 18 Jul 2022 *