Title: Traceability method for multisource heterogeneous data of power grid business based on multidimensional feature clustering

Authors: Zhiguo Zhou; Hao Chen; Mengxiao Ni

Addresses: State Grid Zhejiang Electric Power Company, Quzhou Power Supply Company, Quzhou, Zhejiang, China; State Grid Zhejiang Electric Power Company, Hangzhou, Beijing, China; State Grid Zhejiang Electric Power Company, Quzhou Power Supply Company, Quzhou, Zhejiang, China

Abstract: To shorten data traceability delay and improve traceability performance, this paper proposes a traceability method for multisource heterogeneous data of power grid business based on multidimensional feature clustering. First, the multisource heterogeneous data are preprocessed through conversion of unstructured data and data cleaning. Then, multidimensional features of the data are extracted through sliding clustering, and the extracted features are fused through fuzzy decision-making. Finally, the K-value in the RFID encoding is combined with the hash value in the index structure to perform forward tracing, backward tracing and process tracing on the data. Experimental results show that, with this method, the maximum traceability delay is 2861 ms, the traceability error rate stays at around 0.85% and the data integrity coefficient reaches up to 97.96%. The method can therefore trace power grid data quickly and accurately.
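
The forward and backward tracing mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the K-value in the RFID encoding or the index structure is built, so neither is modeled here. The sketch only assumes that each record gets a hash key and that processing steps are stored as parent-to-child links. The names record_key and LineageIndex, and the source labels "raw", "cleaned" and "fused", are hypothetical.

```python
import hashlib
from collections import defaultdict


def record_key(source: str, record_id: str) -> str:
    """Hash a (source, id) pair into a fixed-length index key (assumed scheme)."""
    return hashlib.sha256(f"{source}:{record_id}".encode("utf-8")).hexdigest()


class LineageIndex:
    """Hypothetical hash-keyed index of derivation links between records."""

    def __init__(self):
        self.children = defaultdict(set)  # key -> keys derived from it
        self.parents = defaultdict(set)   # key -> keys it was derived from

    def add_step(self, parent_key: str, child_key: str) -> None:
        """Register that child_key was produced from parent_key."""
        self.children[parent_key].add(child_key)
        self.parents[child_key].add(parent_key)

    @staticmethod
    def _walk(start: str, edges) -> set:
        """Collect every key reachable from start along the given links."""
        seen, stack = set(), [start]
        while stack:
            node = stack.pop()
            for nxt in edges.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        return seen

    def trace_forward(self, key: str) -> set:
        """Forward tracing: everything derived from this record."""
        return self._walk(key, self.children)

    def trace_backward(self, key: str) -> set:
        """Backward tracing: every record this one was derived from."""
        return self._walk(key, self.parents)


# Example chain: raw record -> cleaned record -> fused feature record.
raw = record_key("raw", "r1")
clean = record_key("cleaned", "r1")
fused = record_key("fused", "f1")

idx = LineageIndex()
idx.add_step(raw, clean)
idx.add_step(clean, fused)
```

In this sketch, idx.trace_forward(raw) returns the keys of the cleaned and fused records, and idx.trace_backward(fused) returns the keys of the raw and cleaned records. Storing both link directions trades memory for symmetric lookup cost in either tracing direction.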

Keywords: power grid business; multisource heterogeneous data; data traceability; data conversion; data cleaning; feature extraction; feature fusion.

DOI: 10.1504/IJCAT.2024.143299

International Journal of Computer Applications in Technology, 2024 Vol.74 No.4, pp.267 - 274

Received: 09 Jan 2024
Accepted: 30 Apr 2024

Published online: 12 Dec 2024
