Authors: Guang-hua Yu
Addresses: Network and Information Center, Lingnan Normal University, Zhanjiang 524048, China
Abstract: To address the low efficiency of traditional cleaning methods, this paper presents a path-tree-building cleaning method, based on a split method, for identifying redundant data. The traditional process for identifying redundant data during the cleaning of mobile internet big data is first analysed, and the features of the redundant data are then extracted using a median filtering algorithm. The redundant data are classified by a support vector machine (SVM) and identified by a self-organising feature map. On this basis, a redundant data identification model is built that can clean the redundant data in mobile internet big data. Simulation results show that, compared with classical methods, the proposed method offers high accuracy, good stability, a high recall rate, low time consumption and low energy consumption.
Keywords: mobile internet; big data; redundant data; detection; cleaning methods; optimisation.
International Journal of Internet Protocol Technology, 2018 Vol.11 No.1, pp.29 - 37
Received: 14 Aug 2017
Accepted: 09 Oct 2017
Published online: 25 Apr 2018