Title: Cluster-based convolution process on big data in privacy preserving data mining

Authors: R. Lalitha; K. Rameshkumar

Addresses: Research and Development Centre, Bharathiar University, Coimbatore-641046, India ' Research and Development Centre, Bharathiar University, Coimbatore-641046, India

Abstract: The main intension of this paper is to privacy preserving-aware over big data in clouds using KNN and MapReduce framework. This paper consists of three phases such as, MapReduce phase, clustering the map reduced data and evaluation phase. In MapReduce phase, we are splitting the input data after the splitting process we are including a k-means clustering algorithm to cluster the map reduced data. Then, we are performing a convolution process to the dataset and create a new matrix. Once it is over, the privacy-persevering framework over big data in cloud systems is performed based on the evaluation base. In evaluation module, deduplication is performed with the aid of the KNN algorithm. In this phase using the KNN technique to check the duplication of data based on the threshold. Thus the non-duplicated data's are stored in cloud database, which is improving the utility of the privacy data.

Keywords: k-means clustering; MapReduce; privacy preserving; K-nearest neighbour; KNN.

DOI: 10.1504/IJBIS.2020.10023788

International Journal of Business Information Systems, 2021 Vol.38 No.1, pp.17 - 33

Received: 25 Apr 2018
Accepted: 12 Mar 2019

Published online: 30 Oct 2021 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article