Title: PPHE-automatic detection of sensitive attributes in a privacy preserved Hadoop environment using data mining techniques
Authors: Kumaran Umapathy; Neelu Khare
Addresses: School of Information Technology and Engineering, VIT, Vellore, India ' School of Information Technology and Engineering, VIT, Vellore, India
Abstract: Online social networks (OSN) has become highly popular, where users are more and more lured to reveal their private information. To balance privacy and utility, many privacy preserving approaches have been proposed which does not meet well users personalized requirements. In this paper, we present a privacy preserved Hadoop environment (PPHE) which automatically detects sensitive attributes using data mining techniques. This work considers Twitter which contains private information such as email addresses, mobile numbers, physical addresses, and date of births. First, we authenticate each Twitter user using the integrated algorithm RSA and Elgamal algorithm. Second, we categorize the tweets into private and non-private attributes based on the type-2 fuzzy logic system. Third, we apply a data suppression technique for private tweets and finally share the user's content based on their similarity information. Content similarity has been evaluated using cosine similarity. Finally we evaluate the system performance in terms of accuracy, precision, recall, and F-measure.
Keywords: privacy preserving data mining; online social networks; OSN; Twitter; data mining techniques.
DOI: 10.1504/IJCAET.2021.114488
International Journal of Computer Aided Engineering and Technology, 2021 Vol.14 No.3, pp.296 - 319
Received: 16 May 2018
Accepted: 09 Jul 2018
Published online: 26 Apr 2021 *