Title: Incorporating security and integrity into the mining process of hybrid weighted-hashT apriori algorithm using Hadoop

Authors: R. Sumithra; Sujni Paul

Addresses: Centre for Development of Advanced Computing, Chennai, 600113, India ' School of Engineering and Information Technology, Al Dar University College, Al Garhoud, Dubai

Abstract: This paper talks about the best algorithms of association rule mining (ARM), weighted and hash tree apriori algorithms in a distributed cloud platform and its enhancement as a hybrid weighted-hashT apriori algorithm and its implementation in a eucalyptus platform. Then, this research work handles the integrity and security issues of data during the process of mining. The algorithm is experimented in a cloud environment using Eucalyptus platform with VMware workstation and Hadoop distributed file system (HDFS). And also, the work evaluated how distributed implementation goes better than stand-alone implementations of weighted and hash tree apriori algorithms as well as distributed implementation. The work further studies the effectiveness of using eucalyptus Hadoop nodes and the performance changes with respect to the use of the security protocol for ensuring the security of data in the mining process.

Keywords: data mining; weighted apriori; hashT; Hadoop; cloud; data integrity; data security; eucalyptus; apriori; distributed mining.

DOI: 10.1504/IJDS.2018.094506

International Journal of Data Science, 2018 Vol.3 No.3, pp.266 - 287

Received: 01 Sep 2016
Accepted: 29 May 2017

Published online: 04 Sep 2018 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article