Title: Implementation of multi node Hadoop virtual cluster on open stack cloud environments

Authors: S. Karthikeyan; R. Manimegalai

Addresses: Anna University, Chennai, Tamil Nadu, India ' Department of IT, PSG College of Technology, Coimbatore, Tamil Nadu, India

Abstract: Nowadays computing plays a vital role in information technology and all other fields. Yes, the cloud computing is one of the biggest milestone in most leading next generation technology and booming up in IT filed and business sectors. In our day to day life the data is being generated is enormous amount such as tera (TB), peta (PB), zeta (ZB) bytes. Hadoop Map Reduce is the popular distributed computing paradigm to process data intensive jobs in cloud. Completion time goals for deadline of map reduce jobs set by users are becoming crucial in existing cloud based data processing environments like Hadoop. In this paper proposes a real-time implementation of multi node Hadoop virtual cluster on open stack cloud environments and also it processes the huge data sets in parallel different virtual machines (VMs) and it compares average execution time for different node virtual clusters and various size inputs.

Keywords: cloud; data intensive; Hadoop; Map Reduce; open stack-cluster; virtualisation.

DOI: 10.1504/IJBIDM.2020.108768

International Journal of Business Intelligence and Data Mining, 2020 Vol.17 No.2, pp.193 - 205

Received: 17 Aug 2017
Accepted: 21 Jan 2018

Published online: 06 Apr 2020 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article