Int. J. of Business Intelligence and Data Mining   »   2018 Vol.13, No.1/2/3

 

 

Title: Data mining of unstructured big data in cloud computing

 

Authors: A.K. Reshmy; D. Paulraj

 

Addresses:
Anna University, Chennai, India
RMD Engineering College, Chennai, India

 

Abstract: Hadoop Distributed File System, Talend, MapReduce (MR), YARN and Cloudera model have gotten to be prevalent advancements for expansive scale information association and investigation. In our work, we distinguish the prerequisites of the covered information association and propose an augmentation to the present programming model, called Comprehensive Hadoop Distributed File System along with MapReduce (C-HDFS-MR), to address them. The expanded interface is exhibited as application programming interface and actualised with regards to image processing application space. In our work, we show viability of C-HDFS-MR through contextual investigations of picture handling capacities along with the outcomes. Despite the fact that C-HDFS-MR has minimal overhead in information stockpiling and I/O operations, it enormously upgrades the framework execution and improves the application advancement process. Our proposed framework, C-HDFS-MR, works in the absence of progressions for the current prototypes, and is used by numerous applications to prerequisite of covered information.

 

Keywords: big data; MapReduce; MR; Hadoop; Comprehensive Hadoop Distributed File System along with MapReduce; C-HDFS-MR; medical image processing; analysis; and visualisation; MIPAV.

 

DOI: 10.1504/IJBIDM.2017.10004683

 

Int. J. of Business Intelligence and Data Mining, 2018 Vol.13, No.1/2/3, pp.147 - 162

 

Available online: 03 Nov 2017

 

 

Editors Full text accessAccess for SubscribersPurchase this articleComment on this article