Title: Simplifying MapReduce data processing

Authors: Chih-Shan Liao; Jin-Ming Shih; Ruay-Shiung Chang

Addresses: Department of Computer Science and Information Engineering, National Dong Hwa University, 1, Sec. 2, Da Hsueh Rd., Shou-Feng, Hualien, 974, Taiwan ' Department of Computer Science and Information Engineering, National Dong Hwa University, 1, Sec. 2, Da Hsueh Rd., Shou-Feng, Hualien, 974, Taiwan ' Department of Computer Science and Information Engineering, National Dong Hwa University, 1, Sec. 2, Da Hsueh Rd., Shou-Feng, Hualien, 974, Taiwan

Abstract: MapReduce is a programming model developed by Google for processing and generating large datasets in distributed environments. Many real world tasks can be implemented by two functions, map and reduce. MapReduce plays a key role in cloud computing, since it decreases the complexity of the distributed programming and is easy to be developed on large clusters of common machines. Hadoop, an open-source project, is used to implement Google MapReduce architecture. It is widely used by many applications such as FaceBook, Yahoo, Twitter, and so on. However, it is difficult to decouple an application into functions of map and reduce for common users. In this paper, we focus on convenient use of MapReduce and propose using components to compose a MapReduce solution. We develop a web-based graphic user interface for ordinary users to utilise MapReduce without the real programming. Users only have to know how to specify their tasks in target-value-action tuples. Real examples are provided for demonstration.

Keywords: MapReduce data processing; Hadoop; cloud computing; web-based GUIs; graphical user interface.

DOI: 10.1504/IJCSE.2013.055353

International Journal of Computational Science and Engineering, 2013 Vol.8 No.3, pp.219 - 226

Received: 02 Mar 2012
Accepted: 29 Apr 2012

Published online: 27 Dec 2013 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article