Authors: Carlyna Bondiombouy; Patrick Valduriez
Addresses: Inria and LIRMM, University of Montpellier, Montpellier, France ' Inria and LIRMM, University of Montpellier, Montpellier, France
Abstract: Building cloud data-intensive applications often requires using multiple data stores (NoSQL, HDFS, RDBMS, etc.), each optimised for one kind of data and tasks. However, the wide diversification of data store interfaces makes it difficult to access and integrate data from multiple data stores. This important problem has motivated the design of a new generation of systems, called multistore systems, which provide integrated or transparent access to a number of cloud data stores through one or more query languages. In this paper, we give an overview of query processing in multistore systems. We start by introducing the recent cloud data management solutions and query processing in multidatabase systems. Then, we describe and analyse some representative multistore systems, based on their architecture, data model, query languages and query processing techniques. To ease comparison, we divide multistore systems based on the level of coupling with the underlying data stores, i.e., loosely-coupled, tightly-coupled and hybrid. Our analysis reveals some important trends, which we discuss. We also identify some major research issues.
Keywords: cloud data stores; multistore systems; multidatabase systems; query processing; cloud computing; cloud data management.
International Journal of Cloud Computing, 2016 Vol.5 No.4, pp.309 - 346
Received: 22 Mar 2016
Accepted: 31 Mar 2016
Published online: 09 Dec 2016 *