Title: Data replication strategies with performance objective in data grid systems: a survey

Authors: Riad Mokadem; Abdelkader Hameurlain

Addresses: Institut de Recherche en Informatique de Toulouse (IRIT), Paul Sabatier University, 118 Route de Narbonne 31062, Toulouse, France ' Institut de Recherche en Informatique de Toulouse (IRIT), Paul Sabatier University, 118 Route de Narbonne 31062, Toulouse, France

Abstract: Replicating for performance constitutes an important issue in large-scale data management systems. In this context, a significant number of replication strategies have been proposed for data grid systems. Some works classified these strategies into static vs. dynamic or centralised vs. decentralised or client vs. server initiated strategies. Very few works deal with a replication strategy classification based on the role of these strategies when building a replica management system. In this paper, we propose a new replication strategy classification based on objective functions of these strategies. Also, each replication strategy is designed according to the data grid topology for which it was proposed. We point out the impact of the topology on replication performance although most of these strategies have been proposed for a hierarchical grid topology. We also study the impact of some factors on performance of these strategies, e.g. access pattern, bandwidth consumption and storage capacity.

Keywords: data grid systems; data replication; replication strategies; classification; objective functions; performance; large-scale data management; access patterns; bandwidth consumption; storage capacity.

DOI: 10.1504/IJGUC.2015.066395

International Journal of Grid and Utility Computing, 2015 Vol.6 No.1, pp.30 - 46

Received: 15 Aug 2013
Accepted: 30 Jan 2014

Published online: 18 Dec 2014 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article