Authors: Riad Mokadem; Abdelkader Hameurlain
Addresses: Institut de Recherche en Informatique de Toulouse (IRIT), Paul Sabatier University, 118 Route de Narbonne 31062, Toulouse, France ' Institut de Recherche en Informatique de Toulouse (IRIT), Paul Sabatier University, 118 Route de Narbonne 31062, Toulouse, France
Abstract: Replicating for performance constitutes an important issue in large-scale data management systems. In this context, a significant number of replication strategies have been proposed for data grid systems. Some works classified these strategies into static vs. dynamic or centralised vs. decentralised or client vs. server initiated strategies. Very few works deal with a replication strategy classification based on the role of these strategies when building a replica management system. In this paper, we propose a new replication strategy classification based on objective functions of these strategies. Also, each replication strategy is designed according to the data grid topology for which it was proposed. We point out the impact of the topology on replication performance although most of these strategies have been proposed for a hierarchical grid topology. We also study the impact of some factors on performance of these strategies, e.g. access pattern, bandwidth consumption and storage capacity.
Keywords: data grid systems; data replication; replication strategies; classification; objective functions; performance; large-scale data management; access patterns; bandwidth consumption; storage capacity.
International Journal of Grid and Utility Computing, 2015 Vol.6 No.1, pp.30 - 46
Received: 15 Aug 2013
Accepted: 30 Jan 2014
Published online: 18 Dec 2014 *