Title: An efficient replication scheme based on living-replicas estimation for distributed storage platforms

Authors: Ying Hu

Addresses: College of Computer and Communication, Hunan Institute of Engineering, Xiangtan, 411104, China

Abstract: In the past few years, massive storage platforms are playing critical roles in many practical distributed systems (i.e., grid or cloud). To ensure desirable data availability, replication mechanism has been widely applied in those massive storage platforms. Meanwhile, a high level of data redundancy inevitably degrades the performance of storage management and bandwidth utilisation. In this paper, we propose an intelligent replication scheme which leverages failure pattern of storage nodes to estimate the actual number of living-replicas in a storage platform. In this way, the storage platform can make more accurate decisions on how many replicas should be generated for maintaining a given data availability. Extensive experiments based on real-world traces indicate that the proposed replication scheme can significantly reduce the overall data redundancy and maintain desirable data availability at the same time.

Keywords: data replication; cloud computing; distributed storage; probability model.

DOI: 10.1504/IJNVO.2019.100183

International Journal of Networking and Virtual Organisations, 2019 Vol.20 No.3, pp.301 - 317

Received: 27 Feb 2017
Accepted: 17 May 2017

Published online: 17 Jun 2019 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article