Title: Efficient data storage: adaptively changing chunk size in cloud computing storage systems

Authors: Chalabi Baya; Slimani Yahya

Addresses: Laboratoire de la Communication dans les Systèmes Informatiques, Ecole nationale Supérieure d'Informatique (ESI), BP, 68M Oued-Smar, 16270 Alger, Algeria ' ISAMM, Universitié de La Manouba, Manouba, Tunisia

Abstract: Cloud computing enables users to utilise a shared pool of resources on demand. BLOB storage is a type of cloud storage for unstructured data. Most data management systems use chunk sizes equal to a given BLOB. Despite the simplicity of this strategy, as the nodes in the system are heterogeneous, the BLOB sizes are different and access to the data is not consistent due to these issues. We propose an adaptive strategy in order to define an adequate chunk size by taking into account metrics concerning the actual resources (bandwidth, storage usage), the size of the BLOB/file and the access rate of the chunks. Our experimental results show that our proposed approach provides an execution time that is about 24% better than that of a method based only on one chunk size and about 96% better than that of a method based on a random choice of the chunk size.

Keywords: cloud computing; data striping; data management system; data storage; BlobSeer; chunk size; Hadoop; big data; scalability; concurrent access.

DOI: 10.1504/IJGUC.2023.133455

International Journal of Grid and Utility Computing, 2023 Vol.14 No.5, pp.516 - 525

Received: 01 Dec 2022
Accepted: 24 Jan 2023

Published online: 15 Sep 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article