Title: Performance evaluation of a dynamic load-balancing library for cluster computing

Authors: Ioana Banicescu, Ricolindo L. Carino, Jaderick P. Pabico, Mahadevan Balasubramaniam

Addresses: Department of Computer Science and Engineering, and Center for Computational Sciences ERC, Mississippi State University, Mississippi State, MS 39762, USA. ' Center for Computational Sciences ERC, Mississippi State University, Mississippi State, MS 39762, USA. ' Institute of Computer Science, University of the Philippines Los Banos, 4031 College, Laguna, Philippines. ' Department of Computer Science, University of New Hampshire, Durham, NH 03824-3597, USA

Abstract: The performance of scientific applications in heterogeneous environments has been improved with the research advances in dynamic scheduling at application and runtime system levels. This paper presents the performance evaluation of a library as a result of an integrated approach to dynamic load balancing. This approach combines the advantages of optimising data migration via novel dynamic loop-scheduling strategies with the advances in resource management and task migration capabilities offered by a recently developed parallel runtime system. The performance of the library has been investigated by its use in three scientific applications: the N-body simulations, the profiling of automatic quadrature routines and the solution to the 3D heat equation. The investigations focus on the performance degradation owing to the overhead induced by the runtime system software layer. The experimental results obtained indicate only a slight increase in the cost of load balancing owing to this overhead. The results validate the suitability of the runtime system as an implementation platform for dynamic load-balancing schemes and underscore the significance of using the integrated approach, as well as the benefits of using the library especially in cluster applications characterised by irregular and unpredictable behaviour.

Keywords: dynamic load balancing library; performance evaluation; overhead analysis; parallel computing; cluster computing; dynamic scheduling; data migration; loop scheduling; resource management; task migration; parallel runtime system.

DOI: 10.1504/IJCSE.2005.009697

International Journal of Computational Science and Engineering, 2005 Vol.1 No.2/3/4, pp.118 - 133

Published online: 05 May 2006 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article