Title: Dynamic, capability-driven scheduling of DAG-based real-time jobs in heterogeneous clusters

Authors: Ligang He, Stephen A. Jarvis, Daniel P. Spooner, Graham R. Nudd

Addresses: Department of Computer Science, University of Warwick, Coventry CV4 7AL, UK. ' Department of Computer Science, University of Warwick, Coventry CV4 7AL, UK. ' Department of Computer Science, University of Warwick, Coventry CV4 7AL, UK. ' Department of Computer Science, University of Warwick, Coventry CV4 7AL, UK

Abstract: In this research a scenario is assumed where periodic real-time jobs are being run on a heterogeneous cluster of computers and new aperiodic parallel real-time jobs, modelled by directed acyclic graphs, arrive at the system dynamically. In the scheduling scheme presented in this paper, a global scheduler situated within the cluster schedules new jobs onto the computers by modelling their spare capabilities left by existing periodic jobs. Admission control is introduced so that new jobs are rejected if their deadlines cannot be met under the precondition of still guaranteeing the real-time requirements of existing jobs. Each computer within the cluster houses a local scheduler, which uniformly schedules both periodic job instances and the subtasks in each parallel real-time job using an early deadline first policy. The modelling of the spare capabilities is optimal in the sense that once a new task starts running on a computer, it will utilise all the spare capability left by the periodic real-time jobs and its finish time will be the earliest possible. The performance of the proposed modelling and scheduling is evaluated through extensive simulation; the results show that the system utilisation is significantly enhanced, while the real-time requirements of the existing jobs remain guaranteed.

Keywords: cluster computing; dynamic scheduling; spare capabilities; heterogeneous clusters; periodic real-time jobs; DAG real-time jobs; performance prediction.

DOI: 10.1504/IJHPCN.2004.008901

International Journal of High Performance Computing and Networking, 2004 Vol.2 No.2/3/4, pp.165 - 177

Published online: 02 Feb 2006 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article