An algorithm for online distributed fault-tolerant job scheduling in grid computing
by Jun Zeng
International Journal of Web and Grid Services (IJWGS), Vol. 17, No. 4, 2021

Abstract: To address the various faults that occur in grid computing environments, this paper proposes an online distributed fault-tolerant job scheduling algorithm. The algorithm consists of two main modules: a job scheduling module and a replica management and placement module. The former is based on job replication: each replica is scheduled independently at a different site, and otherwise unused resources run the replicas so that at least one replica completes successfully. In the latter, each remote separate resource manager (SRM) that runs a job replica reports at each monitoring interval, so that the status of the replica is communicated to the original SRM (PSRM). The PSRM periodically checks the application status table, queries all remote SRMs to obtain the status of the computing machines and the network, and monitors all job replicas running at its site, thereby achieving fault tolerance. Experimental results show that the proposed algorithm achieves a better average job response time under various failure rates than other grid fault-tolerant scheduling algorithms and non-fault-tolerant scheduling algorithms.
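
The replication idea described in the abstract can be illustrated with a minimal Python sketch. This is not the paper's algorithm, whose details are in the full text; it only assumes that replicas are placed on distinct idle sites, that each site's status reaches the originating manager once per monitoring interval, and that a job succeeds when at least one replica finishes. All names (`Site`, `StatusTable`, `place_replicas`, `monitoring_round`, `job_succeeded`) are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Site:
    """A grid site (assumed model, not taken from the paper)."""
    name: str
    idle: bool = True    # hypothetical flag: the site has unused resources
    alive: bool = True   # set to False to simulate a fault


@dataclass
class StatusTable:
    """Stand-in for the application status table kept by the originating manager."""
    # Maps (job_id, site_name) to "running", "done", or "failed".
    entries: dict = field(default_factory=dict)


def place_replicas(job_id, sites, n_replicas, table):
    """Place up to n_replicas copies of a job, each on a different idle site."""
    chosen = [s for s in sites if s.idle and s.alive][:n_replicas]
    for site in chosen:
        site.idle = False
        table.entries[(job_id, site.name)] = "running"
    return chosen


def monitoring_round(job_id, placed, table, finished):
    """One monitoring interval: record the status of every placed replica."""
    for site in placed:
        if not site.alive:
            table.entries[(job_id, site.name)] = "failed"
        elif site.name in finished:
            table.entries[(job_id, site.name)] = "done"


def job_succeeded(job_id, table):
    """The job counts as completed once at least one replica is done."""
    return any(
        status == "done"
        for (jid, _), status in table.entries.items()
        if jid == job_id
    )


# Usage: three replicas, one site fails, one replica finishes.
sites = [Site("A"), Site("B"), Site("C", idle=False), Site("D")]
table = StatusTable()
placed = place_replicas("job-1", sites, 3, table)   # sites A, B, D
sites[0].alive = False                               # simulated fault at site A
monitoring_round("job-1", placed, table, finished={"B"})
print(job_succeeded("job-1", table))
```

In this sketch the failure of site A does not affect the outcome, because the replica at site B completes independently; a real scheduler would also need timeouts for sites that stop reporting, which are omitted here.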

Online publication date: Mon, 25-Oct-2021
