Title: Don't hurry be green: scheduling servers shutdown in grid computing with deep reinforcement learning

Authors: Lucas Camelo Casagrande; Guilherme Piêgas Koslovski; Charles Christian Miers; Maurício Aronne Pillon; Nelson Mimura Gonzalez

Addresses: Graduate Program in Applied Computing, Santa Catarina State University, Florianópolis, Joinville, Santa Catarina, Brazil ' Graduate Program in Applied Computing, Santa Catarina State University, Florianópolis, Joinville, Santa Catarina, Brazil ' Graduate Program in Applied Computing, Santa Catarina State University, Florianópolis, Joinville, Santa Catarina, Brazil ' Graduate Program in Applied Computing, Santa Catarina State University, Florianópolis, Joinville, Santa Catarina, Brazil ' IBM Research, Thomas J. Watson Research Centre, Armonk, New York, USA

Abstract: Grid computing platforms dissipate massive amounts of energy. Energy efficiency, therefore, is an essential requirement that directly affects its sustainability. Resource management systems deploy rule-based approaches to mitigate this cost. However, these strategies do not consider the patterns of the workloads being executed. In this context, we demonstrate how a solution based on Deep Reinforcement Learning is used to formulate an adaptive power-efficient policy. Specifically, we implement an off-reservation approach to overcome the disadvantages of an aggressive shutdown policy and minimise the frequency of shutdown events. Through simulation, we train the algorithm and evaluate it against commonly used shutdown policies using real traces from GRID'5000. Based on the experiments, we observed a reduction of 46% on the averaged energy waste with an equivalent frequency of shutdown events compared to a soft shutdown policy.

Keywords: deep reinforcement learning; grid computing; energy-aware scheduling; shutdown strategy; Markov decision process; resource management.

DOI: 10.1504/IJGUC.2022.128303

International Journal of Grid and Utility Computing, 2022 Vol.13 No.6, pp.589 - 606

Received: 04 Sep 2020
Accepted: 06 Oct 2020

Published online: 17 Jan 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article