Title: When Green Computing Meets Performance and Resilience SLOs.
This paper addresses the urgent need to transition to global net-zero carbon emissions by 2050 while retaining the ability to meet joint performance and resilience objectives. The focus is on computing infrastructures, such as hyperscale cloud datacenters, that consume significant power and thus produce increasing amounts of carbon emissions. Our goal is to (1) optimize the usage of green energy sources (e.g., solar energy), which are desirable but expensive and relatively unstable, and (2) continuously reduce the use of fossil fuels, which have a lower cost but a significant negative societal impact. Meanwhile, cloud datacenters strive to meet their customers’ requirements, e.g., service-level objectives (SLOs) for application latency or throughput, which are impacted by infrastructure resilience and availability. We propose a scalable formulation that combines sustainability, cloud resilience, and performance as a joint optimization problem with multiple interdependent objectives to address these issues holistically. Given the complexity and dynamicity of the problem, machine learning (ML) approaches, such as reinforcement learning, are essential for achieving continuous optimization. Our study highlights the challenges of green energy instability, which necessitate innovative ML-centric solutions across heterogeneous infrastructures to manage the transition towards green computing. Underlying the ML-centric solutions must be methods that combine classic system resilience techniques with innovations in real-time ML resilience (not addressed heretofore). We believe that this approach will not only set a new direction in the resilient, SLO-driven adoption of green energy but also enable us to manage future sustainable systems in ways that were not possible before.
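The abstract describes the joint formulation only at a high level. Purely as an illustration of how sustainability, performance, and resilience signals could be folded into a single objective for a reinforcement-learning scheduler, here is a minimal sketch; the weighted-sum form, the state fields, and the weights are assumptions made for this example, not the authors' formulation.

```python
from dataclasses import dataclass

@dataclass
class ClusterState:
    """Hypothetical per-interval telemetry for one datacenter (illustrative only)."""
    green_kwh: float        # energy drawn from renewable sources
    fossil_kwh: float       # energy drawn from the grid's fossil mix
    p99_latency_ms: float   # observed tail latency
    slo_latency_ms: float   # latency target from the SLO
    replicas_healthy: int   # replicas currently serving traffic
    replicas_required: int  # replicas needed for the resilience target

def joint_reward(s: ClusterState,
                 w_carbon: float = 1.0,
                 w_perf: float = 1.0,
                 w_resil: float = 1.0) -> float:
    """Weighted-sum stand-in for a joint sustainability/performance/resilience objective."""
    # Sustainability: penalize fossil energy, mildly reward green usage.
    carbon_term = s.green_kwh - 2.0 * s.fossil_kwh
    # Performance: penalize SLO violations in proportion to the overshoot.
    perf_term = -max(0.0, s.p99_latency_ms - s.slo_latency_ms) / s.slo_latency_ms
    # Resilience: penalize missing redundancy.
    resil_term = -max(0, s.replicas_required - s.replicas_healthy)
    return w_carbon * carbon_term + w_perf * perf_term + w_resil * resil_term
```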
Award ID(s): 2029049
PAR ID: 10546463
Publisher / Repository: Institute of Electrical and Electronics Engineers
Edition / Version: 1
Volume: 1
Issue: 1
ISBN: 979-8-3503-9570-9
Page Range / eLocation ID: 17-22
Subject(s) / Keyword(s): sustainability, green energy, cloud computing, resilience, machine learning, machine learning resilience
Format(s): Medium: X; Size: 445 kb; Other: pdf
Location: Brisbane, Australia
Sponsoring Org: National Science Foundation
More Like this
1. Cloud providers are adapting datacenter (DC) capacity to reduce carbon emissions. With hyperscale datacenters exceeding 100 MW individually, and in some grids exceeding 15% of power load, DC adaptation is large enough to harm power grid dynamics, increasing carbon emissions and power prices or reducing grid reliability. To avoid this harm, we explore coordination of DC capacity changes with varying scope in space and time. In space, coordination scope spans a single datacenter, a group of datacenters, and datacenters with the grid. In time, scope ranges from online to day-ahead. We also consider what DC and grid information is used (e.g., real-time and day-ahead average carbon, power price, and compute backlog). For example, in our proposed PlanShare scheme, each datacenter uses day-ahead information to create a capacity plan and shares it, allowing global grid optimization (over all loads, over the entire day). We evaluate DC carbon emissions reduction. Results show that local coordination scope fails to reduce carbon emissions significantly (3.2%–5.4% reduction). Expanding coordination scope to a set of datacenters improves slightly (4.9%–7.3%). PlanShare, with grid-wide coordination and full-day capacity planning, performs the best: it reduces DC emissions by 11.6%–12.6%, 1.56x–1.26x better than the best local, online approach, and it also achieves lower cost. We expect these advantages to increase as renewable generation in power grids increases. Further, a known full-day DC capacity plan provides a stable target for DC resource management.
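The record above only sketches PlanShare's day-ahead planning in prose. Purely as an illustrative, greedy stand-in for the idea of committing day-ahead capacity into the cleanest hours (the hourly granularity, the signal names, and the greedy policy are assumptions made here, not the PlanShare algorithm):

```python
def day_ahead_plan(carbon_forecast, backlog_mwh, min_mw, max_mw):
    """Toy day-ahead capacity plan: fill the cleanest hours first.

    carbon_forecast: 24 hourly carbon intensities (gCO2/kWh), day-ahead forecast
    backlog_mwh:     flexible compute that must be served within the day (MWh)
    min_mw, max_mw:  capacity bounds the datacenter can commit to per hour
    Returns a list of 24 hourly capacity commitments (MW).
    """
    plan = [min_mw] * 24                      # always run the inflexible floor
    remaining = backlog_mwh - min_mw * 24     # backlog left after the floor
    # Visit hours from cleanest to dirtiest, raising capacity toward max_mw.
    for hour in sorted(range(24), key=lambda h: carbon_forecast[h]):
        if remaining <= 0:
            break
        extra = min(max_mw - min_mw, remaining)
        plan[hour] += extra
        remaining -= extra
    return plan

# Example: clean midday hours, a 300 MWh flexible backlog, 5-25 MW bounds.
forecast = [500] * 8 + [200] * 8 + [450] * 8
print(day_ahead_plan(forecast, backlog_mwh=300, min_mw=5, max_mw=25))
```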
2. Traditional datacenter design and optimization for TCO and PUE is based on static views of power grids as well as computational loads. Power grids exhibit increasingly variable prices and carbon emissions, and will become more so as government initiatives drive further decarbonization. The resulting opportunities require dynamic, temporal metrics (i.e., not simple averages), flexible systems, and intelligent adaptive control. Two research areas represent new opportunities to reduce both carbon and cost in this world of variable power, carbon, and price: first, the design and optimization of flexible datacenters; second, cloud resource, power, and application management for variable-capacity datacenters. For each, we describe the challenges and potential benefits.
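As a small, self-contained illustration of why dynamic, temporal metrics matter more than simple averages (the numbers and load profiles below are invented for this sketch, not drawn from the paper): two load profiles with identical total energy, facing the same average carbon intensity, can produce very different emissions.

```python
# Hourly grid carbon intensity (gCO2/kWh) and datacenter load (MW) over one day.
carbon = [600] * 8 + [150] * 8 + [550] * 8          # clean midday, dirty night
load_flat = [20.0] * 24                              # static datacenter
load_flex = [10.0] * 8 + [40.0] * 8 + [10.0] * 8     # flexible DC shifts work to midday

def emissions_tons(load_mw, carbon_g_per_kwh):
    """Total daily emissions in metric tons of CO2 for an hourly load profile."""
    grams = sum(mw * 1000 * c for mw, c in zip(load_mw, carbon_g_per_kwh))
    return grams / 1e6

# A simple average of carbon intensity hides the opportunity...
print(sum(carbon) / 24)                    # ~433 gCO2/kWh in both cases
# ...but the time-resolved view shows the flexible profile emits far less
# for the same total energy (480 MWh).
print(emissions_tons(load_flat, carbon))   # static profile: 208 tons
print(emissions_tons(load_flex, carbon))   # load shifted into clean hours: 140 tons
```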
3. The end of Dennard scaling and the slowing of Moore's Law have put the energy use of datacenters on an unsustainable path. Datacenters are already a significant fraction of worldwide electricity use, with application demand scaling at a rapid rate. We argue that substantial reductions in the carbon intensity of datacenter computing are possible with a software-centric approach: by making energy and carbon visible to application developers on a fine-grained basis, by modifying system APIs to make it possible to make informed trade-offs between performance and carbon emissions, and by raising the level of application programming to allow for flexible use of more energy-efficient means of compute and storage. We also lay out a research agenda for systems software to reduce the carbon footprint of datacenter computing.
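The abstract argues for system APIs that make carbon visible and let applications trade performance against emissions, but it does not specify such an API. A hypothetical sketch of one small piece of such an interface (none of the names below come from the paper) might look like:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class CarbonContext:
    """Hypothetical per-request carbon telemetry a runtime could expose."""
    grid_intensity_g_per_kwh: float   # current grid carbon intensity
    estimated_energy_j: float         # runtime's energy estimate for the call

def run_when_clean(task: Callable[[], None],
                   ctx: CarbonContext,
                   threshold_g_per_kwh: float,
                   defer: Callable[[Callable[[], None]], None]) -> None:
    """Run a deferrable task now only if the grid is clean enough; otherwise queue it."""
    if ctx.grid_intensity_g_per_kwh <= threshold_g_per_kwh:
        task()        # grid is clean: take the energy hit now
    else:
        defer(task)   # grid is dirty: let the scheduler retry in a cleaner hour
```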
4. Embodied carbon has been widely reported as a significant component of the full-lifecycle greenhouse gas emissions of various computing systems. Many efforts have been undertaken to quantify the elements that comprise this embodied carbon, from tools that evaluate semiconductor manufacturing to those that quantify different elements of the computing system, from both commercial and academic sources. However, these tools cannot easily reproduce the results reported in server vendors' product carbon reports, and their accuracy can vary substantially due to various assumptions. Furthermore, attempts to determine greenhouse gas contributions using bottom-up methodologies often do not agree with system-level studies and are hard to reconcile. Nonetheless, given the need to consider all contributions to greenhouse gas emissions in datacenters, we propose SCARIF, the Server Carbon including Accelerator Reporter with Intelligence-based Formulation tool. SCARIF makes three main contributions: (1) we collect reported carbon cost data from server vendors and design statistical models to predict embodied carbon cost, so that users can obtain the embodied carbon cost for their server configurations; (2) we provide embodied carbon costs for servers configured with accelerators, including GPUs and FPGAs; (3) using case studies, we show that certain datacenter-management design choices might flip given the insights and observations from using SCARIF. Thus, SCARIF provides an opportunity for large-scale datacenter and hyperscaler design. We release SCARIF as an open-source tool at https://github.com/arc-research-lab/SCARIF.
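SCARIF's actual statistical models live in the linked repository. As a purely illustrative sketch of the general idea of fitting a model to vendor-reported embodied carbon and then predicting the cost of a new server configuration (the features, training data, and linear form below are invented for this example):

```python
import numpy as np

# Invented training data: [CPU sockets, DRAM GB, SSD TB, GPU count] per server,
# with vendor-reported embodied carbon (kgCO2e) as the target.
configs = np.array([
    [2,  256,  1, 0],
    [2,  512,  4, 0],
    [4, 1024,  8, 0],
    [2,  512,  2, 4],
    [2, 1024, 16, 8],
], dtype=float)
reported_kgco2e = np.array([1150.0, 1800.0, 3100.0, 3300.0, 5900.0])

# Fit a linear model (with intercept) by least squares, in the spirit of a
# statistics-based predictor; SCARIF's real formulation may differ.
X = np.hstack([configs, np.ones((len(configs), 1))])
coef, *_ = np.linalg.lstsq(X, reported_kgco2e, rcond=None)

def predict_embodied_carbon(cpus, dram_gb, ssd_tb, gpus):
    """Predict embodied carbon (kgCO2e) for a hypothetical server configuration."""
    return float(np.array([cpus, dram_gb, ssd_tb, gpus, 1.0]) @ coef)

print(predict_embodied_carbon(2, 768, 6, 2))
```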