Modeling and Analyzing Waiting Policies for Cloud-Enabled Schedulers

Ambati, Pradeep; Bashir, Noman; Irwin, David; Shenoy, Prashant

doi:10.1109/TPDS.2021.3086270

Citation Details

Modeling and Analyzing Waiting Policies for Cloud-Enabled Schedulers

While cloud platforms enable users to rent computing resources on demand to execute their jobs, buying fixed resources is still much cheaper than renting if their utilization is high. Thus, optimizing cloud costs requires users to determine how many fixed resources to buy versus rent based on their workload. In this paper, we introduce the concept of a waiting policy for cloud-enabled schedulers, which is the dual of a scheduling policy, and show that the optimal cost depends on it. We define multiple waiting policies and develop simple analytical models to reveal their tradeoff between fixed resource provisioning, cost, and job waiting time. We evaluate the impact of these waiting policies on a year-long production batch workload consisting of 14M jobs run on a 14.3k-core cluster, and show that a compound waiting policy decreases the cost (by 5%) and mean job waiting time (by 7x) compared to a fixed cluster of the current size. more »

Award ID(s):: 1802523 1908536

PAR ID:: 10248764

Author(s) / Creator(s):: Ambati, Pradeep; Bashir, Noman; Irwin, David; Shenoy, Prashant

Date Published:: 2021-06-03

Journal Name:: IEEE Transactions on Parallel and Distributed Systems

ISSN:: 1045-9219

Page Range / eLocation ID:: 1 to 1

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/TPDS.2021.3086270

More Like this