RubberBand: cloud-based hyperparameter tuning

Misra, Ujval; Liaw, Richard; Dunlap, Lisa; Bhardwaj, Romil; Kandasamy, Kirthevasan; Gonzalez, Joseph E.; Stoica, Ion; Tumanov, Alexey

doi:10.1145/3447786.3456245

Citation Details

RubberBand: cloud-based hyperparameter tuning

Hyperparameter tuning is essential to achieving state-of-the-art accuracy in machine learning (ML), but requires substantial compute resources to perform. Existing systems primarily focus on effectively allocating resources for a hyperparameter tuning job under fixed resource constraints. We show that the available parallelism in such jobs changes dynamically over the course of execution and, therefore, presents an opportunity to leverage the elasticity of the cloud. In particular, we address the problem of minimizing the financial cost of executing a hyperparameter tuning job, subject to a time constraint. We present RubberBand---the first framework for cost-efficient, elastic execution of hyperparameter tuning jobs in the cloud. RubberBand utilizes performance instrumentation and cloud pricing to model job completion time and cost prior to runtime, and generate a cost-efficient, elastic resource allocation plan. RubberBand is able to efficiently execute this plan and realize a cost reduction of up to 2x in comparison to static allocation baselines. more »

Award ID(s):: 1730628

NSF-PAR ID:: 10310457

Author(s) / Creator(s):: Misra, Ujval; Liaw, Richard; Dunlap, Lisa; Bhardwaj, Romil; Kandasamy, Kirthevasan; Gonzalez, Joseph E.; Stoica, Ion; Tumanov, Alexey

Date Published:: 2021-04-21

Journal Name:: EuroSys '21: Proceedings of the Sixteenth European Conference on Computer Systems

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3447786.3456245

More Like this