Cloud-scale VM-deflation for Running Interactive Applications On Transient Servers

Fuerst, Alexander; Ali-Eldin, Ahmed; Shenoy, Prashant; Sharma, Prateek

doi:10.1145/3369583.3392675

Citation Details

Cloud-scale VM-deflation for Running Interactive Applications On Transient Servers

Transient computing has become popular in public cloud environments for running delay-insensitive batch and data processing applications at low cost. Since transient cloud servers can be revoked at any time by the cloud provider, they are considered unsuitable for running interactive application such as web services. In this paper, we present VM deflation as an alternative mechanism to server preemption for reclaiming resources from transient cloud servers under resource pressure. Using real traces from top-tier cloud providers, we show the feasibility of using VM deflation as a resource reclamation mechanism for interactive applications in public clouds. We show how current hypervisor mechanisms can be used to implement VM deflation and present cluster deflation policies for resource management of transient and on-demand cloud VMs. Experimental evaluation of our deflation system on a Linux cluster shows that microservice-based applications can be deflated by up to 50% with negligible performance overhead. Our cluster-level deflation policies allow overcommitment levels as high as 50%, with less than a 1% decrease in application throughput, and can enable cloud platforms to increase revenue by 30% more »

Award ID(s):: 1836752 1802523 1763834

PAR ID:: 10190511

Author(s) / Creator(s):: Fuerst, Alexander; Ali-Eldin, Ahmed; Shenoy, Prashant; Sharma, Prateek

Date Published:: 2020-06-21

Journal Name:: ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC)

Page Range / eLocation ID:: 53 to 64

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3369583.3392675

More Like this