Pufferfish: Container-driven Elastic Memory Management for Data-intensive Applications

Chen, Wei; Pi, Aidi; Wang, Shaoqi; Zhou, Xiaobo

doi:https://doi.org/10.1145/3357223.3362730

Citation Details

Pufferfish: Container-driven Elastic Memory Management for Data-intensive Applications

Data-intensive applications often suffer from significant memory pressure, resulting in excessive garbage collection (GC) and out-of-memory (OOM) errors, harming system performance and reliability. In this paper, we demonstrate how lightweight virtualization via OS containers opens up opportunities to address memory pressure and realize memory elasticity: 1) tasks running in a container can be set to a large heap size to avoid OutOfMemory (OOM) errors, and 2) tasks that are under memory pressure and incur significant swapping activities can be temporarily "suspended" by depriving resources from the hosting containers, and be "resumed" when resources are available. We propose and develop Pufferfish, an elastic memory manager, that leverages containers to flexibly allocate memory for tasks. Memory elasticity achieved by Pufferfish can be exploited by a cluster scheduler to improve cluster utilization and task parallelism. We implement Pufferfish on the cluster scheduler Apache Yarn. Experiments with Spark and MapReduce on real-world traces show Pufferfish is able to avoid OOM errors, improve cluster memory utilization by 2.7x and the median job runtime by 5.5x compared to a memory over-provisioning solution. more »

Award ID(s):: 1816850

PAR ID:: 10146845

Author(s) / Creator(s):: Chen, Wei; Pi, Aidi; Wang, Shaoqi; Zhou, Xiaobo

Date Published:: 2019-11-01

Journal Name:: ACM SoCC '19: Proceedings of the ACM Symposium on Cloud Computing

Page Range / eLocation ID:: 259 - 271

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/https://doi.org/10.1145/3357223.3362730

More Like this