Title: Exploration of memory hybridization for RDD caching in Spark
Apache Spark is a popular cluster computing framework for iterative analytics workloads due to its use of Resilient Distributed Datasets (RDDs) to cache data for in-memory processing. We reveal that the performance of Spark's RDD cache can be severely limited if its capacity falls short of the needs of the workload. In this paper, we explore different memory hybridization strategies to leverage emerging Non-Volatile Memory (NVM) devices for Spark's RDD cache. We find that a simple layered hybridization approach does not offer an effective solution. Therefore, we design a flat hybridization scheme that leverages NVM for caching RDD blocks, along with several architectural optimizations such as dynamic memory allocation for block unrolling, asynchronous migration with preemption, and opportunistic eviction to disk. We perform an extensive set of experiments to evaluate our proposed flat hybridization strategy and find it to be robust across different system and NVM characteristics. Our approach uses DRAM for only a fraction of the hybrid memory system, yet keeps the increase in execution time within 10% on average. Moreover, our opportunistic eviction of blocks to disk improves performance by up to 7.5% when used alongside the existing eviction mechanism.
Award ID(s):
1822737 1561041 1564647 1744336 1763547
NSF-PAR ID:
10162658
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
the 2019 ACM SIGPLAN International Symposium on Memory Management
Page Range / eLocation ID:
41 to 52
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
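To give a concrete, greatly simplified picture of the flat hybridization idea from the abstract above, the sketch below shows a block store that treats DRAM and NVM as a single flat cache, placing each RDD block wherever space is available and opportunistically spilling to disk when both memory tiers are full. This is an illustrative Scala sketch, not the paper's implementation; all names (HybridBlockStore, Tier, the capacities) are hypothetical, and the real design additionally handles dynamic unroll memory and asynchronous migration with preemption.

```scala
import scala.collection.mutable

// Illustrative sketch only: a flat DRAM+NVM block cache with opportunistic
// spill to disk. Names, capacities, and the placement rule are hypothetical.
object Tier extends Enumeration { val DRAM, NVM, Disk = Value }

final case class Block(id: String, bytes: Long)

class HybridBlockStore(dramCapacity: Long, nvmCapacity: Long) {
  private var dramUsed = 0L
  private var nvmUsed  = 0L
  private val location = mutable.HashMap.empty[String, Tier.Value]

  // Flat placement: DRAM if it fits, else NVM, else opportunistically evict to disk.
  def put(block: Block): Tier.Value = {
    val tier =
      if (dramUsed + block.bytes <= dramCapacity) { dramUsed += block.bytes; Tier.DRAM }
      else if (nvmUsed + block.bytes <= nvmCapacity) { nvmUsed += block.bytes; Tier.NVM }
      else Tier.Disk
    location(block.id) = tier
    tier
  }

  def whereIs(id: String): Option[Tier.Value] = location.get(id)
}

object HybridBlockStoreDemo extends App {
  // Small DRAM fraction, larger NVM pool: 2 MiB DRAM + 8 MiB NVM, 1 MiB blocks.
  val store = new HybridBlockStore(dramCapacity = 2L << 20, nvmCapacity = 8L << 20)
  val placements = (1 to 12).map(i => store.put(Block(s"rdd_0_$i", 1L << 20)))
  println(placements.mkString(", ")) // 2 blocks land in DRAM, 8 in NVM, 2 spill to disk
}
```

The point of the flat organization, as the abstract reports, is that even with DRAM forming only a fraction of the hybrid memory, the average slowdown stays within about 10%.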
More Like this
  1. The design of the buffer manager in database management systems (DBMSs) is influenced by the performance characteristics of volatile memory (i.e., DRAM) and non-volatile storage (e.g., SSD). The key design assumptions have been that data must be migrated to DRAM for the DBMS to operate on it and that storage is orders of magnitude slower than DRAM. But the arrival of new non-volatile memory (NVM) technologies that are nearly as fast as DRAM invalidates these assumptions. Researchers have recently designed Hymem, a novel buffer manager for a three-tier storage hierarchy comprising DRAM, NVM, and SSD. Hymem supports cache-line-grained loading and an NVM-aware data migration policy. While these optimizations improve its throughput, Hymem suffers from two limitations. First, it is a single-threaded buffer manager. Second, it is evaluated on an NVM emulation platform. These limitations constrain the utility of the insights obtained using Hymem. In this paper, we present Spitfire, a multi-threaded, three-tier buffer manager that is evaluated on Optane Persistent Memory Modules, an NVM technology that is now being shipped by Intel. We introduce a general framework for reasoning about data migration in a multi-tier storage hierarchy. We illustrate the limitations of the optimizations used in Hymem on Optane and then discuss how Spitfire circumvents them. We demonstrate that the data migration policy has to be tailored to the characteristics of the devices and the workload. Given this, we present a machine learning technique for automatically adapting the policy to an arbitrary workload and storage hierarchy. Our experiments show that Spitfire works well across different workloads and storage hierarchies.
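As a rough illustration of the kind of decision such a multi-tier buffer manager makes, the sketch below models the read path across DRAM, NVM, and SSD with lazy (probabilistic) promotion, so that cold pages served from byte-addressable NVM are not automatically copied into DRAM. This is a hypothetical sketch under assumed names and tuning knobs, not Spitfire's or Hymem's actual policy.

```scala
import scala.util.Random

// Hypothetical sketch of a three-tier read path with lazy (probabilistic)
// promotion. The probabilities are tuning knobs, not values from the paper.
sealed trait Device
case object Dram extends Device
case object Nvm  extends Device
case object Ssd  extends Device

final class ThreeTierReadPath(promoteNvmToDram: Double, admitSsdToDram: Double,
                              rng: Random = new Random(42)) {
  /** Returns (device served from, tiers the page is additionally copied into). */
  def read(resident: Device): (Device, List[Device]) = resident match {
    case Dram =>                                   // DRAM hit: nothing to migrate
      (Dram, Nil)
    case Nvm  =>                                   // NVM is byte-addressable: serve in place,
      val copies = if (rng.nextDouble() < promoteNvmToDram) List(Dram) else Nil
      (Nvm, copies)                                // promote to DRAM only occasionally
    case Ssd  =>                                   // an SSD read must be staged in memory:
      val target = if (rng.nextDouble() < admitSsdToDram) Dram else Nvm
      (Ssd, List(target))                          // admit hot pages to DRAM, others to NVM
  }
}
```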
  2. The design of the buffer manager in database management systems (DBMSs) is influenced by the performance characteristics of volatile memory (DRAM) and non-volatile storage (e.g., SSD). The key design assumptions have been that data must be migrated to DRAM for the DBMS to operate on it and that storage is orders of magnitude slower than DRAM. But the arrival of new non-volatile memory (NVM) technologies that are nearly as fast as DRAM invalidates these assumptions. This paper presents techniques for managing and designing a multi-tier storage hierarchy comprising DRAM, NVM, and SSD. Our main technical contributions are a multi-tier buffer manager and a storage system designer that leverage the characteristics of NVM. We propose a set of optimizations for maximizing the utility of data migration between the different devices in the storage hierarchy. We demonstrate that these optimizations have to be tailored to device and workload characteristics. Given this, we present a technique for adapting these optimizations to achieve a near-optimal buffer management policy for an arbitrary workload and storage hierarchy without requiring any manual tuning. Finally, we present a recommendation system for designing a multi-tier storage hierarchy for a target workload and system cost budget. Our results show that the NVM-aware buffer manager and storage system designer improve throughput and reduce system cost across different transaction and analytical processing workloads.
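The storage-system-designer idea can be pictured as a small search over capacity splits under a cost budget: enumerate feasible DRAM/NVM sizes, score each configuration with a workload-specific performance model, and return the best affordable one. The sketch below is a toy stand-in; the prices, the log-shaped performance model, and all names are assumptions, not figures from the paper.

```scala
// Toy sketch of a cost-budget search for a DRAM/NVM/SSD hierarchy. The $/GB
// prices and the performance model are made-up placeholders, not paper data.
object HierarchyDesigner {
  final case class Config(dramGB: Int, nvmGB: Int, ssdGB: Int)

  val pricePerGB = Map("dram" -> 8.0, "nvm" -> 4.0, "ssd" -> 0.3) // illustrative only

  def cost(c: Config): Double =
    c.dramGB * pricePerGB("dram") + c.nvmGB * pricePerGB("nvm") + c.ssdGB * pricePerGB("ssd")

  // Stand-in for a real model (e.g., fitted from miss-ratio-curve profiling).
  def estimatedThroughput(c: Config): Double =
    3.0 * math.log1p(c.dramGB) + 1.5 * math.log1p(c.nvmGB) + 0.2 * math.log1p(c.ssdGB)

  def recommend(budget: Double, ssdGB: Int): Option[Config] = {
    val feasible = for {
      dram <- 1 to 64
      nvm  <- 0 to 512 by 16
      c = Config(dram, nvm, ssdGB)
      if cost(c) <= budget
    } yield c
    if (feasible.isEmpty) None else Some(feasible.maxBy(estimatedThroughput))
  }
}
```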
  3. An emerging use case of machine learning (ML) is to train a model on a high-performance system and deploy the trained model on energy-constrained embedded systems. Neuromorphic hardware platforms, which operate on the principles of the biological brain, can significantly lower the energy overhead of a machine learning inference task, making these platforms an attractive solution for embedded ML systems. We present a design-technology tradeoff analysis for implementing such inference tasks on the processing elements (PEs) of Non-Volatile Memory (NVM)-based neuromorphic hardware. Through detailed circuit-level simulations at scaled process technology nodes, we show the negative impact of technology scaling on information-processing latency, which degrades the quality-of-service (QoS) of an embedded ML system. At a finer granularity, the latency inside a PE depends on 1) the delay introduced by parasitic components on its current paths, and 2) the varying delay to sense different resistance states of its NVM cells. Based on these two observations, we make the following three contributions. First, on the technology front, we propose an optimization scheme where the NVM resistance state that takes the longest time to sense is set on the current paths having the least delay, and vice versa, reducing the average PE latency and thereby improving QoS. Second, on the architecture front, we introduce isolation transistors within each PE to partition it into regions that can be individually power-gated, reducing both latency and energy. Finally, on the system-software front, we propose a mechanism to leverage the proposed technological and architectural enhancements when implementing a machine-learning inference task on the neuromorphic PEs of the hardware. Evaluations with a recent neuromorphic hardware architecture show that our proposed design-technology co-optimization approach improves both the performance and energy efficiency of machine-learning inference tasks without incurring a high cost-per-bit.
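The first contribution, pairing slow-to-sense resistance states with the fastest current paths, is essentially a sort-and-zip assignment. A minimal sketch of that pairing is shown below; the delay numbers and names are made up for illustration and do not come from the paper's circuit simulations.

```scala
// Minimal sketch of the pairing idea: states that take longest to sense go on
// the current paths with the least parasitic delay, and vice versa.
object StateToPathAssignment {
  final case class CurrentPath(id: Int, parasiticDelayNs: Double)
  final case class ResistanceState(level: Int, senseDelayNs: Double)

  /** Pairs each resistance state with a current path and reports the total latency. */
  def assign(paths: Seq[CurrentPath],
             states: Seq[ResistanceState]): Seq[(ResistanceState, CurrentPath, Double)] = {
    val fastestPathsFirst  = paths.sortBy(_.parasiticDelayNs)        // least parasitic delay first
    val slowestStatesFirst = states.sortBy(s => -s.senseDelayNs)     // longest sense time first
    slowestStatesFirst.zip(fastestPathsFirst).map { case (s, p) =>
      (s, p, s.senseDelayNs + p.parasiticDelayNs)
    }
  }
}
```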
  4. In the era of big data and cloud computing, large amounts of data are generated by user applications and need to be processed in the datacenter. Data-parallel computing frameworks, such as Apache Spark, are widely used to perform such data processing at scale. Specifically, Spark leverages distributed memory to cache intermediate results, represented as Resilient Distributed Datasets (RDDs). This gives Spark an advantage over other parallel frameworks for implementations of iterative machine learning and data mining algorithms, by avoiding repeated computation or hard disk accesses to retrieve RDDs. By default, caching decisions are left to the programmer's discretion, and the LRU policy is used for evicting RDDs when the cache is full. However, when the objective is to minimize total work, LRU is woefully inadequate, leading to arbitrarily suboptimal caching decisions. In this paper, we design an algorithm for multi-stage big data processing platforms to adaptively determine and cache the most valuable intermediate datasets that can be reused in the future. Our solution automates the decision of which RDDs to cache: this amounts to identifying the nodes in a directed acyclic graph (DAG) of computations whose outputs should persist in memory. Our experimental results show that our proposed cache optimization solution improves the performance of machine learning applications on Spark, decreasing the total work needed to recompute RDDs by 12%.
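The caching decision described here can be approximated with a simple greedy rule: rank DAG nodes by the recomputation work that caching them would avoid per megabyte, then cache down the list until the memory budget runs out. The sketch below is that greedy stand-in, not the paper's algorithm; node costs, sizes, and reuse counts are assumed inputs.

```scala
// Illustrative greedy sketch (not the paper's algorithm): cache the RDDs whose
// reuse avoids the most recomputation work per MB, subject to a memory budget.
object GreedyRddCaching {
  final case class RddNode(id: String, computeCost: Double, sizeMB: Double, futureUses: Int)

  def choose(nodes: Seq[RddNode], budgetMB: Double): Set[String] = {
    // Work avoided if cached: every reuse beyond the first skips recomputation.
    def benefit(n: RddNode): Double = math.max(n.futureUses - 1, 0) * n.computeCost

    val ranked = nodes.sortBy(n => -benefit(n) / math.max(n.sizeMB, 1e-9)) // benefit per MB
    val chosen = scala.collection.mutable.Set.empty[String]
    var used   = 0.0
    for (n <- ranked if benefit(n) > 0 && used + n.sizeMB <= budgetMB) {
      chosen += n.id
      used   += n.sizeMB
    }
    chosen.toSet
  }
}
```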
  5. In-memory key-value caches are widely used as a performance-critical layer in web applications, disk-based storage, and distributed systems. The Least Recently Used (LRU) replacement policy has become the de facto standard in those systems since it exploits workload locality well. However, LRU implementations can be costly due to the rigid data structures needed to maintain object priority, as well as the locks required for updating object order. Redis, one of the most effective and widely deployed commercial systems, adopts an approximated LRU policy, in which the least recently used item from a small, randomly sampled set of items is chosen for eviction. This random sampling-based policy is lightweight and flexible. We observe that there can be a significant miss ratio gap between exact LRU and random sampling-based LRU under different sampling sizes $K$. Therefore, existing LRU miss ratio curve (MRC) construction techniques cannot be directly applied without loss of accuracy. In this paper, we introduce a new probabilistic stack algorithm named KRR to accurately model random sampling-based LRU, and extend it to handle both fixed- and variable-size objects in key-value caches. We present an efficient stack update algorithm that significantly reduces the expected running time of KRR. To improve the performance of in-memory multi-tenant key-value caches that use random sampling-based replacement, we propose kRedis, a reference locality- and latency-aware memory partitioning scheme. kRedis guides memory allocation among the tenants and dynamically customizes $K$ to better exploit the locality of each individual tenant. Evaluation results over diverse workloads show that our model generates accurate miss ratio curves for both fixed and variable object size workloads, and enables practical, low-overhead online MRC prediction. Equipped with KRR, kRedis delivers up to a 50.2% reduction in average access latency and up to a 262.8% throughput improvement compared to Redis. Furthermore, compared with pRedis, a state-of-the-art memory allocation design for Redis, kRedis achieves up to 24.8% and 61.8% improvements in average access latency and throughput, respectively.
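For readers unfamiliar with Redis's approximated LRU, the sketch below shows the basic mechanism this work models: on eviction, sample a handful of resident keys at random and evict the least recently used key in the sample. It is an illustrative simplification (for instance, it samples with replacement and ignores object sizes) with hypothetical names; KRR itself models the miss ratio of such a policy rather than implementing the cache.

```scala
import scala.collection.mutable
import scala.util.Random

// Illustrative Redis-style approximated LRU: evict the least recently used key
// among a small random sample of size K (here `sampleSize`).
final class SampledLruCache[K, V](capacity: Int, sampleSize: Int,
                                  rng: Random = new Random()) {
  private val values     = mutable.HashMap.empty[K, V]
  private val lastAccess = mutable.HashMap.empty[K, Long]
  private var clock      = 0L

  def get(key: K): Option[V] = {
    val v = values.get(key)
    if (v.isDefined) { clock += 1; lastAccess(key) = clock } // record recency on hit
    v
  }

  def put(key: K, value: V): Unit = {
    if (!values.contains(key) && values.size >= capacity) evictOne()
    values(key) = value
    clock += 1; lastAccess(key) = clock
  }

  private def evictOne(): Unit = {
    val keys   = values.keys.toIndexedSeq
    val sample = Seq.fill(math.min(sampleSize, keys.size))(keys(rng.nextInt(keys.size)))
    val victim = sample.minBy(k => lastAccess.getOrElse(k, 0L)) // oldest access wins eviction
    values.remove(victim); lastAccess.remove(victim)
  }
}
```

A larger sample size makes the behavior approach exact LRU at higher sampling cost, which is precisely why the miss ratio gap studied in the paper depends on $K$.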