SwapAdvisor: Pushing Deep Learning Beyond the GPU Memory Limit via Smart Swapping

Huang, Chien-Chin; Jin, Gu; Li, Jinyang

doi:10.1145/3373376.3378530

Deeper and wider neural networks are known to achieve better accuracy, but limited GPU memory makes it difficult to keep increasing model size. One promising solution is to support swapping between GPU and CPU memory. However, existing work on swapping handles only certain models and does not achieve satisfactory performance. Deep learning computation is commonly expressed as a dataflow graph, which can be analyzed to improve swapping. We propose SwapAdvisor, which performs joint optimization along three dimensions based on a given dataflow graph: operator scheduling, memory allocation, and swap decisions. SwapAdvisor explores the vast search space using a custom-designed genetic algorithm. Evaluations using a variety of large models show that SwapAdvisor can train models up to 12 times the GPU memory limit while achieving 53-99% of the throughput of a hypothetical baseline with infinite GPU memory.

Award ID(s): 1816717

NSF-PAR ID: 10191573

Author(s) / Creator(s): Huang, Chien-Chin; Jin, Gu; Li, Jinyang

Date Published: 2020-03-09

Journal Name: International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)

Page Range / eLocation ID: 1341 to 1355

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text: Accepted Manuscript 1.0

Conference Paper: https://doi.org/10.1145/3373376.3378530