HOMP: Automated Distribution of Parallel Loops and Data in Highly Parallel Accelerator-Based Systems

Yan, Yonghong; Liu, Jiawen; Cameron, Kirk W.; Umar, Mariam

doi:10.1109/IPDPS.2017.99

Citation Details

HOMP: Automated Distribution of Parallel Loops and Data in Highly Parallel Accelerator-Based Systems

Heterogeneous computing systems, e.g., those with accelerators than the host CPUs, offer the accelerated performance for a variety of workloads. However, most parallel programming models require platform dependent, time-consuming hand-tuning efforts for collectively using all the resources in a system to achieve efficient results. In this work, we explore the use of OpenMP parallel language extensions to empower users with the ability to design applications that automatically and simultaneously leverage CPUs and accelerators to further optimize use of available resources. We believe such automation will be key to ensuring codes adapt to increases in the number and diversity of accelerator resources for future computing systems. The proposed system combines language extensions to OpenMP, load-balancing algorithms and heuristics, and a runtime system for loop distribution across heterogeneous processing elements. We demonstrate the effectiveness of our automated approach to program on systems with multiple CPUs, GPUs, and MICs. more »

Award ID(s):: 1409946 1551182 1422961

PAR ID:: 10050479

Author(s) / Creator(s):: Yan, Yonghong; Liu, Jiawen; Cameron, Kirk W.; Umar, Mariam

Date Published:: 2017-05-01

Journal Name:: Parallel and Distributed Processing Symposium (IPDPS), 2017 IEEE International

Page Range / eLocation ID:: 788 to 798

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/IPDPS.2017.99

More Like this