NSF PAR Search | NSF Public Access Repository


Implementing OpenMP’s SIMD Directive in LLVM’s GPU Runtime

https://doi.org/10.1145/3605573.3605640

Wright, Eric; Doerfert, Johannes; Tian, Shilei; Chapman, Barbara; Chandrasekaran, Sunita (August 2023, ACM)

Full Text Available
ECP SOLLVE: Validation and Verification Testsuite Status Update and Compiler Insight for OpenMP

https://doi.org/10.1109/P3HPC56579.2022.00017

Huber, Thomas; Pophale, Swaroop; Baker, Nolan; Carr, Michael; Rao, Nikhil; Reap, Jaydon; Holsapple, Kristina; Davis, Joshua Hoke; Burnus, Tobias; Lee, Seyong; et al. (November 2022, IEEE)

Full Text Available
Analysis of Validating and Verifying OpenACC Compilers 3.0 and Above

https://doi.org/10.1109/WACCPD56842.2022.00006

Jarmusch, Aaron; Liu, Aaron; Munley, Christian; Horta, Daniel; Ravichandran, Vaidhyanathan; Denny, Joel; Friedline, Kyle; Chandrasekaran, Sunita (November 2022, IEEE)

Full Text Available
OpenACC Acceleration of an Agent-Based Biological Simulation Framework

https://doi.org/10.1109/MCSE.2022.3226602

Stack, Matt; Macklin, Paul; Searles, Robert; Chandrasekaran, Sunita (September 2022, Computing in Science & Engineering)

Full Text Available
SPEChpc 2021 Benchmark Suites for Modern HPC Systems

https://doi.org/10.1145/3491204.3527498

Li, Junjie; Bobyr, Alexander; Boehm, Swen; Brantley, William; Brunst, Holger; Cavelan, Aurelien; Chandrasekaran, Sunita; Cheng, Jimmy; Ciorba, Florina M.; Colgrove, Mathew; et al. (July 2022, Companion of the 2022 ACM/SPEC International Conference on Performance Engineering)

The SPEChpc™ 2021 suites are application-based benchmarks designed to measure the performance of modern HPC systems. The benchmarks support MPI, MPI+OpenMP, MPI+OpenMP target offload, and MPI+OpenACC, and are portable across all major HPC platforms.
Full Text Available
First Experiences in Performance Benchmarking with the New SPEChpc 2021 Suites

https://doi.org/10.1109/CCGrid54584.2022.00077

Brunst, Holger; Chandrasekaran, Sunita; Ciorba, Florina M.; Hagerty, Nick; Henschel, Robert; Juckeland, Guido; Li, Junjie; Vergara, Veronica G.; Wienke, Sandra; Zavala, Miguel (May 2022, 2022 22nd International Symposium on Cluster, Cloud and Internet Computing (CCGrid))

Modern High Performance Computing (HPC) systems are built with innovative system architectures and novel programming models to further push the speed limit of computing. The increased complexity poses challenges for performance portability and performance evaluation. The Standard Performance Evaluation Corporation (SPEC) has a long history of producing industry-standard benchmarks for modern computer systems. SPEC’s newly released SPEChpc 2021 benchmark suites, developed by the High Performance Group, are a bold attempt to provide a fair and objective benchmarking tool designed for state-of-the-art HPC systems. With the support of multiple host and accelerator programming models, the suites are portable across both homogeneous and heterogeneous architectures. Different workloads are developed to fit system sizes ranging from a few compute nodes to a few hundred compute nodes. In this work we present our first experiences in performance benchmarking the new SPEChpc 2021 suites and evaluate their portability and basic performance characteristics on various popular and emerging HPC architectures, including x86 CPU, NVIDIA GPU, and AMD GPU. This study provides a first-hand experience of executing the SPEChpc 2021 suites at scale on production HPC systems, discusses real-world use cases, and serves as an initial guideline for using the benchmark suites.
Full Text Available
Metrics and Design of an Instruction Roofline Model for AMD GPUs

https://doi.org/10.1145/3505285

Leinhauser, Matthew; Widera, René; Bastrakov, Sergei; Debus, Alexander; Bussmann, Michael; Chandrasekaran, Sunita (March 2022, ACM Transactions on Parallel Computing)

Due to the recent announcement of the Frontier supercomputer, many scientific application developers are working to make their applications compatible with AMD (CPU-GPU) architectures, which means moving away from the traditional CPU and NVIDIA-GPU systems. Due to the current limitations of profiling tools for AMD GPUs, this shift leaves a void in how to measure application performance on AMD GPUs. In this article, we design an instruction roofline model for AMD GPUs using AMD’s ROCProfiler and a benchmarking tool, BabelStream (the HIP implementation), as a way to measure an application’s performance in instructions and memory transactions on new AMD hardware. Specifically, we create instruction roofline models for a case study scientific application, PIConGPU, an open-source particle-in-cell simulation application used for plasma and laser-plasma physics, on the NVIDIA V100, AMD Radeon Instinct MI60, and AMD Instinct MI100 GPUs. When looking at the performance of multiple kernels of interest in PIConGPU, we find that although the AMD MI100 GPU achieves a similar, or better, execution time compared to the NVIDIA V100 GPU, profiling tool differences make comparing the performance of these two architectures hard. When looking at execution time, GIPS, and instruction intensity, the AMD MI60 achieves the worst performance out of the three GPUs used in this work.
Full Text Available
Accelerating prediction of chemical shift of protein structures on GPUs: Using OpenACC

https://doi.org/10.1371/journal.pcbi.1007877

Wright, Eric; Ferrato, Mauricio H.; Bryer, Alexander J.; Searles, Robert; Perilla, Juan R.; Chandrasekaran, Sunita; Schneidman-Duhovny, Dina (May 2020, PLOS Computational Biology)

Full Text Available
MPI + OpenACC: Accelerating radiation transport mini-application, minisweep, on heterogeneous systems

https://doi.org/10.1016/j.cpc.2018.10.007

Searles, Robert; Chandrasekaran, Sunita; Joubert, Wayne; Hernandez, Oscar (March 2019, Computer Physics Communications)

Full Text Available
