NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MITgcm-AD v2: Open source tangent linear and adjoint modeling framework for the oceans and atmosphere enabled by the Automatic Differentiation tool Tapenade

https://doi.org/10.1016/j.future.2024.107512

Gaikwad, Shreyas Sunil; Narayanan, Sri_Hari Krishna; Hascoët, Laurent; Campin, Jean-Michel; Pillar, Helen; Nguyen, An; Hückelheim, Jan; Hovland, Paul; Heimbach, Patrick (February 2025, Future Generation Computer Systems)

Free, publicly-accessible full text available February 1, 2026
QuTracer: Mitigating Quantum Gate and Measurement Errors by Tracing Subsets of Qubits

https://doi.org/10.1109/ISCA59077.2024.00018

Li, Peiyi; Liu, Ji; Gonzales, Alvin; Saleem, Zain Hamid; Zhou, Huiyang; Hovland, Paul (June 2024, IEEE)

Full Text Available
Enhancing Virtual Distillation with Circuit Cutting for Quantum Error Mitigation

https://doi.org/10.1109/ICCD58817.2023.00024

Li, Peiyi; Liu, Ji; Patil, Hrushikesh Pramod; Hovland, Paul; Zhou, Huiyang (November 2023, Proceedings IEEE International Conference on Computer Design)

Full Text Available
Tackling the Qubit Mapping Problem with Permutation-Aware Synthesis

https://doi.org/10.1109/QCE57702.2023.00090

Liu, Ji; Younis, Ed; Weiden, Mathias; Hovland, Paul; Kubiatowicz, John; Iancu, Costin (September 2023, IEEE)

Full Text Available
Model Checking Race-Freedom When “Sequential Consistency for Data-Race-Free Programs” is Guaranteed

https://doi.org/10.1007/978-3-031-37703-7_13

Wu, Wenhao; Hückelheim, Jan; Hovland, Paul D.; Luo, Ziqing; Siegel, Stephen F. (July 2023, International Conference on Computer Aided Verification)
Enea, Constantin; Lal, Akash (Ed.)
Many parallel programming models guarantee that if all sequentially consistent (SC) executions of a program are free of data races, then all executions of the program will appear to be sequentially consistent. This greatly simplifies reasoning about the program, but leaves open the question of how to verify that all SC executions are race-free. In this paper, we show that with a few simple modifications, model checking can be an effective tool for verifying race-freedom. We explore this technique on a suite of C programs parallelized with OpenMP.
more » « less
Full Text Available
QContext: Context-Aware Decomposition for Quantum Gates

https://doi.org/10.1109/ISCAS46773.2023.10181370

Liu, Ji; Bowman, Max; Gokhale, Pranav; Dangwal, Siddharth; Larson, Jeffrey; Chong, Frederic T.; Hovland, Paul D. (May 2023, 2023 IEEE International Symposium on Circuits and Systems (ISCAS))

Full Text Available
Transfer-learning-based Autotuning using Gaussian Copula

https://doi.org/10.1145/3577193.3593712

Randall, Thomas; Koo, Jaehoon; Videau, Brice; Kruse, Michael; Wu, Xingfu; Hovland, Paul; Hall, Mary; Ge, Rong; Balaprakash, Prasanna (June 2023, ACM)

As diverse high-performance computing (HPC) systems are built, many opportunities arise for applications to solve larger problems than ever before. Given the significantly increased complexity of these HPC systems and application tuning, empirical performance tuning, such as autotuning, has emerged as a promising approach in recent years. Despite its effectiveness, autotuning is often a computationally expensive approach. Transfer learning (TL)-based autotuning seeks to address this issue by leveraging the data from prior tuning. Current TL methods for autotuning spend significant time modeling the relationship between parameter configurations and performance, which is ineffective for few-shot (that is, few empirical evaluations) tuning on new tasks. We introduce the first generative TL-based autotuning approach based on the Gaussian copula (GC) to model the high-performing regions of the search space from prior data and then generate high-performing configurations for new tasks. This allows a sampling-based approach that maximizes few-shot performance and provides the first probabilistic estimation of the few-shot budget for effective TL-based autotuning. We compare our generative TL approach with state-of-the-art autotuning techniques on several benchmarks. We find that the GC is capable of achieving 64.37% of peak few-shot performance in its first evaluation. Furthermore, the GC model can determine a few-shot transfer budget that yields up to 33.39X speedup, a dramatic improvement over the 20.58X speedup using prior techniques.
more » « less
Full Text Available
Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation

https://doi.org/10.1109/SC41404.2022.00065

Moses, William S.; Narayanan, Sri Hari; Paehler, Ludger; Churavy, Valentin; Schanen, Michel; Hückelheim, Jan; Doerfert, Johannes; Hovland, Paul (November 2022, IEEE)

Full Text Available
Verifying Fortran Programs with CIVL

https://doi.org/10.1007/978-3-030-99524-9_6

Wu, Wenhao; Hückelheim, Jan; Hovland, Paul D.; Siegel, Stephen F. (March 2022, TACAS 2022: Tools and Algorithms for the Construction and Analysis of Systems)
Fisman, Dana; Rosu, Grigore (Ed.)
Fortran is widely used in computational science, engineering, and high performance computing. This paper presents an extension to the CIVL verification framework to check correctness properties of Fortran programs. Unlike previous work that translates Fortran to C, LLVM IR, or other intermediate formats before verification, our work allows CIVL to directly consume Fortran source files. We extended the parsing, translation, and analysis phases to support Fortran-specific features such as array slicing and reshaping, and to find program violations that are specific to Fortran, such as argument aliasing rule violations, invalid use of variable and function attributes, or defects due to Fortran's unspecified expression evaluation order. We demonstrate the usefulness of our tool on a verification benchmark suite and kernels extracted from a real world application.
more » « less
Full Text Available

Search for: All records