NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

D-Rex: Heterogeneity-Aware Reliability Framework and Adaptive Algorithms for Distributed Storage

https://doi.org/10.1145/3721145.3730412

Gonthier, Maxime; Sanchez-Gallegos, Dante D; Pan, Haochen; Nicolae, Bogdan; Zhou, Sicheng; Nguyen, Hai Duc; Hayot-Sasson, Valerie; Pauloski, Greg; Carretero, Jesus; Chard, Kyle; et al (June 2025, ACM)

Free, publicly-accessible full text available June 8, 2026
Object Proxy Patterns for Accelerating Distributed Applications

https://doi.org/10.1109/TPDS.2024.3511347

Pauloski, J Gregory; Hayot-Sasson, Valerie; Ward, Logan; Brace, Alexander; Bauer, André; Chard, Kyle; Foster, Ian (February 2025, IEEE Transactions on Parallel and Distributed Systems)

Free, publicly-accessible full text available February 1, 2026
X-OpenMP — eXtreme fine-grained tasking using lock-less work stealing

https://doi.org/10.1016/j.future.2024.05.019

Nookala, Poornima; Chard, Kyle; Raicu, Ioan (October 2024, Future Generation Computer Systems)

Full Text Available
TaPS: A Performance Evaluation Suite for Task-based Execution Frameworks

https://doi.org/10.1109/e-Science62913.2024.10678702

Pauloski, J Gregory; Hayot-Sasson, Valerie; Gonthier, Maxime; Hudson, Nathaniel; Pan, Haochen; Zhou, Sicheng; Foster, Ian; Chard, Kyle (September 2024, IEEE)

Full Text Available
SCIPIS: Scalable and concurrent persistent indexing and search in high-end computing systems

https://doi.org/10.1016/j.jpdc.2024.104878

Orhean, Alexandru Iulian; Giannakou, Anna; Ramakrishnan, Lavanya; Chard, Kyle; Glavic, Boris; Raicu, Ioan (July 2024, Journal of Parallel and Distributed Computing)

Full Text Available
Accelerating Function-Centric Applications by Discovering, Distributing, and Retaining Reusable Context in Workflow Systems

https://doi.org/10.1145/3625549.3658663

Phung, Thanh Son; Thomas, Colin; Ward, Logan; Chard, Kyle; Thain, Douglas (June 2024, ACM)

Workflow systems provide a convenient way for users to write large-scale applications by composing independent tasks into large graphs that can be executed concurrently on high-performance clus- ters. In many newer workflow systems, tasks are often expressed as a combination of function invocations in a high-level language. Because necessary code and data are not statically known prior to execution, they must be moved into the cluster at runtime. An obvious way of doing this is to translate function invocations into self-contained executable programs and run them as usual, but this brings a hefty performance penalty: a function invocation now needs to piggyback its context with extra code and data to a remote node, and the remote node needs to take extra time to reconstruct the invocation’s context before executing it, both detrimental to lightweight short-running functions. A better solution for workflow systems is to treat functions and invocations as first-class abstractions: subsequent invocations of the same function on a worker node should only pay for the cost of context setup once and reuse the context between different invocations. The remaining problems lie in discovering, distributing, and retaining the reusable context among workers. In this paper, we discuss the rationale and design requirement of these mechanisms to support context reuse, and implement them in TaskVine, a data- intensive distributed framework and execution engine. Our results from executing a large-scale neural network inference application and a molecular design application show that treating functions and invocations as first-class abstractions reduces the execution time of the applications by 94.5% and 26.9%, respectively.
more » « less
Full Text Available
UniFaaS: Programming across Distributed Cyberinfrastructure with Federated Function Serving

https://doi.org/10.1109/IPDPS57955.2024.00027

Li, Yifei; Chard, Ryan; Babuji, Yadu; Chard, Kyle; Foster, Ian; Li, Zhuozhao (May 2024, IEEE)

Full Text Available
Establishing a High-Performance and Productive Ecosystem for Distributed Execution of Python Functions Using Globus Compute

https://doi.org/10.1109/SCW63240.2024.00083

Ananthakrishnan, Rachana; Babuji, Yadu; Bryan, Josh; Chard, Kyle; Chard, Ryan; Clifford, Ben; Foster, Ian; Gorenstein, Lev; Kesling, Kevin Hunter; Janidlo, Chris; et al (November 2024, IEEE)

Free, publicly-accessible full text available November 17, 2025
An Empirical Investigation of Container Building Strategies and Warm Times to Reduce Cold Starts in Scientific Computing Serverless Functions

https://doi.org/10.1109/e-Science62913.2024.10678668

Bauer, André; Gonthier, Maxime; Pan, Haochen; Chard, Ryan; Grzenda, Daniel; Straesser, Martin; Pauloski, J Gregory; Kamatar, Alok; Baughman, Matt; Hudson, Nathaniel; et al (September 2024, IEEE)

Full Text Available
The globus compute dataset: An open function-as-a-service dataset from the edge to the cloud

https://doi.org/10.1016/j.future.2023.12.007

Bauer, André; Pan, Haochen; Chard, Ryan; Babuji, Yadu; Bryan, Josh; Tiwari, Devesh; Foster, Ian; Chard, Kyle (April 2024, Future Generation Computer Systems)

Full Text Available

« Prev Next »

Search for: All records