NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Lining up Garbage Collection and Application for a Far-Memory-Friendly Runtime

https://doi.org/10.1145/3749283

Li, Shengkai; Wang, Chenxi; Xue, Haonan; Ma, Haoran; Liu, Shi; Qiao, Yifan; Eyolfson, Jonathan; Navasca, Christian; Lu, Shan; Xu, Harry (August 2025, ACM Transactions on Computer Systems)

Far-memory techniques that enable applications to use remote memory are increasingly appealing in modern data centers, supporting applications’ large memory footprint and improving machines’ resource utilization. Unfortunately, most far-memory techniques focus on OS-level optimizations and are agnostic to managed runtimes and garbage collections (GC) underneath applications written in high-level languages. With different object-access patterns from applications, GC can severely interfere with existing far-memory techniques, breaking remote memory prefetching algorithms and causing severe local-memory misses. We developed MemLiner, a runtime technique that improves the performance of far-memory systems by aligning memory accesses from application and GC threads so that they follow similar memory access paths, thereby (1) reducing the local-memory working set and (2) improving remote-memory prefetching through simplified memory access patterns. We implemented MemLiner in two widely used GCs in OpenJDK: G1 and Shenandoah. Our evaluation with a range of widely deployed cloud systems shows that MemLiner improves applications’ end-to-end performance by up to3.3×and reduces applications’ tail latency by up to220.0×.
more » « less
Free, publicly-accessible full text available August 31, 2026
ExChain: exception dependency analysis for root cause diagnosis

Li, Ao; Lu, Shan; Nath, Suman; Padhye, Rohan; Sekar, Vyas (August 2024, Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 2024))

Full Text Available
If At First You Don’t Succeed, Try, Try, Again...? Insights and LLM-informed Tooling for Detecting Retry Bugs in Software Systems

https://doi.org/10.1145/3694715.3695971

Stoica, Bogdan Alexandru; Sethi, Utsav; Su, Yiming; Zhou, Cyrus; Lu, Shan; Mace, Jonathan; Musuvathi, Madanlal; Nath, Suman (November 2024, ACM)

Full Text Available
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications

Liu, Yuhan; Wan, Chengcheng; Du, Kuntai; Hoffmann, Henry; Jiang, Junchen; Lu, Shan; Maire, Michael (July 2024, Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation)

ML APIs have greatly relieved application developers of the burden to design and train their own neural network models—classifying objects in an image can now be as simple as one line of Python code to call an API. However, these APIs offer the same pre-trained models regardless of how their output is used by different applications. This can be suboptimal as not all ML inference errors can cause application failures, and the distinction between inference errors that can or cannot cause failures varies greatly across applications. To tackle this problem, we first study 77 real-world applications, which collectively use six ML APIs from two providers, to reveal common patterns of how ML API output affects applications' decision processes. Inspired by the findings, we propose ChameleonAPI, an optimization framework for ML APIs, which takes effect without changing the application source code. ChameleonAPI provides application developers with a parser that automatically analyzes the application to produce an abstract of its decision process, which is then used to devise an application-specific loss function that only penalizes API output errors critical to the application. ChameleonAPI uses the loss function to efficiently train a neural network model customized for each application and deploys it to serve API invocations from the respective application via existing interface. Compared to a baseline that selects the best-of-all commercial ML API, we show that ChameleonAPI reduces incorrect application decisions by 43%.
more » « less
Full Text Available
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications

Liu, Yuhan; Wan, Chengcheng; Du, Kuntai; Hoffmann, Henry; Jiang, Junchen; Lu, Shan; Maire, Michael (July 2024, USENIX, OSDI 2024)

Full Text Available
ExChain: Exception Dependency Analysis for Root Cause Diagnosis

Li, Ao; Lu, Shan; Nath; Suman; Padhye, Rohan; Sekar, Vyas (March 2024, USENIX)

Full Text Available
A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

Chen, Lei; Liu, Shi; Wang, Chenxi; Ma, Haoran; Qiao, Yifan; Wang, Zhe; Wu, Chenggang; Lu, Youyou; Feng, Xiaobing; Cui, Huimin; et al (July 2024, USENIX Association)

Full Text Available
A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

Chen, Lei; Liu, Shi; Wang, Chenxi; Ma, Haoran; Qiao, Yifan; Wang, Zhe; Wu, Chenggang; Lu, Youyou; Feng, Xiaobing; Cui, Huimin; et al (July 2024, USENIX Association)

Full Text Available
A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

Chen, Lei; Liu, Shi; Wang, Chenxi; Ma, Haoran; Qiao, Yifan; Wang, Zhe; Wu, Chenggang; Lu, Youyou; Feng, Xiaobing; Cui, Huimin; et al (July 2024, USENIX Association)

Full Text Available
A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

Chen, Lei; Liu, Shi; Wang, Chenxi; Ma, Haoran; Qiao, Yifan; Wang, Zhe; Wu, Chenggang; Lu, Youyou; Feng, Xiaobing; Cui, Huimin; et al (July 2024, USENIX Association)

Full Text Available

« Prev Next »

Search for: All records