NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Systematic CXL Memory Characterization and Performance Analysis at Scale

https://doi.org/10.1145/3676641.3715987

Liu, Jinshu; Hadian, Hamid; Wang, Yuyue; Berger, Daniel S; Nguyen, Marie; Jian, Xun; Noh, Sam H; Li, Huaicheng (March 2025, ACM)

Free, publicly-accessible full text available March 30, 2026
Dissecting CXL Memory Performance at Scale: Analysis, Modeling, and Optimization

Liu, Jinshu; Hadian, Hamid; Xu, Hanchen; Berger, Daniel S; Li, Huaicheng (September 2024, arXivorg)

Full Text Available
Coach: Exploiting Temporal Patterns for All-Resource Oversubscription in Cloud Platforms

https://doi.org/10.1145/3669940.3707226

Reidys, Benjamin; Zardoshti, Pantea; Goiri, Íñigo; Irvene, Celine; Berger, Daniel S; Ma, Haoran; Arya, Kapil; Cortez, Eli; Stark, Taylor; Bak, Eugene; et al (February 2025, ACM)

Free, publicly-accessible full text available February 3, 2026
Managing Memory Tiers with CXL in Virtualized Environments

Zhong, Yuhong; Berger, Daniel S; Agarwal, Ishwar; Agarwal, Rajat; Hady, Frank; Waldspurger, Carl; Wee, Ryan; Kumar, Karthik; Hill, Mark D; Chowdhury, Mosharaf; et al (July 2024, USENIX OSDI)

Full Text Available
Designing Cloud Servers for Lower Carbon

https://doi.org/10.1109/ISCA59077.2024.00041

Wang, Jaylen; Berger, Daniel S; Kazhamiaka, Fiodar; Irvene, Celine; Zhang, Chaojie; Choukse, Esha; Frost, Kali; Fonseca, Rodrigo; Warrier, Brijesh; Bansal, Chetan; et al (June 2024, IEEE)

To mitigate climate change, we must reduce carbon emissions from hyperscale cloud computing. We find that cloud compute servers cause the majority of emissions in a general-purpose cloud. Thus, we motivate designing carbon-efficient compute server SKUs, or GreenSKUs, using recently-available low-carbon server components. To this end, we design and build three GreenSKUs using low-carbon components, such as energy-efficient CPUs, reused old DRAM via CXL, and reused old SSDs. We detail several challenges that limit GreenSKUs, carbon savings at scale and may prevent their adoption by cloud providers. To address these challenges, we develop a novel methodology and associated framework, GSF (GreenSKU Framework), that enables a cloud provider to systematically evaluate a GreenSKU’s carbon savings at scale. We implement GSF within Microsoft Azure’s production constraints to evaluate our three GreenSKUs’ carbon savings. Using GSF, we show that our most carbon-efficient GreenSKU reduces emissions per core by 28% compared to currently-deployed cloud servers. When designing GreenSKUs to meet applications’ performance requirements, we reduce emissions by 15%. When incorporating overall data center overheads, our GreenSKU reduces Azure’s net cloud emissions by 8%.
more » « less
Full Text Available
Baleen: ML Admission & Prefetching for Flash Caches

Wong, Daniel Lin-Kit; Wu, Hao; Molder, Carson; Gunasekar, Sathya; Lu, Sathya; Khandkar, Snehal; Sharma, Abhinav; Berger, Daniel S; Beckmann, Nathan; Ganger, Gregory R (February 2024, Usenix)

Flash caches are used to reduce peak backend load for throughput-constrained data center services, reducing the total number of backend servers required. Bulk storage systems are a large-scale example, backed by high-capacity but low-throughput hard disks, and using flash caches to provide a more cost-effective storage layer underlying everything from blobstores to data warehouses. However, flash caches must address the limited write endurance of flash by limiting the long-term average flash write rate to avoid premature wearout. To do so, most flash caches must use admission policies to filter cache insertions and maximize the workload-reduction value of each flash write. The Baleen flash cache uses coordinated ML admission and prefetching to reduce peak backend load. After learning painful lessons with our early ML policy attempts, we exploit a new cache residency model (which we call episodes) to guide model training. We focus on optimizing for an end-to-end system metric (Disk-head Time) that measures backend load more accurately than IO miss rate or byte miss rate. Evaluation using Meta traces from seven storage clusters shows that Baleen reduces Peak Disk-head Time (and hence the number of backend hard disks required) by 12% over state-of-the-art policies for a fixed flash write rate constraint. Baleen-TCO, which chooses an optimal flash write rate, reduces our estimated total cost of ownership (TCO) by 17%. Code and traces are available at https://www.pdl.cmu.edu/CILES/.
more » « less
Full Text Available
CompuCache: Remote Computable Caching using Spot VMs

Zhang, Qizhen; Bernstein, Philip A.; Berger, Daniel S.; Chandramouli, Badrish; Liu, Vincent; Loo, Boon Thau (January 2022, Annual Conference on Innovative Data Systems Research (CIDR ’22))

Full Text Available
Towards a Cost vs. Quality Sweet Spot for Monitoring Networks

https://doi.org/10.1145/3484266.3487390

Yaseen, Nofel; Arzani, Behnaz; Chintalapudi, Krishna; Ranganathan, Vaishnavi; Frujeri, Felipe; Hsieh, Kevin; Berger, Daniel S.; Liu, Vincent; Kandula, Srikanth (November 2021, Proceedings of the Twentieth ACM Workshop on Hot Topics in Networks)

Full Text Available
Kangaroo: Caching Billions of Tiny Objects on Flash

https://doi.org/10.1145/3477132.3483568

McAllister, Sara; Berg, Benjamin; Tutuncu-Macias, Julian; Yang, Juncheng; Gunasekar, Sathya; Lu, Jimmy; Berger, Daniel S.; Beckmann, Nathan; Ganger, Gregory R. (October 2021, Symposium on Operating Systems Principles)
null (Ed.)
Full Text Available
We need kernel interposition over the network dataplane

https://doi.org/10.1145/3458336.3465281

Sadok, Hugo; Zhao, Zhipeng; Choung, Valerie; Atre, Nirav; Berger, Daniel S.; Hoe, James C.; Panda, Aurojit; Sherry, Justine (June 2021, HotOS '21: Proceedings of the Workshop on Hot Topics in Operating Systems)
null (Ed.)
Kernel-bypass network APIs, which allow applications to circumvent the kernel and interface directly with the NIC hardware, have recently emerged as one of the main tools for improving application network performance. However, allowing applications to circumvent the kernel makes it impossible to use tools (e.g., tcpdump) or impose policies (e.g., QoS and filters) that need to consider traffic sent by different applications running on a host. This makes maintainability and manageability a challenge for kernel-bypass applications. In response we propose Kernel On-Path Interposition (KOPI), in which traditional kernel dataplane functionality is retained but implemented in a fully programmable SmartNIC. We hypothesize that KOPI can support the same tools and policies as the kernel stack while retaining the performance benefits of kernel bypass.
more » « less
Full Text Available

« Prev Next »

Search for: All records