NSF PAR Search | NSF Public Access Repository

Fusion: An Analytics Object Store Optimized for Query Pushdown

Lu, Jianan; Raina, Ashwini; Cidon, Asaf; Freedman, Michael J (March 2025, ASPLOS '25: Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

The prevalence of disaggregated storage in public clouds has led to increased latency in modern OLAP cloud databases, particularly when handling ad-hoc and highly-selective queries on large objects. To address this, cloud databases have adopted computation pushdown, executing query predicates closer to the storage layer. However, existing pushdown solutions are inefficient in erasure-coded storage. Cloud storage employs erasure coding that partitions analytics file objects into fixed-sized blocks and distributes them across storage nodes. Consequently, when a specific part of the object is queried, the storage system must reassemble the object across nodes, incurring significant network latency. In this work, we present Fusion, an object store for analytics that is optimized for query pushdown on erasure-coded data. It co-designs its erasure coding and file placement topologies, taking into account popular analytics file formats (e.g., Parquet). Fusion employs a novel stripe construction algorithm that prevents fragmentation of computable units within an object, and minimizes storage overhead during erasure coding. Compared to existing erasure-coded stores, Fusion improves median and tail latency by 64% and 81%, respectively, on TPC-H, and up to 40% and 48% respectively, on real-world SQL queries. Fusion achieves this while incurring a modest 1.2% storage overhead compared to the optimal.
Free, publicly-accessible full text available March 30, 2026
Characterizing the Networks Sending Enterprise Phishing Emails

https://doi.org/10.1007/978-3-031-85960-1_18

Luo, Elisa; Young, Liane; Ho, Grant; Afifi, M H; Schweighauser, Marco; Katz-Bassett, Ethan; Cidon, Asaf (January 2025, Springer Nature Switzerland)

Free, publicly-accessible full text available January 1, 2026
Managing Memory Tiers with CXL in Virtualized Environments

Zhong, Yuhong; Berger, Daniel S; Agarwal, Ishwar; Agarwal, Rajat; Hady, Frank; Waldspurger, Carl; Wee, Ryan; Kumar, Karthik; Hill, Mark D; Chowdhury, Mosharaf; et al (July 2024, USENIX OSDI)

Full Text Available
Treehouse: A Case For Carbon-Aware Datacenter Software

https://doi.org/10.1145/3630614.3630626

Anderson, Thomas; Belay, Adam; Chowdhury, Mosharaf; Cidon, Asaf; Zhang, Irene (October 2023, ACM SIGEnergy Energy Informatics Review)

The end of Dennard scaling and the slowing of Moore's Law has put the energy use of datacenters on an unsustainable path. Datacenters are already a significant fraction of worldwide electricity use, with application demand scaling at a rapid rate. We argue that substantial reductions in the carbon intensity of datacenter computing are possible with a software-centric approach: by making energy and carbon visible to application developers on a fine-grained basis, by modifying system APIs to make it possible to make informed trade offs between performance and carbon emissions, and by raising the level of application programming to allow for flexible use of more energy efficient means of compute and storage. We also lay out a research agenda for systems software to reduce the carbon footprint of datacenter computing.
Full Text Available
Karma: Resource Allocation for Dynamic Demands

Vuppalapati, Midhul; Fikioris, Giannis; Agarwal, Rachit; Cidon, Asaf; Khandelwal, Anurag; Tardos, Eva (July 2023, USENIX Symposium on Operating Systems Design and Implementation)

Full Text Available
Efficient Compactions between Storage Tiers with PrismDB

https://doi.org/10.1145/3582016.3582052

Raina, Ashwini; Lu, Jianan; Cidon, Asaf; Freedman, Michael J. (March 2023, ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

In recent years, emerging storage hardware technologies have focused on divergent goals: better performance or lower cost-per-bit. Correspondingly, data systems that employ these technologies are typically optimized either to be fast (but expensive) or cheap (but slow). We take a different approach: by architecting a storage engine to natively utilize two tiers of fast and low-cost storage technologies, we can achieve a Pareto efficient balance between performance and cost-per-bit. This paper presents the design and implementation of PrismDB, a novel key-value store that exploits two extreme ends of the spectrum of modern NVMe storage technologies (3D XPoint and QLC NAND) simultaneously. Our key contribution is how to efficiently migrate and compact data between two different storage tiers. Inspired by the classic cost-benefit analysis of log cleaning, we develop a new algorithm for multi-tiered storage compaction that balances the benefit of reclaiming space for hot objects in fast storage with the cost of compaction I/O in slow storage. Compared to the standard use of RocksDB on flash in datacenters today, PrismDB’s average throughput on tiered storage is 3.3x faster, its read tail latency is 2x better, and it is 5x more durable using equivalently-priced hardware.
Full Text Available
Memtrade: Marketplace for Disaggregated Memory Clouds

https://doi.org/10.1145/3589985

Maruf, Hasan Al; Zhong, Yuhong; Wang, Hongyi; Chowdhury, Mosharaf; Cidon, Asaf; Waldspurger, Carl (May 2023, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

We present Memtrade, the first practical marketplace for disaggregated memory clouds. Clouds introduce a set of unique challenges for resource disaggregation across different tenants, including resource harvesting, isolation, and matching. Memtrade allows producer virtual machines (VMs) to lease both their unallocated memory and allocated-but-idle application memory to remote consumer VMs for a limited period of time. Memtrade does not require any modifications to host-level system software or support from the cloud provider. It harvests producer memory using an application-aware control loop to form a distributed transient remote memory pool with minimal performance impact; it employs a broker to match producers with consumers while satisfying performance constraints; and it exposes the matched memory to consumers through different abstractions. As a proof of concept, we propose two such memory access interfaces for Memtrade consumers -- a transient KV cache for specified applications and a swap interface that is application-transparent. Our evaluation using real-world cluster traces shows that Memtrade provides significant performance benefit for consumers (improving average read latency up to 2.8X) while preserving confidentiality and integrity, with little impact on producer applications (degrading performance by less than 2.1%).
Full Text Available
BPF-oF: Storage Function Pushdown Over the Network

Zarkadas, Ioannis; Zussman, Tal; Carin, Jeremy; Jiang, Sheng; Zhong, Yuhong; Pfefferle, Jonas; Franke, Hubertus; Yang, Junfeng; Kaffes, Kostis; Stutsman, Ryan; et al (September 2023, arXiv)

Full Text Available
Treehouse: A Case For Carbon-Aware Datacenter Software

Anderson, Thomas; Belay, Adam; Chowdhury, Mosharaf; Cidon, Asaf; Zhang, Irene (July 2022, HotCarbon: 1st Workshop on Sustainable Computer Systems Design and Implementation)

The end of Dennard scaling and the slowing of Moore’s Law has put the energy use of datacenters on an unsustainable path. Datacenters are already a significant fraction of worldwide electricity use, with application demand scaling at a rapid rate. We argue that substantial reductions in the carbon intensity of datacenter computing are possible with a software-centric approach: by making energy and carbon visible to application developers on a fine-grained basis, by modifying system APIs to make it possible to make informed trade offs between performance and carbon emissions, and by raising the level of application programming to allow for flexible use of more energy efficient means of compute and storage. We also lay out a research agenda for systems software to reduce the carbon footprint of datacenter computing.
Full Text Available
XRP: In-Kernel Storage Functions with eBPF

Zhong, Yuhong; Li, Haoyu; Wu, Yu Jian; Zarkadas, Ioannis; Tao, Jeffrey; Mesterhazy, Evan; Makris, Michael; Yang, Junfeng; Tai, Amy; Stutsman, Ryan; et al (July 2022, Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation)

With the emergence of microsecond-scale NVMe storage devices, the Linux kernel storage stack overhead has become significant, almost doubling access times. We present XRP, a framework that allows applications to execute user-defined storage functions, such as index lookups or aggregations, from an eBPF hook in the NVMe driver, safely bypassing most of the kernel’s storage stack. To preserve file system semantics, XRP propagates a small amount of kernel state to its NVMe driver hook where the user-registered eBPF functions are called. We show how two key-value stores, BPF-KV, a simple B+-tree key-value store, and WiredTiger, a popular log-structured merge tree storage engine, can leverage XRP to significantly improve throughput and latency.
Full Text Available
