NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

TLF: Transactional Lock Fusion

https://doi.org/10.1145/3694906.3743341

Blelloch, Guy E; Kent, Zachary; Wei, Yuanhao (July 2025, ACM)

Free, publicly-accessible full text available July 16, 2026
CLEANN: Lock-Free Augmented Trees for Low-Dimensional κ-Nearest Neighbor Search

https://doi.org/10.1145/3694906.3743339

Manohar, Magdalen Dobson; Wei, Yuanhao; Blelloch, Guy E (July 2025, ACM)

Free, publicly-accessible full text available July 16, 2026
VERLIB: Concurrent Versioned Pointers

https://doi.org/10.1145/3627535.3638501

Blelloch, Guy E; Wei, Yuanhao (February 2024, ACM Symposium on Principles and Practice of Parallel Programming (PPoPP '24))

Recent work has shown how to augment any CAS-based concurrent data structure to support taking a snapshot of the current memory state. Taking the snapshot, as well as loads and CAS (Compare and Swap) operations, take constant time. Importantly, such snapshotting can be used to easily implement linearizable queries, such as range queries, over any part of a data structure. In this paper, we make two significant improvements over this approach. The first improvement removes a subtle and hard to reason about restriction that was needed to avoid a level of indirection on pointers. We introduce an approach, which we refer to as indirection-on-need, that removes the restriction, but yet almost always avoids indirection. The second improvement is to efficiently support snapshotting with lock-free locks. This requires supporting an idempotent CAS. We show a particularly simple solution to the problem that leverages the data structures used for snapshotting. Based on these ideas we implemented an easy-to-use C++ library, verlib, centered around a versioned pointer type. The library works with lock (standard or lock-free) and CAS based algorithms, or any combination. Converting existing concurrent data-structures to use the library takes minimal effort. We present results for experiments that use verlib to convert state-of-the-art data structures for ordered maps (a B-tree), radix-ordered maps (an ART-tree), and unordered maps (an optimized hash table) to be snapshottable. The snapshottable versions perform almost as well as the original versions and far outperform any previous implementations that support atomic range queries.
more » « less
Full Text Available
Smarter Atomic Smart Pointers: Safe and Efficient Concurrent Memory Management (Abstract)

https://doi.org/10.1145/3597635.3598027

Anderson, Daniel; Blelloch, Guy E.; Wei, Yuanhao (July 2023, ACM)

Full Text Available
Practically and Theoretically Efficient Garbage Collection for Multiversioning

https://doi.org/10.1145/3572848.3577508

Wei, Yuanhao; Blelloch, Guy E.; Fatourou, Panagiota; Ruppert, Eric (February 2023, Proceedings of the 28th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming)

Multiversioning is widely used in databases, transactional memory, and concurrent data structures. It can be used to support read-only transactions that appear atomic in the presence of concurrent update operations. Any system that maintains multiple versions of each object needs a way of efficiently reclaiming them.We experimentally compare various existing reclamation techniques by applying them to a multiversion tree and a multiversion hash table. Using insights from these experiments, we develop two new multiversion garbage collection (MVGC) techniques. These techniques use two novel concurrent version list data structures. Our experimental evaluation shows that our fastest technique is competitive with the fastest existing MVGC techniques, while using significantly less space on some workloads. Our new techniques provide strong theoretical bounds, especially on space usage. These bounds ensure that the schemes have consistent performance, avoiding the very high worst-case space usage of other techniques.
more » « less
Turning manual concurrent memory reclamation into automatic reference counting

https://doi.org/10.1145/3519939.3523730

Anderson, Daniel; Blelloch, Guy E.; Wei, Yuanhao (June 2022, Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation)

Safe memory reclamation (SMR) schemes are an essential tool for lock-free data structures and concurrent programming. However, manual SMR schemes are notoriously difficult to apply correctly, and automatic schemes, such as reference counting, have been argued for over a decade to be too slow for practical purposes. A recent wave of work has disproved this long-held notion and shown that reference counting can be as scalable as hazard pointers, one of the most common manual techniques. Despite these tremendous improvements, there remains a gap of up to 2x or more in performance between these schemes and faster manual techniques such as epoch-based reclamation (EBR). In this work, we first advance these ideas and show that in many cases, automatic reference counting can in fact be as fast as the fastest manual SMR techniques.We generalize our previous algorithm called Concurrent Deferred Reference Counting (CDRC) to obtain a method for converting any standard manual SMR technique into an automatic reference counting technique with a similar performance profile. Our second contribution is extending this framework to support weak pointers, which are reference-counted pointers that automatically break pointer cycles by not contributing to the reference count, thus addressing a common weakness in reference-counted garbage collection. Our experiments with a C++-library implementation show that our automatic techniques perform in line with their manual counterparts, and that our weak pointer implementation outperforms the best known atomic weak pointer library by up to an order of magnitude on high thread counts. All together, we show that the ease of use of automatic memory management can be achieved without significant cost to practical performance or general applicability.
more » « less
Lock-free locks revisited

https://doi.org/10.1145/3503221.3508433

Ben-David, Naama; Blelloch, Guy E.; Wei, Yuanhao (March 2022, Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming)

This paper presents a new and practical approach to lock-free locks based on helping, which allows the user to write code using fine-grained locks, but run it in a lock-free manner. Although lock-free locks have been suggested in the past, they are widely viewed as impractical, have some key limitations, and, as far as we know, have never been implemented. The paper presents some key techniques that make lock-free locks practical and more general. The most important technique is an approach to idempotence—i.e. making code that runs multiple times appear as if it ran once. The idea is based on using a shared log among processes running the same protected code. Importantly, the approach can be library based, requiring very little if any change to standard code—code just needs to use the idempotent versions of memory operations (load, store, LL/SC, allocation, free). We have implemented a C++ library called Flock based on the ideas. Flock allows lock-based data structures to run in either lock-free or blocking (traditional locks) mode. We implemented a variety of tree and list-based data structures with Flock and compare the performance of the lock-free and blocking modes under a variety of workloads. The lock-free mode is almost as fast as blocking mode under almost all workloads, and significantly faster when threads are oversubscribed (more threads than processors). We also compare with several existing lock-based and lock-free alternatives.
more » « less
FliT: a library for simple and efficient persistent algorithms

https://doi.org/10.1145/3503221.3508436

Wei, Yuanhao; Ben-David, Naama; Friedman, Michal; Blelloch, Guy E.; Petrank, Erez (March 2022, Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming)

Non-volatile random access memory (NVRAM) offers byte-addressable persistence at speeds comparable to DRAM. However, with caches remaining volatile, automatic cache evictions can reorder updates to memory, potentially leaving persistent memory in an inconsistent state upon a system crash. Flush and fence instructions can be used to force ordering among updates, but are expensive. This has motivated significant work studying how to write correct and efficient persistent programs for NVRAM. In this paper, we present FliT, a C++ library that facilitates writing efficient persistent code. Using the library's default mode makes any linearizable data structure durable with minimal changes to the code. FliT avoids many redundant flush instructions by using a novel algorithm to track dirty cache lines. It also allows for extra optimizations, but achieves good performance even in its default setting. To describe the FliT library's capabilities and guarantees, we define a persistent programming interface, called the P-V Interface, which FliT implements. The P-V Interface captures the expected behavior of code in which some instructions' effects are persisted and some are not. We show that the interface captures the desired semantics of many practical algorithms in the literature. We apply the FliT library to four different persistent data structures, and show that across several workloads, persistence implementations, and data structure sizes, the FliT library always improves operation throughput, by at least 2.1X over a naive implementation in all but one workload.
more » « less
Concurrent deferred reference counting with constant-time overhead

https://doi.org/10.1145/3453483.3454060

Anderson, Daniel; Blelloch, Guy E.; Wei, Yuanhao (June 2021, ACM/SIGPLAN Internaltional Conference on Programming Language Design and Implementation (PLDI))
null (Ed.)
Full Text Available
Constant-time snapshots with applications to concurrent data structures

https://doi.org/10.1145/3437801.3441602

Wei, Yuanhao; Ben-David, Naama; Blelloch, Guy E.; Fatourou, Panagiota; Ruppert, Eric; Sun, Yihan (February 2021, ACM/SIGPLAN Conference on Principles and Practice of Parallel Programming)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records