Search results: All records where Award ID contains 2118745


  1. Benchmark and system parameters often have a significant impact on performance evaluation, which raises a long-standing question: which settings should we use? This paper studies the feasibility and benefits of extensive evaluation. A full extensive evaluation, which tests all possible settings, is usually too expensive. This work investigates whether it is possible to sample a subset of the settings and use them to generate observations that match those from a full extensive evaluation. Toward this goal, we have explored the incremental sampling approach, which starts by measuring a small subset of random settings, builds a prediction model on these samples using the popular ANOVA approach, adds more samples if the model is not accurate enough, and terminates otherwise. To summarize our findings: 1) Enhancing a research prototype to support extensive evaluation mostly involves changing hard-coded configurations, which does not take much effort. 2) Some systems are highly predictable, meaning they can achieve accurate predictions with a low sampling rate, while others are less predictable. 3) We have not found a method that consistently outperforms random sampling + ANOVA. Based on these findings, we provide recommendations for improving artifact predictability and strategies for selecting parameter values during evaluation.
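     The loop described above is straightforward to sketch. Below is a minimal, hypothetical Python illustration of incremental sampling with an additive (main-effects) ANOVA-style predictor; the measure function, batch size, and stopping threshold are stand-ins, not the paper's actual artifacts or criteria.

     import itertools, random
     import numpy as np

     def anova_predict(samples):
         # Additive main-effects model: prediction = grand mean plus a
         # per-parameter effect for each parameter's chosen level.
         grand = np.mean([y for _, y in samples])
         n_params = len(samples[0][0])
         effects = []
         for i in range(n_params):
             by_level = {}
             for cfg, y in samples:
                 by_level.setdefault(cfg[i], []).append(y)
             effects.append({lvl: np.mean(ys) - grand
                             for lvl, ys in by_level.items()})
         return lambda cfg: grand + sum(effects[i].get(cfg[i], 0.0)
                                        for i in range(n_params))

     def incremental_sample(param_space, measure, batch=20, tol=0.10):
         # param_space: one list of candidate values per parameter.
         cfgs = list(itertools.product(*param_space))
         random.shuffle(cfgs)
         samples, predict = [], None
         while cfgs:
             new = [(c, measure(c)) for c in cfgs[:batch]]  # measure a batch
             del cfgs[:batch]
             if predict is not None:
                 # Accuracy of the current model on settings it has not seen.
                 errs = [abs(predict(c) - y) / max(abs(y), 1e-9)
                         for c, y in new]
                 if np.mean(errs) < tol:
                     break                # accurate enough: stop sampling
             samples += new
             predict = anova_predict(samples)
         return predict, len(samples)

     In these terms, a highly predictable system is one where this loop terminates at a low sampling rate.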
  2. Distributed data stores typically provide weak isolation levels, which are efficient but can lead to unserializable behaviors that are hard for programmers to understand and often result in errors. This paper presents IsoPredict, the first dynamic predictive analysis for data store applications under weak isolation levels. Given an observed serializable execution of a data store application, IsoPredict generates and solves SMT constraints to find an unserializable execution that is nonetheless feasible for the application. IsoPredict introduces novel techniques to handle divergent application behavior, to solve mutually recursive sets of constraints, and to balance coverage, precision, and performance. An evaluation shows that IsoPredict finds unserializable behaviors in four data store benchmarks, and that more than 99% of its predicted executions are feasible.
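     To make the constraint-solving idea concrete, here is a toy encoding with the z3-solver Python bindings in the spirit of IsoPredict, not its actual constraint system: a classic write-skew outcome is asserted, and the solver searches for an execution that no serial order can explain. The two transactions and the visibility variables are illustrative assumptions.

     from z3 import Bool, Int, Solver, If, Or, Not, sat

     s = Solver()
     # Two transactions over keys initialized to 0:
     #   T1: read y, then write x := 1      T2: read x, then write y := 1
     vis12, vis21 = Bool('vis12'), Bool('vis21')  # did Tj observe Ti's write?
     r1_y, r2_x = Int('r1_y'), Int('r2_x')
     s.add(r1_y == If(vis21, 1, 0))  # each read returns the write it observed
     s.add(r2_x == If(vis12, 1, 0))  # (or the initial value 0 otherwise)
     # Any serial order of two transactions makes the later one observe the
     # earlier one, so an execution with no visibility in either direction
     # is unserializable. Ask the solver whether such an execution exists.
     s.add(Not(Or(vis12, vis21)))
     if s.check() == sat:
         print(s.model())  # r1_y = 0, r2_x = 0: a predicted write-skew anomaly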
  3. The success of AlphaGo Zero shows that a computer can learn to play a complicated board game without relying on knowledge from human players. We observe that designing a distributed protocol is, to some extent, similar to playing a board game: when determining the next action to take, both want to ensure they can win even when a smart opponent tries to drive the game/protocol to the worst case. In this work, we explore whether similar techniques can be applied to learn a distributed protocol with zero knowledge. Toward this goal, we model each process in a distributed protocol as a state machine and rely on model checking to validate the correctness of the learned state machine. With this approach, we successfully learned a correct atomic commit protocol with three processes, and building on that result, we discuss future work.
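     A minimal sketch of the validation half of this loop follows, with assumptions clearly labeled: the safety property is atomic commit's all-or-nothing rule, and the deliberately broken candidate state machine is invented here to show how a model-checking counterexample would steer the learner away from an incorrect protocol.

     def model_check(step, init, is_bad, max_depth=20):
         # Exhaustive breadth-first search of reachable global states;
         # returns a counterexample trace if a bad state is reachable.
         frontier, seen = [(init, [init])], {init}
         for _ in range(max_depth):
             nxt = []
             for state, trace in frontier:
                 for succ in step(state):
                     if succ in seen:
                         continue
                     seen.add(succ)
                     if is_bad(succ):
                         return trace + [succ]
                     nxt.append((succ, trace + [succ]))
             frontier = nxt
         return None  # no violation found within the depth bound

     def is_bad(gs):
         # Atomic commit safety: no mixed commit/abort outcomes.
         return 'committed' in gs and 'aborted' in gs

     def unilateral_step(gs):
         # A deliberately wrong learned candidate: any process in 'init'
         # may unilaterally commit or abort, ignoring the others.
         for i, st in enumerate(gs):
             if st == 'init':
                 yield gs[:i] + ('committed',) + gs[i + 1:]
                 yield gs[:i] + ('aborted',) + gs[i + 1:]

     # Three processes; the checker finds the atomicity violation, which a
     # learner would use to rule out this candidate state machine.
     print(model_check(unilateral_step, ('init',) * 3, is_bad))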
  4. User-associated content plays an increasingly important role in modern network applications. With growing deployments of edge servers, the content-storage capacity of edge clusters has increased significantly, creating great potential to satisfy content requests with much shorter latency. However, the sheer volume of content makes it difficult to search for content across edge servers in different locations, because indexing it consumes a large amount of DRAM on each edge server. In this work, we explore the opportunity of efficiently indexing user-associated content and propose EdgeCut, a scalable content-sharing mechanism for edge servers that significantly reduces content access latency by allowing many edge servers to share their cached contents. We design a compact and dynamic data structure called Ludo Locator, which returns the IP address of the edge server that stores the requested user-associated content. We have implemented a prototype of EdgeCut in a real network environment running in a public geo-distributed cloud. The experimental results show that EdgeCut reduces content access latency by up to 50% and cloud traffic by up to 50% compared to existing solutions, with a memory cost of less than 50 MB for 10 million mobile users. Simulations using real network latency data show EdgeCut's advantages over existing solutions at large scale.
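     At the interface level, a locator like Ludo Locator behaves as sketched below. The Python class and names are hypothetical; a plain dict stands in for the real structure, whose whole point is to avoid a hash table's memory cost by compressing the key-to-server mapping into a few bits per key.

     from typing import Optional

     class ContentLocator:
         # Interface sketch only: the real Ludo Locator is compact and
         # dynamic; a dict like this would far exceed 50 MB for
         # 10 million users.
         def __init__(self):
             self._table = {}                      # content key -> server IP

         def insert(self, content_key: str, server_ip: str) -> None:
             self._table[content_key] = server_ip  # register cached content

         def locate(self, content_key: str) -> Optional[str]:
             # Edge servers consult the locator before going to the cloud.
             return self._table.get(content_key)

     loc = ContentLocator()
     loc.insert('user123/profile', '10.0.4.17')
     print(loc.locate('user123/profile'))  # '10.0.4.17' -> serve from edge
     print(loc.locate('user999/feed'))     # None -> fall back to the cloud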