- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources5
- Resource Type
-
05000000000
- More
- Availability
-
50
- Author / Contributor
- Filter by Author / Creator
-
-
Xu, Tianyin (4)
-
Sun, Xudong (3)
-
Gu, Jiawei Tyler (2)
-
Alagappan, Ramnatthan (1)
-
Bhandari, Chaitanya (1)
-
Chen, Yinfang (1)
-
Ganesan, Aishwarya (1)
-
Gasch, Michael (1)
-
Gupta, Indranil (1)
-
Ji, Shuyang (1)
-
Jiang, Yuxuan (1)
-
Karanika, Anna (1)
-
Legunsen, Owolabi (1)
-
Lian, Xinyu (1)
-
Luo, Wenqing (1)
-
Marinov, Darko (1)
-
Nath, Suman (1)
-
Suresh, Lalith (1)
-
Tang, Lilia (1)
-
Vaziri, Mandana (1)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Wang, Shuai ; Lian, Xinyu ; Marinov, Darko ; Xu, Tianyin. ( , 45th IEEE/ACM International Conference on Software Engineering (ICSE '23))
-
Tang, Lilia ; Bhandari, Chaitanya ; Zhang, Yongle ; Karanika, Anna ; Ji, Shuyang ; Gupta, Indranil ; Xu, Tianyin ( , 18th European Conference on Computer Systems (EuroSys '23))
-
Chen, Yinfang ; Sun, Xudong ; Nath, Suman ; Yang, Ze ; Xu, Tianyin ( , 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI '23))
-
Sun, Xudong ; Luo, Wenqing ; Gu, Jiawei Tyler ; Ganesan, Aishwarya ; Alagappan, Ramnatthan ; Gasch, Michael ; Suresh, Lalith ; Xu, Tianyin ( , Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI'22))Modern cluster managers like Borg, Omega and Kubernetes rely on the state-reconciliation principle to be highly resilient and extensible. In these systems, all cluster-management logic is embedded in a loosely coupled collection of microservices called controllers. Each controller independently observes the current cluster state and issues corrective actions to converge the cluster to a desired state. However, the complex distributed nature of the overall system makes it hard to build reliable and correct controllers – we find that controllers face myriad reliability issues that lead to severe consequences like data loss, security vulnerabilities, and resource leaks. We present Sieve, the first automatic reliability-testing tool for cluster-management controllers. Sieve drives controllers to their potentially buggy corners by systematically and extensively perturbing the controller’s view of the current cluster state in ways it is expected to tolerate. It then compares the cluster state’s evolution with and without perturbations to detect safety and liveness issues. Sieve’s design is powered by a fundamental opportunity in state-reconciliation systems – these systems are based on state-centric interfaces between the controllers and the cluster state; such interfaces are highly transparent and thereby enable fully-automated reliability testing. To date, Sieve has efficiently found 46 serious safety and liveness bugs (35 confirmed and 22 fixed) in ten popular controllers with a low false-positive rate of 3.5%.more » « less