A fault-tolerance shim for serverless computing

Sreekanti, Vikram; Wu, Chenggang; Chhatrapati, Saurav; Gonzalez, Joseph E.; Hellerstein, Joseph M.; Faleiro, Jose M.

doi:10.1145/3342195.3387535

Citation Details

A fault-tolerance shim for serverless computing

Serverless computing has grown in popularity in recent years, with an increasing number of applications being built on Functions-as-a-Service (FaaS) platforms. By default, FaaS platforms support retry-based fault tolerance, but this is insufficient for programs that modify shared state, as they can unwittingly persist partial sets of updates in case of failures. To address this challenge, we would like atomic visibility of the updates made by a FaaS application. In this paper, we present aft, an atomic fault tolerance shim for serverless applications. aft interposes between a commodity FaaS platform and storage engine and ensures atomic visibility of updates by enforcing the read atomic isolation guarantee. aft supports new protocols to guarantee read atomic isolation in the serverless setting. We demonstrate that aft introduces minimal overhead relative to existing storage engines and scales smoothly to thousands of requests per second, while preventing a significant number of consistency anomalies. more »

Award ID(s):: 1730628

PAR ID:: 10221266

Author(s) / Creator(s):: Sreekanti, Vikram; Wu, Chenggang; Chhatrapati, Saurav; Gonzalez, Joseph E.; Hellerstein, Joseph M.; Faleiro, Jose M.

Date Published:: 2020-04-15

Journal Name:: EuroSys '20: Proceedings of the Fifteenth European Conference on Computer Systems

Page Range / eLocation ID:: 1 to 15

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3342195.3387535

More Like this