Is Function-as-a-Service a Good Fit for Latency-Critical Services?

Qiu, Haoran; Jha, Saurabh; Banerjee, Subho S.; Patke, Archit; Wang, Chen; Hubertus, Franke; Kalbarczyk, Zbigniew T.; Iyer, Ravishankar K.

doi:10.1145/3493651.3493666

Citation Details

Is Function-as-a-Service a Good Fit for Latency-Critical Services?

Function-as-a-Service (FaaS) is becoming an increasingly popular cloud-deployment paradigm for serverless computing that frees application developers from managing the infrastructure. At the same time, it allows cloud providers to assert control in workload consolidation, i.e., co-locating multiple containers on the same server, thereby achieving higher server utilization, often at the cost of higher end-to-end function request latency. Interestingly, a key aspect of serverless latency management has not been well studied: the trade-off between application developers' latency goals and the FaaS providers' utilization goals. This paper presents a multi-faceted, measurement-driven study of latency variation in serverless platforms that elucidates this trade-off space. We obtained production measurements by executing FaaS benchmarks on IBM Cloud and a private cloud to study the impact of workload consolidation, queuing delay, and cold starts on the end-to-end function request latency. We draw several conclusions from the characterization results. For example, increasing a container's allocated memory limit from 128 MB to 256 MB reduces the tail latency by 2× but has 1.75× higher power consumption and 59% lower CPU utilization. more »

Award ID(s):: 2029049

NSF-PAR ID:: 10358766

Author(s) / Creator(s):: Qiu, Haoran; Jha, Saurabh; Banerjee, Subho S.; Patke, Archit; Wang, Chen; Hubertus, Franke; Kalbarczyk, Zbigniew T.; Iyer, Ravishankar K.

Date Published:: 2021-12-06

Journal Name:: WoSC '21: Proceedings of the Seventh International Workshop on Serverless Computing (WoSC7) 2021

Page Range / eLocation ID:: 1 to 8

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3493651.3493666

More Like this