This content will become publicly available on July 7, 2026

Title: Kamino: Efficient VM Allocation at Scale with Latency-Driven Cache-Aware Scheduling
In virtual machine (VM) allocation systems, caching repetitive and similar VM allocation requests and the associated resolution rules is crucial for reducing computational costs and meeting strict latency requirements. While modern allocation systems distribute requests among multiple allocator agents and use caching to improve performance, current schedulers often neglect the cache state and latency considerations when assigning each new request to an agent. Because the costs of cache hits and misses vary widely, and updating the caches adds further processing overhead, simple load-balancing and cache-aware mechanisms result in high latencies. We introduce Kamino, a high-performance, latency-driven, and cache-aware request scheduling system aimed at minimizing end-to-end latencies. Kamino employs a novel, theoretically grounded scheduling algorithm that uses partial indicators of the cache state to assign each new request to the agent with the lowest estimated latency. Evaluation of Kamino using a high-fidelity simulator on large-scale production workloads shows a 42% reduction in average request latencies. Our deployment of Kamino in the control plane of a large public cloud confirms these improvements, with a 33% decrease in cache miss rates and a 17% reduction in memory usage.
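The abstract describes the scheduling rule only at a high level, and the paper itself is not yet publicly available, so the following Python sketch is just one plausible reading of latency-driven, cache-aware dispatch: assign each request to the agent whose estimated backlog plus expected hit-or-miss service time is smallest, using a partial view of each agent's cache. The Agent class, the cost constants, and estimate_latency are illustrative assumptions, not Kamino's actual interfaces or costs.

```python
import random

# Illustrative cost constants (assumptions, not values from the paper):
HIT_COST = 1.0        # processing cost of a cache hit
MISS_COST = 20.0      # full rule resolution on a cache miss
UPDATE_COST = 3.0     # overhead of installing the resolved result in the cache

class Agent:
    """One allocator agent with its own rule cache and work queue."""
    def __init__(self, name):
        self.name = name
        self.cache = set()     # partial indicator: request keys believed to be cached
        self.backlog = 0.0     # estimated queued work, in the same cost units

    def estimate_latency(self, key):
        """Estimated end-to-end latency if this request were sent here."""
        service = HIT_COST if key in self.cache else MISS_COST + UPDATE_COST
        return self.backlog + service

    def dispatch(self, key):
        """Accept the request: account for its cost and update the cache view.
        (A real scheduler would also drain backlog as agents finish work.)"""
        cost = HIT_COST if key in self.cache else MISS_COST + UPDATE_COST
        self.backlog += cost
        self.cache.add(key)
        return cost

def schedule(agents, key):
    """Latency-driven, cache-aware assignment: send the request to the agent
    with the lowest estimated latency rather than to the least-loaded one."""
    best = min(agents, key=lambda a: a.estimate_latency(key))
    return best, best.dispatch(key)

if __name__ == "__main__":
    random.seed(0)
    agents = [Agent(f"agent-{i}") for i in range(4)]
    requests = [random.choice("abcdefgh") for _ in range(50)]   # repetitive requests
    total = sum(schedule(agents, k)[1] for k in requests)
    print(f"total estimated latency: {total:.1f}")
```

The decision rule is the point of the sketch: neither pure load balancing (ignoring the cache) nor pure cache affinity (ignoring the queue) minimizes the estimated end-to-end latency of the request being placed.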
Award ID(s):
2231724
PAR ID:
10636717
Author(s) / Creator(s):
Publisher / Repository:
ACM
Date Published:
ISBN:
978-1-939133-47-2
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. 2022 USENIX Annual Technical Conference (Ed.)
    Caches are pervasively used in content delivery networks (CDNs) to serve requests close to users and thus reduce content access latency. However, designing latency-optimal caches is challenging in the presence of delayed hits, which occur in high-throughput systems when multiple requests for the same content arrive before the content is fetched from the remote server. In this paper, we propose a novel timer-based mechanism that provably optimizes the mean caching latency, providing a theoretical basis for the understanding and design of latency-aware (LA) caching, which is fundamental to content delivery in latency-sensitive systems. Our timer-based model yields a simple ranking function that quickly tells us the priority of a content item with respect to our goal of minimizing latency. Based on this ranking, we propose a lightweight latency-aware caching algorithm named LA-Cache. We have implemented a prototype within Apache Traffic Server, a popular CDN server. The latency achieved by our implementation agrees closely with the theoretical predictions of our model. Our experimental results using production traces show that LA-Cache consistently reduces latencies by 5%-15% compared to state-of-the-art methods, depending on the backend RTTs.
  2. The increased use of micro-services to build web applications has spurred the rapid growth of Function-as-a-Service (FaaS) or serverless computing platforms. While FaaS simplifies provisioning and scaling for application developers, it introduces new challenges in resource management that need to be handled by the cloud provider. Our analysis of popular serverless workloads indicates that schedulers need to handle functions that are very short-lived, have unpredictable arrival patterns, and require expensive setup of sandboxes. The challenge of running a large number of such functions in a multi-tenant cluster makes existing scheduling frameworks unsuitable. We present Archipelago, a platform that enables low-latency request execution in a multi-tenant serverless setting. Archipelago views each application as a DAG of functions, and every DAG is associated with a latency deadline. Archipelago achieves its per-DAG request latency goals by: (1) partitioning a given cluster into a number of smaller worker pools, and associating each pool with a semi-global scheduler (SGS), (2) using a latency-aware scheduler within each SGS along with proactive sandbox allocation to reduce overheads, and (3) using a load balancing layer to route requests for different DAGs to the appropriate SGS, and automatically scale the number of SGSs per DAG. Our testbed results show that Archipelago meets the latency deadline for more than 99% of realistic application request workloads, and reduces tail latencies by up to 36X compared to state-of-the-art serverless platforms.
  3. Caching systems using the Least Recently Used (LRU) principle have now become ubiquitous. A fundamental question for these systems is whether the cache space should be pooled together or divided to serve multiple flows of data item requests in order to minimize the miss probabilities. In this paper, we show that there is no straight yes-or-no answer to this question; the answer depends on complex combinations of critical factors, including, e.g., request rates, overlap of data items across different request flows, data item popularities, and data item sizes. To this end, we characterize the performance of multiple flows of data item requests under resource pooling and separation for LRU caching when the cache size is large. Analytically, we show that it is asymptotically optimal to jointly serve multiple flows if their data item sizes and popularity distributions are similar and their arrival rates do not differ significantly; the self-organizing property of LRU caching automatically optimizes the resource allocation among them asymptotically. Otherwise, separating these flows could be better, e.g., when data sizes vary significantly (a toy simulation contrasting pooling and separation appears after this list). We also quantify critical points beyond which resource pooling is better than separation for each of the flows when the overlapped data items exceed certain levels. Technically, for a broad class of heavy-tailed distributions we derive the asymptotic miss probabilities of multiple flows of requests with varying data item sizes in a shared LRU cache space. It also validates the characteristic time approximation under
  4. Caches are at the heart of latency-sensitive systems. In this paper, we identify a growing challenge for the design of latency-minimizing caches called delayed hits. Delayed hits occur at high throughput, when multiple requests to the same object queue up before an outstanding cache miss is resolved. This effect increases latencies beyond the predictions of traditional caching models and simulations; in fact, caching algorithms are designed as if delayed hits simply didn't exist. We show that traditional caching strategies -- even so-called 'optimal' algorithms -- can fail to minimize latency in the presence of delayed hits. We design a new, latency-optimal offline caching algorithm called belatedly, which reduces average latencies by up to 45% compared to the traditional, hit-rate-optimal Belady's algorithm. Using belatedly as our guide, we show that incorporating an object's 'aggregate delay' into online caching heuristics can improve latencies for practical caching systems by up to 40%. We implement a prototype, Minimum-AggregateDelay (mad), within a CDN caching node. Using a CDN production trace and backends deployed in different geographic locations, we show that mad can reduce latencies by 12-18% depending on the backend RTTs (a toy delayed-hits simulation illustrating the aggregate-delay idea appears after this list).
  5. We study an online caching problem in which requests can be served by a local cache to avoid retrieval costs from a remote server. The cache can update its state after a batch of requests and store an arbitrarily small fraction of each file. We study no-regret algorithms based on Online Mirror Descent (OMD) strategies. We show that bounds for the regret crucially depend on the diversity of the request process, captured by the diversity ratio R/h, where R is the size of the batch and h is the maximum multiplicity of a request in a given batch. We characterize the optimality of OMD caching policies with respect to regret under different diversity regimes. We also prove that, when the cache must store the entire file rather than a fraction, OMD strategies can be coupled with a randomized rounding scheme that preserves regret guarantees, even when update costs cannot be neglected. We provide a formal characterization of the rounding problem through optimal transport theory, and we propose a computationally efficient randomized rounding scheme (a minimal OMD-style sketch for the fractional case appears after this list).
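The pooling-versus-separation question in item 3 can be made concrete with a toy experiment: replay two request flows once against a shared LRU cache and once against two caches of half the size, and compare miss ratios. The catalog sizes, Zipf skews, and cache sizes below are assumptions chosen only to illustrate the trade-off; the paper's actual results are asymptotic and analytical, not simulation-based.

```python
import random
from collections import OrderedDict

class LRUCache:
    """Minimal LRU cache that counts hits and misses."""
    def __init__(self, capacity):
        self.capacity, self.store = capacity, OrderedDict()
        self.hits = self.misses = 0

    def request(self, key):
        if key in self.store:
            self.store.move_to_end(key)
            self.hits += 1
        else:
            self.misses += 1
            self.store[key] = True
            if len(self.store) > self.capacity:
                self.store.popitem(last=False)   # evict least recently used

def zipf_trace(prefix, n_items, alpha, length, rng):
    """A Zipf-like trace over items '<prefix>0' .. '<prefix>{n_items-1}'."""
    weights = [1.0 / (i + 1) ** alpha for i in range(n_items)]
    return [f"{prefix}{rng.choices(range(n_items), weights)[0]}" for _ in range(length)]

rng = random.Random(1)
# Two flows with different popularity skew over disjoint catalogs (assumed parameters).
flow_a = zipf_trace("a", 1000, 1.2, 20000, rng)
flow_b = zipf_trace("b", 1000, 0.6, 20000, rng)

# Pooled: one cache of size 200 serves an interleaving of both flows.
pooled = LRUCache(200)
for ka, kb in zip(flow_a, flow_b):
    pooled.request(ka)
    pooled.request(kb)

# Separated: each flow gets half the space.
sep_a, sep_b = LRUCache(100), LRUCache(100)
for k in flow_a: sep_a.request(k)
for k in flow_b: sep_b.request(k)

total = len(flow_a) + len(flow_b)
print("pooled miss ratio:   ", pooled.misses / total)
print("separated miss ratio:", (sep_a.misses + sep_b.misses) / total)
```

Varying the skews, catalog overlap, and arrival rates in this toy setup is a quick way to see the regimes the paper characterizes analytically.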
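The delayed-hit setting of items 1 and 4 is easy to reproduce in a toy discrete-time simulator: a request that arrives while a fetch for the same object is still outstanding waits only for the remainder of that fetch, and the eviction rule can take into account how much latency each object has already accumulated. The fetch latency, cache size, trace, and the specific eviction rule below are all assumptions for illustration; this is not the paper's belatedly or MAD implementation.

```python
from collections import defaultdict

FETCH_LATENCY = 8    # assumed time steps to fetch an object from the backend
CACHE_SIZE = 3       # assumed cache capacity, in objects

def simulate(trace, evict):
    """Tiny discrete-time cache model with delayed hits.

    `trace` holds one requested object id per time step; `evict(cache, agg_delay)`
    picks a victim when a completed fetch needs space. Returns total latency.
    """
    cache = set()
    outstanding = {}                  # object -> time its fetch completes
    agg_delay = defaultdict(float)    # latency each object has cost us so far
    total = 0.0

    for t, obj in enumerate(trace):
        # Install any fetches that have completed by time t.
        for o, done in list(outstanding.items()):
            if done <= t:
                del outstanding[o]
                if len(cache) >= CACHE_SIZE:
                    cache.discard(evict(cache, agg_delay))
                cache.add(o)

        if obj in cache:
            latency = 0.0                    # true hit
        elif obj in outstanding:
            latency = outstanding[obj] - t   # delayed hit: wait for the in-flight fetch
        else:
            latency = FETCH_LATENCY          # miss: start a new fetch
            outstanding[obj] = t + FETCH_LATENCY

        agg_delay[obj] += latency
        total += latency
    return total

def evict_least_aggregate_delay(cache, agg_delay):
    # Loose stand-in for the aggregate-delay idea: keep the objects that have
    # already cost the most latency, evict the one that has cost the least.
    return min(cache, key=lambda o: agg_delay[o])

if __name__ == "__main__":
    trace = list("abcabcabcddddabcaeea")
    print("total latency:", simulate(trace, evict_least_aggregate_delay))
```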
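The fractional-caching setting of item 5 can likewise be sketched: keep a cache vector x in [0,1]^N with sum(x) = k, observe a batch of R requests, take a gradient step on the linear "requests served locally" gain, and project back onto the capacity constraint. The sketch below uses the Euclidean mirror map (plain projected gradient ascent), an assumed Zipf-like request process, and assumed constants; the paper's analysis of general mirror maps, the R/h diversity regimes, and the randomized rounding scheme are not reproduced here.

```python
import numpy as np

# Assumed constants: catalog size, cache capacity, batch size, learning rate.
N, K, R, ETA = 10, 3, 5, 0.1
rng = np.random.default_rng(0)

def project_capped_simplex(y, k, lo=0.0, hi=1.0):
    """Euclidean projection of y onto {x : lo <= x <= hi, sum(x) = k},
    via bisection on the Lagrange multiplier."""
    a, b = y.min() - hi, y.max() - lo
    for _ in range(60):
        mu = 0.5 * (a + b)
        if np.clip(y - mu, lo, hi).sum() > k:
            a = mu
        else:
            b = mu
    return np.clip(y - 0.5 * (a + b), lo, hi)

x = np.full(N, K / N)        # start from a uniform fractional cache
served, n_requests = 0.0, 0
for t in range(200):
    batch = rng.zipf(1.3, size=R) % N                      # one batch of R requests
    counts = np.bincount(batch, minlength=N).astype(float)
    served += counts @ x                                   # fraction of each request served locally
    n_requests += len(batch)
    # Gradient step on the linear gain, then project back onto the capacity constraint.
    x = project_capped_simplex(x + ETA * counts, K)

print(f"fraction of requests served (fractionally) from cache: {served / n_requests:.2f}")
```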