NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

AutoBurst: Autoscaling Burstable Instances for Cost-effective Latency SLOs

https://doi.org/10.1145/3698038.3698530

Hasan, Rubaba; Zhu, Timothy; Urgaonkar, Bhuvan (November 2024, ACM)

Free, publicly-accessible full text available November 20, 2025
TraceUpscaler: Upscaling Traces to Evaluate Systems at High Load

https://doi.org/10.1145/3627703.3629581

Sajal, Sultan Mahmud; Zhu, Timothy; Urgaonkar, Bhuvan; Sen, Siddhartha (April 2024, ACM)

Full Text Available
Multi-resource fair allocation for consolidated flash-based caching systems

https://doi.org/10.1145/3528535.3565245

Choi, Wonil; Urgaonkar, Bhuvan; Kandemir, Mahmut Taylan; Kesidis, George (November 2022, ACM/IFIP Middleware)

Full Text Available
On a Caching System with Object Sharing

https://doi.org/10.1145/3429881.3430107

Alfares, Nader; Kesidis, George; Li, Xi; Urgaonkar, Bhuvan; Kandemir, Mahmut; Konstantopoulos, Takis (December 2020, Workshop on Middleware and Applications for the Internet of Things)
null (Ed.)
Full Text Available
SplitServe: Efficiently Splitting Apache Spark Jobs Across FaaS and IaaS

https://doi.org/10.1145/3423211.3425695

Jain, Aman; Baarzi, Ata F.; Kesidis, George; Urgaonkar, Bhuvan; Alfares, Nader; Kandemir, Mahmut (December 2020, Middleware'20)
null (Ed.)
Full Text Available
Fair Write Attribution and Allocation for Consolidated Flash Cache

https://doi.org/10.1145/3373376.3378502

Choi, Wonil; Urgaonkar, Bhuvan; Kandemir, Mahmut; Jung, Myoungsoo; Evans, David (March 2020, Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems)

Consolidating multiple workloads on a single flash-based storage device is now a common practice. We identify a new problem related to lifetime management in such settings: how should one partition device resources among consolidated workloads such that their allowed contributions to the device's wear (resulting from their writes including hidden writes due to garbage collection) may be deemed fairly assigned? When flash is used as a cache/buffer, such fairness is important because it impacts what and how much traffic from various workloads may be serviced using flash which in turn affects their performance. We first clarify why the write attribution problem (i.e., which workload contributed how many writes) is non-trivial. We then present a technique for it inspired by the Shapley value, a classical concept from cooperative game theory, and demonstrate that it is accurate, fair, and feasible. We next consider how to treat an overall "write budget" (i.e., total allowable writes during a given time period) for the device as a first-class resource worthy of explicit management. Towards this, we propose a novel write budget allocation technique. Finally, we construct a dynamic lifetime management framework for consolidated devices by putting the above elements together. Our experiments using real-world workloads demonstrate that our write allocation and attribution techniques lead to performance fairness across consolidated workloads.
more » « less
Full Text Available
Scheduling Distributed Resources in Heterogeneous Private Clouds

https://doi.org/10.1109/MASCOTS.2018.00018

Kesidis, George; Shan, Yuquan; Jain, Aman; Urgaonkar, Bhuvan; Khamse-Ashari, Jalal; Lambadaris, Ioannis (September 2018, IEEE MASCOTS)

We first consider the static problem of allocating resources to (i.e., scheduling) multiple distributed application frameworks, possibly with different priorities and server preferences, in a private cloud with heterogeneous servers. Several fair scheduling mechanisms have been proposed for this purpose. We extend prior results on max-min fair (MMF) and proportional fair (PF) scheduling to this constrained multiresource and multiserver case for generic fair scheduling criteria. The task efficiencies (a metric related to proportional fairness) of max- min fair allocations found by progressive filling are compared by illustrative examples. In the second part of this paper, we consider the online problem (with framework churn) by implementing variants of these schedulers in Apache Mesos using progressive filling to dynamically approximate max-min fair allocations. We evaluate the implemented schedulers in terms of overall execution time of realistic distributed Spark workloads. Our experiments show that resource efficiency is improved and execution times are reduced when the scheduler is “server specific” or when it leverages characterized required resources of the workloads (when known).
more » « less
Full Text Available

Search for: All records