-
Motivated by engineering applications such as resource allocation in networks and inventory systems, we consider average-reward Reinforcement Learning with an unbounded state space and reward function. Recent work by Murthy et al. (2024) studied this problem in the actor-critic framework and established finite-sample bounds assuming access to a critic with certain error guarantees. We complement their work by studying Temporal Difference (TD) learning with linear function approximation and establishing finite-time bounds with the optimal sample complexity. These results are obtained using the following general-purpose theorem for non-linear Stochastic Approximation (SA): suppose one constructs a Lyapunov function for a non-linear SA satisfying a certain drift condition; then, under suitable conditions, our theorem establishes finite-time bounds when this SA is driven by unbounded Markovian noise. It serves as a black-box tool for generalizing sample guarantees on SA from the i.i.d. or martingale-difference setting to potentially unbounded Markovian noise. The generality and mild assumptions of the setup enable broad applicability of our theorem. We illustrate its power by studying two more systems: (i) we improve upon the finite-time bounds of Q-learning in Chen et al. (2024) by tightening the error bounds and allowing for a larger class of behavior policies, and (ii) we establish the first finite-time bounds for distributed stochastic optimization of a high-dimensional smooth strongly convex function using cyclic block coordinate descent.
Free, publicly-accessible full text available May 5, 2026
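As a rough illustration of the TD-learning setting analyzed above, the following minimal sketch implements a standard average-reward (differential) TD(0) update with linear features. The env and phi interfaces, step sizes, and variable names are hypothetical stand-ins and are not taken from the paper.

```python
import numpy as np

def avg_reward_td(env, phi, d, num_steps, alpha=0.01, beta=0.01):
    """Sketch of average-reward (differential) TD(0) with linear features.

    phi(s) -> length-d feature vector and env.step(s) -> (next_state, reward)
    are hypothetical interfaces; the step sizes are illustrative constants.
    """
    theta = np.zeros(d)        # weights of the linear value approximation
    r_bar = 0.0                # running estimate of the average reward
    s = env.reset()
    for _ in range(num_steps):
        s_next, r = env.step(s)
        # Differential TD error: the reward is centered by the average-reward estimate.
        delta = r - r_bar + phi(s_next) @ theta - phi(s) @ theta
        theta += alpha * delta * phi(s)     # value-weight (critic) update
        r_bar += beta * (r - r_bar)         # average-reward tracking update
        s = s_next
    return theta, r_bar
```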
-
We study the smoothed online quadratic optimization (SOQO) problem where, at each round t, a player plays an action x_t in response to a quadratic hitting cost and an additional squared ℓ2-norm cost for switching actions. This problem class has strong connections to a wide range of application domains, including smart grid management, adaptive control, and data center management, where switching-efficient algorithms are highly sought after. We study the SOQO problem in both adversarial and stochastic settings and, in the process, perform the first stochastic analysis of this class of problems. We provide the online optimal algorithm when the minimizers of the hitting cost function evolve as a general stochastic process, which, in the case of a martingale process, takes the form of a distribution-agnostic dynamic interpolation algorithm that we call Lazy Adaptive Interpolation (LAI). Next, we present the stochastic-adversarial trade-off by proving an Ω(T) expected regret for the adversarial optimal algorithm in the literature (ROBD) with respect to LAI, and a sub-optimal competitive ratio for LAI in the adversarial setting. Finally, we present a best-of-both-worlds algorithm that obtains robust adversarial performance while simultaneously achieving near-optimal stochastic performance.
Free, publicly-accessible full text available July 21, 2025
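To make the per-round structure concrete, the sketch below shows a ROBD-style closed-form action for an isotropic quadratic hitting cost with a squared ℓ2 switching cost, alongside a generic interpolation step between the previous action and the current minimizer. The parameters m, lam, and eta_t are illustrative assumptions; the specific interpolation weights used by LAI are defined in the paper and are not reproduced here.

```python
import numpy as np

def robd_step(x_prev, v_t, m, lam):
    """One ROBD-style round for the hitting cost f_t(x) = (m/2)||x - v_t||^2
    plus the switching cost (lam/2)||x - x_prev||^2.

    Setting the gradient m*(x - v_t) + lam*(x - x_prev) to zero gives the
    closed-form minimizer below; m and lam are illustrative constants.
    """
    return (m * v_t + lam * x_prev) / (m + lam)

def interpolation_step(x_prev, v_t, eta_t):
    """Generic interpolation between the previous action and the current
    hitting-cost minimizer v_t; LAI's actual weights depend on the remaining
    horizon and are not reproduced here."""
    return (1.0 - eta_t) * x_prev + eta_t * v_t
```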
-
We consider a large-scale parallel-server loss system with an unknown arrival rate, where each server is able to adjust its processing speed. The objective is to minimize the system cost, which consists of a power cost to maintain the servers' processing speeds and a quality-of-service cost depending on the tasks' processing times, among others. We draw on ideas from stochastic approximation to design a novel speed-scaling algorithm and prove that the servers' processing speeds converge to the globally asymptotically optimal value. Curiously, the algorithm is fully distributed and does not require any communication between servers. Apart from the algorithm design, a key contribution of our approach lies in demonstrating how concepts from the stochastic approximation literature can be leveraged to effectively tackle learning problems in large-scale, distributed systems. En route, we also analyze the performance of a fully heterogeneous parallel-server loss system, where each server has a distinct processing speed, which might be of independent interest.
Free, publicly-accessible full text available June 11, 2025
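The sketch below gives one plausible instantiation of a distributed, communication-free speed update: each server runs a Kiefer-Wolfowitz-style stochastic-approximation scheme on its own locally observed cost. The observe_cost interface, perturbation schedule, and step sizes are hypothetical stand-ins, not the paper's algorithm.

```python
def speed_scaling_sa(observe_cost, mu0=1.0, num_rounds=10_000, c=0.1, a=1.0):
    """Sketch of a single server's speed update via stochastic approximation.

    observe_cost(mu) is a hypothetical noisy sample of the server's cost
    (power cost plus processing-time cost) when running at speed mu; the
    perturbation and step-size schedules below are illustrative choices.
    """
    mu = mu0
    for k in range(1, num_rounds + 1):
        delta = c / k ** 0.25              # shrinking perturbation size
        step = a / k                       # shrinking step size
        # Two-sided finite-difference estimate of the local cost gradient.
        g_hat = (observe_cost(mu + delta) - observe_cost(mu - delta)) / (2 * delta)
        mu = max(mu - step * g_hat, 1e-3)  # descend while keeping the speed positive
    return mu
```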
-
This paper develops a unified Lyapunov framework for finite-sample analysis of a Markovian stochastic approximation (SA) algorithm under a contraction operator with respect to an arbitrary norm. The main novelty lies in the construction of a valid Lyapunov function called the generalized Moreau envelope. The smoothness and an approximation property of the generalized Moreau envelope enable us to derive a one-step Lyapunov drift inequality, which is the key to establishing the finite-sample bounds. Our SA result has wide applications, especially in the context of reinforcement learning (RL). Specifically, we show that a large class of value-based RL algorithms can be modeled in the exact form of our Markovian SA algorithm. Therefore, our SA results immediately imply finite-sample guarantees for popular RL algorithms such as n-step temporal difference (TD) learning, TD(λ), off-policy V-trace, and Q-learning. As byproducts, by analyzing the convergence bounds of n-step TD and TD(λ), we provide theoretical insight into the efficiency of bootstrapping. Moreover, our finite-sample bounds for off-policy V-trace explicitly capture the trade-off between the variance of the stochastic iterates and the bias in the limit.
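As a concrete instance of the contractive Markovian SA template described above, the sketch below shows tabular Q-learning, whose Bellman optimality operator is a γ-contraction in the sup-norm. The env interface and the uniformly random behavior policy are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def q_learning(env, num_states, num_actions, num_steps, gamma=0.95, alpha=0.1, seed=0):
    """Q-learning as an instance of contractive SA driven by Markovian noise.

    env.reset() -> initial state and env.step(s, a) -> (next_state, reward)
    are hypothetical interfaces; the behavior policy is uniformly random.
    """
    rng = np.random.default_rng(seed)
    Q = np.zeros((num_states, num_actions))
    s = env.reset()
    for _ in range(num_steps):
        a = int(rng.integers(num_actions))        # exploratory behavior policy
        s_next, r = env.step(s, a)
        # Asynchronous SA step toward the empirical Bellman optimality operator.
        Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
        s = s_next
    return Q
```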