NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multi-Agent Reinforcement Learning with Serverless Computing

https://doi.org/10.1145/3772052.3772227

Wei, Rui; Yu, Hanfei; Song, Xikang; Li, Jian; Tiwari, Devesh; Mao, Ying; Wang, Hao (November 2025, ACM Symposium on Cloud Computing 2025)

Free, publicly-accessible full text available November 19, 2026
Nitro: Boosting Distributed Reinforcement Learning with Serverless Computing

https://doi.org/10.14778/3696435.3696441

Yu, Hanfei; Carter, Jacob; Wang, Hao; Tiwari, Devesh; Li, Jian; Park, Seung-Jong (September 2025, Proceedings of the VLDB Endowment)

Deep reinforcement learning (DRL) has demonstrated significant potential in various applications, including gaming AI, robotics, and system scheduling. DRL algorithms produce, sample, and learn from training data online through a trial-and-error process, demanding considerable time and computational resources. To address this, distributed DRL algorithms and paradigms have been developed to expedite training using extensive resources. Through carefully designed experiments, we are the first to observe that strategically increasing the actor-environment interactions by spawning more concurrent actors at certain training rounds within ephemeral time frames can significantly enhance training efficiency. Yet, current distributed DRL solutions, which are predominantly server-based (or serverful), fail to capitalize on these opportunities due to their long startup times, limited adaptability, and cumbersome scalability. This paper proposesNitro, a generic training engine for distributed DRL algorithms that enforces timely and effective boosting with concurrent actors instantaneously spawned by serverless computing. With serverless functions,Nitroadjusts data sampling strategies dynamically according to the DRL training demands.Nitroseizes the opportunity of real-time boosting by accurately and swiftly detecting an empirical metric. To achieve cost efficiency, we design a heuristic actor scaling algorithm to guideNitrofor cost-aware boosting budget allocation. We integrateNitrowith state-of-the-art DRL algorithms and frameworks and evaluate them on AWS EC2 and Lambda. Experiments with Mujoco and Atari benchmarks show thatNitroimproves the final rewards (i.e., training quality) by up to 6× and reduces training costs by up to 42%.
more » « less
Free, publicly-accessible full text available September 1, 2026
Stellaris: Staleness-Aware Distributed Reinforcement Learning with Serverless Computing

https://doi.org/10.1109/SC41406.2024.00045

Yu, Hanfei; Wang, Hao; Tiwari, Devesh; Li, Jian; Park, Seung-Jong (November 2024, IEEE)

Full Text Available
RainbowCake: Mitigating Cold-starts in Serverless with Layer-wise Container Caching and Sharing

https://doi.org/10.1145/3617232.3624871

Yu, Hanfei; Basu_Roy, Rohan; Fontenot, Christian; Tiwari, Devesh; Li, Jian; Zhang, Hong; Wang, Hao; Park, Seung-Jong (April 2024, ACM)

Full Text Available
The globus compute dataset: An open function-as-a-service dataset from the edge to the cloud

https://doi.org/10.1016/j.future.2023.12.007

Bauer, André; Pan, Haochen; Chard, Ryan; Babuji, Yadu; Bryan, Josh; Tiwari, Devesh; Foster, Ian; Chard, Kyle (April 2024, Future Generation Computer Systems)

Full Text Available
SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices

https://doi.org/10.1145/3613424.3623771

Li, Zhengang; Yuan, Geng; Yamauchi, Tomoharu; Masoud, Zabihi; Xie, Yanyue; Dong, Peiyan; Tang, Xulong; Yoshikawa, Nobuyuki; Tiwari, Devesh; Wang, Yanzhi; et al (October 2023, ACM)

Full Text Available
Quantum Computing Reliability: Problems, Tools, and Potential Solutions

https://doi.org/10.1109/DSN-S58398.2023.00015

Giusto, Edoardo; Dri, Emanuele; Montrucchio, Bartolomeo; Baheri, Betis; Guan, Qiang; Tiwari, Devesh; Rech, Paolo (June 2023, DSN)

Full Text Available
IceBreaker: warming serverless functions better with heterogeneity

https://doi.org/10.1145/3503222.3507750

Roy, Rohan Basu; Patel, Tirthak; Tiwari, Devesh (February 2022, ASPLOS '22: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

Full Text Available
Do Temperature and Humidity Exposures Hurt or Benefit Your SSDs?

https://doi.org/10.23919/DATE54114.2022.9774582

Maruf, Adnan; Brahmakshatriya, Sashri; Li, Baolin; Tiwari, Devesh; Quan, Gang; Bhimani, Janki (January 2022, Design, Automation and Test in Europe Conference. The European Event for Electronic System Design and Test (DATE’22))

Full Text Available
AI-driven Storage Resource Provisioning and Operations: Revisiting Old Assumptions and Meeting New Expectations.

Anantharaj, Valentine; da Silva, Rafael Ferreira; Butt, Ali R.; Oral, Sarp; Tiwari. Devesh (January 2022, Proceedings of the ASCR Workshop on the Management and Storage of Scientific Data)

Full Text Available

« Prev Next »

Search for: All records