NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A decoupled, linear, unconditionally stable, and fully discrete finite element scheme for two-phase ferrofluid flows with different densities and viscosities

https://doi.org/10.1016/j.jcp.2025.114209

Chen, Xiaoyong; Li, Rui; Li, Jian; He, Xiaoming (October 2025, Journal of Computational Physics)

Free, publicly-accessible full text available October 1, 2026
Nitro: Boosting Distributed Reinforcement Learning with Serverless Computing

https://doi.org/10.14778/3696435.3696441

Yu, Hanfei; Carter, Jacob; Wang, Hao; Tiwari, Devesh; Li, Jian; Park, Seung-Jong (September 2025, Proceedings of the VLDB Endowment)

Deep reinforcement learning (DRL) has demonstrated significant potential in various applications, including gaming AI, robotics, and system scheduling. DRL algorithms produce, sample, and learn from training data online through a trial-and-error process, demanding considerable time and computational resources. To address this, distributed DRL algorithms and paradigms have been developed to expedite training using extensive resources. Through carefully designed experiments, we are the first to observe that strategically increasing the actor-environment interactions by spawning more concurrent actors at certain training rounds within ephemeral time frames can significantly enhance training efficiency. Yet, current distributed DRL solutions, which are predominantly server-based (or serverful), fail to capitalize on these opportunities due to their long startup times, limited adaptability, and cumbersome scalability. This paper proposesNitro, a generic training engine for distributed DRL algorithms that enforces timely and effective boosting with concurrent actors instantaneously spawned by serverless computing. With serverless functions,Nitroadjusts data sampling strategies dynamically according to the DRL training demands.Nitroseizes the opportunity of real-time boosting by accurately and swiftly detecting an empirical metric. To achieve cost efficiency, we design a heuristic actor scaling algorithm to guideNitrofor cost-aware boosting budget allocation. We integrateNitrowith state-of-the-art DRL algorithms and frameworks and evaluate them on AWS EC2 and Lambda. Experiments with Mujoco and Atari benchmarks show thatNitroimproves the final rewards (i.e., training quality) by up to 6× and reduces training costs by up to 42%.
more » « less
Free, publicly-accessible full text available September 1, 2026
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations

Xiong, Guojun; Wang, Shufan; Jiang, Daniel; Li, Jian (April 2025, The Thirteenth International Conference on Learning Representations (ICLR 2025))

Federated reinforcement learning (FedRL) enables multiple agents to collaboratively learn a policy without needing to share the local trajectories collected during agent-environment interactions. However, in practice, the environments faced by different agents are often heterogeneous, but since existing FedRL algorithms learn a single policy across all agents, this may lead to poor performance. In this paper, we introduce a \emph{personalized} FedRL framework (PFedRL) by taking advantage of possibly shared common structure among agents in heterogeneous environments. Specifically, we develop a class of PFedRL algorithms named PFedRL-Rep that learns (1) a shared feature representation collaboratively among all agents, and (2) an agent-specific weight vector personalized to its local environment. We analyze the convergence of PFedTD-Rep, a particular instance of the framework with temporal difference (TD) learning and linear representations. To the best of our knowledge, we are the first to prove a linear convergence speedup with respect to the number of agents in the PFedRL setting. To achieve this, we show that PFedTD-Rep is an example of federated two-timescale stochastic approximation with Markovian noise. Experimental results demonstrate that PFedTD-Rep, along with an extension to the control setting based on deep Q-networks (DQN), not only improve learning in heterogeneous settings, but also provide better generalization to new environments.
more » « less
Free, publicly-accessible full text available April 24, 2026
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

Xiong, Guojun; Dinesha, Ujwal; Mukherjee, Debajoy; Li, Jian; Shakkottai, Srinivas (April 2025, The Thirteenth International Conference on Learning Representations (ICLR 2025))

Restless multi-armed bandits (RMAB) has been widely used to model constrained sequential decision making problems, where the state of each restless arm evolves according to a Markov chain and each state transition generates a scalar reward. However, the success of RMAB crucially relies on the availability and quality of reward signals. Unfortunately, specifying an exact reward function in practice can be challenging and even infeasible. In this paper, we introduce Pref-RMAB, a new RMAB model in the presence of preference signals, where the decision maker only observes pairwise preference feedback rather than scalar reward from the activated arms at each decision epoch. Preference feedback, however, arguably contains less information than the scalar reward, which makes Pref-RMAB seemingly more difficult. To address this challenge, we present a direct online preference learning (DOPL) algorithm for Pref-RMAB to efficiently explore the unknown environments, adaptively collect preference data in an online manner, and directly leverage the preference feedback for decision-makings. We prove that DOPL yields a sublinear regret. To our best knowledge, this is the first algorithm to ensure $$\tilde{\mathcal{O}}(\sqrt{T\ln T})$$ regret for RMAB with preference feedback. Experimental results further demonstrate the effectiveness of DOPL.
more » « less
Free, publicly-accessible full text available April 24, 2026
Model-Based Knowledge-Driven Learning Approach for Enhanced High-Resolution Automotive Radar Imaging

https://doi.org/10.1109/TRS.2025.3563492

Zheng, Ruxin; Sun, Shunqiao; Liu, Hongshan; Chen, Honglei; Li, Jian (April 2025, IEEE Transactions on Radar Systems)

Free, publicly-accessible full text available April 22, 2026
Decentralized Federated Learning with Model Caching on Mobile Agents

https://doi.org/10.1609/aaai.v39i20.35429

Wang, Xiaoyu; Xiong, Guojun; Cao, Houwei; Li, Jian; Liu, Yong (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Federated Learning (FL) trains a shared model using data and computation power on distributed agents coordinated by a central server. Decentralized FL (DFL) utilizes local model exchange and aggregation between agents to reduce the communication and computation overheads on the central server. However, when agents are mobile, the communication opportunity between agents can be sporadic, largely hindering the convergence and accuracy of DFL. In this paper, we propose Cached Decentralized Federated Learning (Cached-DFL) to investigate delay-tolerant model spreading and aggregation enabled by model caching on mobile agents. Each agent stores not only its own model, but also models of agents encountered in the recent past. When two agents meet, they exchange their own models as well as the cached models. Local model aggregation utilizes all models stored in the cache. We theoretically analyze the convergence of Cached-DFL,explicitly taking into account the model staleness introduced by caching. We design and compare different model caching algorithms for different DFL and mobility scenarios. We conduct detailed case studies in a vehicular network to systematically investigate the interplay between agent mobility, cache staleness, and model convergence. In our experiments, Cached-DFL converges quickly, and significantly outperforms DFL without caching.
more » « less
Free, publicly-accessible full text available April 11, 2026
Advancing High-Resolution and Efficient Automotive Radar Imaging through Domain-Informed 1D Deep Learning

https://doi.org/10.1109/ICASSP49660.2025.10890731

Zheng, Ruxin; Sun, Shunqiao; Liu, Hongshan; Caesar, Holger; Chen, Honglei; Li, Jian (April 2025, IEEE)

Free, publicly-accessible full text available April 6, 2026
Decentralized Federated Learning with Model Caching on Mobile Agents

Wang, Xiaoyu; Xiong, Guojun; Cao, Houwei; Li, Jian; Liu, Yong (February 2025, AAAI 2025)

Free, publicly-accessible full text available February 28, 2026
Decentralized Federated Learning with Model Caching on Mobile Agents

Wang, Xiaoyu; Xiong, Guojun; Cao, Houwei; Li, Jian Li; Liu, Yong (February 2025, Proceedings of Thirty-Ninth AAAI Conference on Artificial Intelligence)

Federated Learning (FL) aims to train a shared model using data and computation power on distributed agents coordinated by a central server. Decentralized FL (DFL) utilizes local model exchange and aggregation between agents to reduce the communication and computation overheads on the central server. However, when agents are mobile, the communication opportunity between agents can be sporadic, largely hindering the convergence and accuracy of DFL. In this paper, we study delay-tolerant model spreading and aggregation enabled by model caching on mobile agents. Each agent stores not only its own model, but also models of agents encountered in the recent past. When two agents meet, they exchange their own models as well as the cached models. Local model aggregation works on all models in the cache. We theoretically analyze the convergence of DFL with cached models, explicitly taking into account the model staleness introduced by caching. We design and compare different model caching algorithms for different DFL and mobility scenarios. We conduct detailed case studies in a vehicular network to systematically investigate the interplay between agent mobility, cache staleness, and model convergence. In our experiments, cached DFL converges quickly, and significantly outperforms DFL without caching.
more » « less
Free, publicly-accessible full text available February 27, 2026
MBA-RAG:aBandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity

Tang, Xiaqiang; Gao, Qiang; Li, Jian; Du, Nan; Li, Qi; Xie, Sihong (January 2025, The 31st International Conference on Computational Linguistics)

Free, publicly-accessible full text available January 19, 2026

« Prev Next »

Search for: All records