NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Domain Randomization is Sample Efficient for Linear Quadratic Control

Fujinami, Tesshu; Lee, Bruce D; Matni, Nikolai; Pappas, George J (June 2025, PMLR --- L4DC 2025)

Free, publicly-accessible full text available June 1, 2026
State space models, emergence, and ergodicity: How many parameters are needed for stable predictions?

Ziemann, Ingvar; Matni, Nikolai; Pappas, George (June 2025, PMLR --- L4DC 2025)

Free, publicly-accessible full text available June 1, 2026
Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control

https://doi.org/10.1609/aaai.v39i17.33987

Lee, Bruce D; Toso, Leonardo F; Zhang, Thomas T; Anderson, James; Matni, Nikolai (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Representation learning is a powerful tool that enables learning over large multitudes of agents or domains by enforcing that all agents operate on a shared set of learned features. However, many robotics or controls applications that would benefit from collaboration operate in settings with changing environments and goals, whereas most guarantees for representation learning are stated for static settings. Toward rigorously establishing the benefit of representation learning in dynamic settings, we analyze the regret of multi-task representation learning for linear-quadratic control. This setting introduces unique challenges. Firstly, we must account for and balance the misspecification introduced by an approximate representation. Secondly, we cannot rely on the parameter update schemes of single-task online LQR, for which least-squares often suffices, and must devise a novel scheme to ensure sufficient improvement. We demonstrate that for settings where exploration is benign, the regret of any agent after T timesteps scales with the square root of T/H, where H is the number of agents. In settings with difficult exploration, the regret scales as the square root of the input dimension times the parameter dimension multiplied by T, plus a term which scales with T to the three quarters divided by H to the one fifth. In both cases, by comparing to the minimax single-task regret, we see a benefit of a large number of agents. Notably, in the difficult exploration case, by sharing a representation across tasks, the effective task-specific parameter count can often be small. Lastly, we validate the trends we predict.
more » « less
Free, publicly-accessible full text available April 11, 2026
Rate-Optimal Non-Asymptotics for the Quadratic Prediction Error Method

https://doi.org/10.1109/CDC56724.2024.10886130

Stamouli, Charis; Ziemann, Ingvar; Pappas, George J (December 2024, IEEE)

Full Text Available
Single Trajectory Conformal Prediction

https://doi.org/10.1109/CDC56724.2024.10886644

Lee, Brian; Matni, Nikolai (December 2024, IEEE)

Full Text Available
Uncertainty-Aware Deployment of Pre-trained Language-Conditioned Imitation Learning Policies

https://doi.org/10.1109/IROS58592.2024.10802849

Wu, Bo; Lee, Bruce D; Daniilidis, Kostas; Bucher, Bernadette; Matni, Nikolai (October 2024, IEEE)

Full Text Available
Recursively Feasible Shrinking-Horizon MPC in Dynamic Environments with Conformal Prediction Guarantees

Stamouli, Charis; Lindemann, Lars; Pappas, George J (July 2024, L4DC - PMLR)

n this paper, we focus on the problem of shrinking-horizon Model Predictive Control (MPC) in uncertain dynamic environments. We consider controlling a deterministic autonomous system that interacts with uncontrollable stochastic agents during its mission. Employing tools from conformal prediction, existing works derive high-confidence prediction regions for the unknown agent trajectories, and integrate these regions in the design of suitable safety constraints for MPC. Despite guaranteeing probabilistic safety of the closed-loop trajectories, these constraints do not ensure feasibility of the respective MPC schemes for the entire duration of the mission. We propose a shrinking-horizon MPC that guarantees recursive feasibility via a gradual relaxation of the safety constraints as new prediction regions become available online. This relaxation enforces the safety constraints to hold over the least restrictive prediction region from the set of all available prediction regions. In a comparative case study with the state of the art, we empirically show that our approach results in tighter prediction regions and verify recursive feasibility of our MPC scheme.
more » « less
Full Text Available
Nonasymptotic regret analysis of adaptive linear quadratic control with model misspecification

Lee, Bruce; Rantzer, Anders; Matni, Nikolai (July 2024, L4DC - PMLR)

Full Text Available
Sample-Efficient Linear Representation Learning from Non-IID Non-Isotropic Data

Zhang, Thomas TCK; Toso, Leonardo Felipe; Anderson, James; Matni, Nikolai (January 2024, ICLR 2024)

Full Text Available

Search for: All records