NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Efficient and Robust Transfer Learning of Optimal Individualized Treatment Regimes with Right-Censored Survival Data

Zhao, Pan; Josse, Julie; Yang, Shu (February 2025, Journal of Machine Learning Research)

Free, publicly-accessible full text available February 1, 2026
L1Quad: L1 Adaptive Augmentation of Geometric Control for Agile Quadrotors With Performance Guarantees

https://doi.org/10.1109/TCST.2024.3521182

Wu, Zhuohuan; Cheng, Sheng; Zhao, Pan; Gahlawat, Aditya; Ackerman, Kasey A; Lakshmanan, Arun; Yang, Chengyu; Yu, Jiahao; Hovakimyan, Naira (March 2025, IEEE Transactions on Control Systems Technology)

Free, publicly-accessible full text available March 1, 2026
Model Predictive Control Barrier Functions: Guaranteed Safety with Reduced Conservatism and Shortened Horizon

https://doi.org/10.23919/ACC60939.2024.10644741

Abdi, Hossein; Zhao, Pan; Hovakimyan, Naira; Ghabcheloo, Reza (July 2024, IEEE)

In this study, we address the problem of safe control in systems subject to state and input constraints by integrating the Control Barrier Function (CBF) into the Model Predictive Control (MPC) formulation. While CBF offers a conservative policy and traditional MPC lacks the safety guarantee beyond the finite horizon, the proposed scheme takes advantage of both MPC and CBF approaches to provide a guaranteed safe control policy with reduced conservatism and a shortened horizon. The proposed methodology leverages the sum-of-square (SOS) technique to construct CBFs that make forward invariant safe sets in the state space that are then used as a terminal constraint on the last predicted state. CBF invariant sets cover the state space around system fixed points. These islands of forward invariant CBF sets will be connected to each other using MPC. To do this, we proposed a technique to handle the MPC optimization problem subject to the combination of intersections and union of constraints. Our approach, termed Model Predictive Control Barrier Functions (MPCBF), is validated using numerical examples to demonstrate its efficacy, showing improved performance compared to classical MPC and CBF.
more » « less
Full Text Available
Positivity-free Policy Learning with Observational Dat

Zhao, Pan; Chambaz, Antoine; Josse, Julie; Yang, Shu (May 2024, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics)

Full Text Available
Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions

Cheng, Yikun; Zhao, Pan; Hovakimyan, Naira (June 2023, Proceedings of Machine Learning Research)

Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian processes-based model learning, in terms of safety violation rate, and sample and computational efficiency.
more » « less
Full Text Available
Safe and Efficient Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions

Cheng, Yikun; Zhao, Pan; Hovakimyan, Naira (June 2023, Proceedings of The 5th Annual Learning for Dynamics and Control Conference)
Matni, Nikolai; Morari, Manfred; Pappas, George J. (Ed.)
Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian processes-based model learning, in terms of safety violation rate, and sample and computational efficiency.
more » « less
Full Text Available
Convex Synthesis of Control Barrier Functions Under Input Constraints

https://doi.org/10.1109/LCSYS.2023.3293765

Zhao, Pan; Ghabcheloo, Reza; Cheng, Yikun; Abdi, Hossein; Hovakimyan, Naira (January 2023, IEEE Control Systems Letters)

Full Text Available
Robust Nonlinear Tracking Control with Exponential Convergence Using Contraction Metrics and Disturbance Estimation

https://doi.org/10.3390/s22134743

Zhao, Pan; Guo, Ziyao; Hovakimyan, Naira (July 2022, Sensors)

This paper presents a tracking controller for nonlinear systems with matched uncertainties based on contraction metrics and disturbance estimation that provides exponential convergence guarantees. Within the proposed approach, a disturbance estimator is proposed to estimate the pointwise value of the uncertainties, with a pre-computable estimation error bounds (EEB). The estimated disturbance and the EEB are then incorporated in a robust Riemannian energy condition to compute the control law that guarantees exponential convergence of actual state trajectories to desired ones. Simulation results on aircraft and planar quadrotor systems demonstrate the efficacy of the proposed controller, which yields better tracking performance than existing controllers for both systems.
more » « less
Full Text Available
Improving the Robustness of Reinforcement Learning Policies With $${\mathcal {L}_{1}}$$ Adaptive Control

https://doi.org/10.1109/LRA.2022.3169309

Cheng, Yikun; Zhao, Pan; Wang, Fanxin; Block, Daniel J.; Hovakimyan, Naira (July 2022, IEEE Robotics and Automation Letters)

Full Text Available
Tube-Certified Trajectory Tracking for Nonlinear Systems With Robust Control Contraction Metrics

https://doi.org/10.1109/LRA.2022.3153712

Zhao, Pan; Lakshmanan, Arun; Ackerman, Kasey; Gahlawat, Aditya; Pavone, Marco; Hovakimyan, Naira (April 2022, IEEE Robotics and Automation Letters)

Full Text Available

« Prev Next »

Search for: All records