NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

L1Quad: L1 Adaptive Augmentation of Geometric Control for Agile Quadrotors With Performance Guarantees

https://doi.org/10.1109/TCST.2024.3521182

Wu, Zhuohuan; Cheng, Sheng; Zhao, Pan; Gahlawat, Aditya; Ackerman, Kasey A; Lakshmanan, Arun; Yang, Chengyu; Yu, Jiahao; Hovakimyan, Naira (March 2025, IEEE Transactions on Control Systems Technology)

Free, publicly-accessible full text available March 1, 2026
An Optimization-Based Planner with B-spline Parameterized Continuous-Time Reference Signals

https://doi.org/10.1109/IROS58592.2024.10802083

Tao, Chuyuan; Cheng, Sheng; Wang, Fanxin; Zhao, Yang; Hovakimyan, Naira (October 2024, IEEE)

For the cascaded planning and control modules implemented for robot navigation, the frequency gap between the planner and controller has received limited attention. In this study, we introduce a novel B-spline parameterized optimization-based planner (BSPOP) designed to address the frequency gap challenge with limited onboard computational power in robots. The proposed planner generates continuous-time control inputs for low-level controllers running at arbitrary frequencies to track. Furthermore, when considering the convex control action sets, BSPOP uses the convex hull property to automatically constrain the continuous-time control inputs within the convex set. Consequently, compared with the discrete-time optimization-based planners, BSPOP reduces the number of decision variables and inequality constraints, which improves computational efficiency as a byproduct. Simulation results demonstrate that our approach can achieve a comparable planning performance to the high-frequency baseline optimization-based planners while demanding less computational power. Both simulation and experiment results show that the proposed method performs better in planning compared with baseline planners in the same frequency.
more » « less
Full Text Available
DiffTune-MPC: Closed-Loop Learning for Model Predictive Control

https://doi.org/10.1109/LRA.2024.3422836

Tao, Ran; Cheng, Sheng; Wang, Xiaofeng; Wang, Shenlong; Hovakimyan, Naira (August 2024, IEEE Robotics and Automation Letters)

Model predictive control (MPC) has been applied to many platforms in robotics and autonomous systems for its capability to predict a system’s future behavior while incorporating constraints that a system may have. To enhance the performance of a system with an MPC controller, one can manually tunethe MPC’s cost function. However, it can be challenging due to the possibly high dimension of the parameter space as well as the potential difference between the open-loop cost function in MPC and the overall closed-loop performance metric function. This letter presents Difffune-MPC, a novel learning method, to learn the cost function of an MPC in a closed-loop manner. The proposed framework is compatible with the scenario where the time interval for performance evaluation and MPC’s planning horizon have different lengths. We show the auxiliary problem whose solution admits the analytical gradients of MPC and discuss its variations in different MPC settings, including nonlinear MPCs that are solved using sequential quadratic programming. Simulation results demonstrate the learning capability of DiffTune-MPC and the generalization capability of the learned MPC parameters.
more » « less
Full Text Available
Proto-MPC: An Encoder-Prototype-Decoder Approach for Quadrotor Control in Challenging Winds

Gu, Y; Cheng, S; Hovakimyan, N (July 2024, Proceedings of Machine Learning Research)

Full Text Available
Model Predictive Control Barrier Functions: Guaranteed Safety with Reduced Conservatism and Shortened Horizon

https://doi.org/10.23919/ACC60939.2024.10644741

Abdi, Hossein; Zhao, Pan; Hovakimyan, Naira; Ghabcheloo, Reza (July 2024, IEEE)

In this study, we address the problem of safe control in systems subject to state and input constraints by integrating the Control Barrier Function (CBF) into the Model Predictive Control (MPC) formulation. While CBF offers a conservative policy and traditional MPC lacks the safety guarantee beyond the finite horizon, the proposed scheme takes advantage of both MPC and CBF approaches to provide a guaranteed safe control policy with reduced conservatism and a shortened horizon. The proposed methodology leverages the sum-of-square (SOS) technique to construct CBFs that make forward invariant safe sets in the state space that are then used as a terminal constraint on the last predicted state. CBF invariant sets cover the state space around system fixed points. These islands of forward invariant CBF sets will be connected to each other using MPC. To do this, we proposed a technique to handle the MPC optimization problem subject to the combination of intersections and union of constraints. Our approach, termed Model Predictive Control Barrier Functions (MPCBF), is validated using numerical examples to demonstrate its efficacy, showing improved performance compared to classical MPC and CBF.
more » « less
Full Text Available
ROBUST MODEL BASED REINFORCEMENT LEARNING USING L1 ADAPTIVE CONTROL

Sung, Minjun; Karumanchi, Sambhu_Harimanas; Gahlawat, Aditya; Hovakimyan, Naira (January 2024, International Conference on learning representations, openreview.net)

We introduce L1-MBRL, a control-theoretic augmentation scheme for Model-Based Reinforcement Learning (MBRL) algorithms. Unlike model-free approaches, MBRL algorithms learn a model of the transition function using data and use it to design a control input. Our approach generates a series of approximate control-affine models of the learned transition function according to the proposed switching law. Using the approximate model, control input produced by the underlying MBRL is perturbed by the L1 adaptive control, which is designed to enhance the robustness of the system against uncertainties. Importantly, this approach is agnostic to the choice of MBRL algorithm, enabling the use of the scheme with various MBRL algorithms. MBRL algorithms with L1 augmentation exhibit enhanced performance and sample efficiency across multiple MuJoCo environments, outperforming the original MBRL algorithms, both with and without system noise.
more » « less
Full Text Available
DiffTune: Autotuning Through Autodifferentiation

https://doi.org/10.1109/TRO.2024.3429191

Cheng, Sheng; Kim, Minkyung; Song, Lin; Yang, Chengyu; Jin, Yiquan; Wang, Shenlong; Hovakimyan, Naira (January 2024, IEEE Transactions on Robotics)

Full Text Available
Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions

Cheng, Yikun; Zhao, Pan; Hovakimyan, Naira (June 2023, Proceedings of Machine Learning Research)

Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian processes-based model learning, in terms of safety violation rate, and sample and computational efficiency.
more » « less
Full Text Available
Simultaneous Spatial and Temporal Assignment for Fast UAV Trajectory Optimization Using Bilevel Optimization

https://doi.org/10.1109/LRA.2023.3273399

Chen, Qianzhong; Cheng, Sheng; Hovakimyan, Naira (June 2023, IEEE Robotics and Automation Letters)

Full Text Available
Safe and Efficient Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions

Cheng, Yikun; Zhao, Pan; Hovakimyan, Naira (June 2023, Proceedings of The 5th Annual Learning for Dynamics and Control Conference)
Matni, Nikolai; Morari, Manfred; Pappas, George J. (Ed.)
Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian processes-based model learning, in terms of safety violation rate, and sample and computational efficiency.
more » « less
Full Text Available

« Prev Next »

Search for: All records