NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Backup Plan Constrained Model Predictive Control with Guaranteed Stability

https://doi.org/10.2514/1.G007627

Tao, Ran; Kim, Hunmin; Yoon, Hyung-Jin; Wan, Wenbin; Hovakimyan, Naira; Sha, Lui; Voulgaris, Petros (October 2023, Journal of Guidance, Control, and Dynamics)

Safe control designs for robotic systems remain challenging because of the difficulties of explicitly solving optimal control with nonlinear dynamics perturbed by stochastic noise. However, recent technological advances in computing devices enable online optimization or sampling-based methods to solve control problems. For example, Control Barrier Functions (CBFs) have been proposed to numerically solve convex optimization problems that ensure the control input to stay in the safe set. Model Predictive Path Integral (MPPI) control uses forward sampling of stochastic differential equations to solve optimal control problems online. Both control algorithms are widely used for nonlinear systems because they avoid calculating the derivatives of the nonlinear dynamic functions. In this paper, we use Stochastic Control Barrier Functions (SCBFs) constraints to limit sample regions in the samplingbased algorithm, ensuring safety in a probabilistic sense and improving sample efficiency with a stochastic differential equation. We also show that our algorithm needs fewer samples than the original MPPI algorithm does by providing a sampling complexity analysis.
more » « less
Full Text Available
Adaptive Risk Sensitive Path Integral for Model Predictive Control via Reinforcement Learning

https://doi.org/10.1109/MED59994.2023.10185876

Yoon, Hyung-Jin; Tao, Chuyuan; Kim, Hunmin; Hovakimyan, Naira; Voulgaris, Petros (June 2023, editerranean Conference on Control and Automation (MED))

We propose a reinforcement learning framework where an agent uses an internal nominal model for stochastic model predictive control (MPC) while compensating for a disturbance. Our work builds on the existing risk-aware optimal control with stochastic differential equations (SDEs) that aims to deal with such disturbance. However, the risk sensitivity and the noise strength of the nominal SDE in the riskaware optimal control are often heuristically chosen. In the proposed framework, the risk-taking policy determines the behavior of the MPC to be risk-seeking (exploration) or riskaverse (exploitation). Specifcally, we employ the risk-aware path integral control that can be implemented as a Monte-Carlo (MC) sampling with fast parallel simulations using a GPU. The MC sampling implementations of the MPC have been successful in robotic applications due to their real-time computation capability. The proposed framework that adapts the noise model and the risk sensitivity outperforms the standard model predictive path integ
more » « less
RRT Guided Model Predictive Path Integral Method

https://doi.org/10.23919/ACC55779.2023.10155837

Tao, Chuyuan; Kim, Hunmin; Hovakimyan, Naira (May 2023, IEEE)

This work presents an optimal sampling-based method to solve the real-time motion planning problem in static and dynamic environments, exploiting the Rapid-exploring Random Trees (RRT) algorithm and the Model Predictive Path Integral (MPPI) algorithm. The RRT algorithm provides a nominal mean value of the random control distribution in the MPPI algorithm, resulting in satisfactory control performance in static and dynamic environments without a need for fine parameter tuning. We also discuss the importance of choosing the right mean of the MPPI algorithm, which balances exploration and optimality gap, given a fixed sample size. In particular, a sufficiently large mean is required to explore the state space enough, and a sufficiently small mean is required to guarantee that the samples reconstruct the optimal control. The proposed methodology automates the procedure of choosing the right mean by incorporating the RRT algorithm. The simulations demonstrate that the proposed algorithm can solve the motion planning problem for static or dynamic environments.
more » « less
Full Text Available
Sℒ 1-Simplex: Safe Velocity Regulation of Self-Driving Vehicles in Dynamic and Unforeseen Environments

https://doi.org/10.1145/3564273

Mao, Yanbing; Gu, Yuliang; Hovakimyan, Naira; Sha, Lui; Voulgaris, Petros (January 2023, ACM Transactions on Cyber-Physical Systems)
Tarek Abdelzaher, Karl-Erik Arzen (Ed.)
This article proposes a novel extension of the Simplex architecture with model switching and model learning to achieve safe velocity regulation of self-driving vehicles in dynamic and unforeseen environments. To guarantee the reliability of autonomous vehicles, an ℒ₁adaptive controller that compensates for uncertainties and disturbances is employed by the Simplex architecture as a verified high-assurance controller (HAC) to tolerate concurrent software and physical failures. Meanwhile, the safe switching controller is incorporated into the HAC for safe velocity regulation in the dynamic (prepared) environments, through the integration of the traction control system and anti-lock braking system. Due to the high dependence of vehicle dynamics on the driving environments, the HAC leverages the finite-time model learning to timely learn and update the vehicle model for ℒ₁adaptive controller, when any deviation from the safety envelope or the uncertainty measurement threshold occurs in the unforeseen driving environments. With the integration of ℒ₁adaptive controller, safe switching controller and finite-time model learning, the vehicle’s angular and longitudinal velocities can asymptotically track the provided references in the dynamic and unforeseen driving environments, while the wheel slips are restricted to safety envelopes to prevent slipping and sliding. Finally, the effectiveness of the proposed Simplex architecture for safe velocity regulation is validated by the AutoRally platform.
more » « less
Full Text Available
Path Integral Methods with Stochastic Control Barrier Functions

https://doi.org/10.1109/CDC51059.2022.9993190

Tao, Chuyuan; Yoon, Hyung-Jin; Kim, Hunmin; Hovakimyan, Naira; Voulgaris, Petros (December 2022, IEEE)
Improving the Robustness of Reinforcement Learning Policies With L1 Adaptive Control

Yikun Cheng, Pan Zhao (July 2022, IEEE robotics automation letters)
Tamim Asfour, editor in (Ed.)
A reinforcement learning (RL) control policy could fail in a new/perturbed environment that is different from the training environment, due to the presence of dynamic variations. For controlling systems with continuous state and action spaces, we propose an add-on approach to robustifying a pre-trained RL policy by augmenting it with an L1 adaptive controller (L1AC). Leveraging the capability of an L1AC for fast estimation and active ompensation of dynamic variations, the proposed approach can improve the robustness of an RL policy which is trained either in a simulator or in the real world without consideration of a broad class of dynamic variations. Numerical and real-world experiments empirically demonstrate the efficacy of the proposed approach in robustifying RL policies trained using both model-free and modelbased methods.
more » « less
Full Text Available
Risk Ranked Recall: Collision Safety Metric for Object Detection Systems in Autonomous Vehicles

https://doi.org/10.1109/MECO52532.2021.9460196

Bansal, Ayoosh; Singh, Jayati; Verucchi, Micaela; Caccamo, Marco; Sha, Lui (June 2021, 10th Mediterranean Conference on Embedded Computing)
null (Ed.)
Commonly used metrics for evaluation of object detection systems (precision, recall, mAP) do not give complete information about their suitability of use in safety-critical tasks, like obstacle detection for collision avoidance in Autonomous Vehicles (AV). This work introduces the Risk Ranked Recall ($R^3$) metrics for object detection systems. The $R^3$ metrics categorize objects within three ranks. Ranks are assigned based on an objective cyber-physical model for the risk of collision. Recall is measured for each rank
more » « less
Full Text Available
Impact of Confirmation Bias on Competitive Information Spread in Social Networks

https://doi.org/10.1109/TCNS.2021.3050117

Mao, Yanbing; Akyol, Emrah; Hovakimyan, Naira (June 2021, IEEE Transactions on Control of Network Systems)
null (Ed.)
Full Text Available
Robust Vehicle Lane Keeping Control with Networked Proactive Adaptation

https://doi.org/10.23919/ACC50511.2021.9482669

Kim, Hunmin; Wan, Wenbin; Hovakimyan, Naira; Sha, Lui; Voulgaris, Petros (May 2021, American Control Conference)
null (Ed.)
Road condition is an important environmental factor for autonomous vehicle control. A dramatic change in the road condition from the nominal status is a source of uncertainty that can lead to a system failure. Once the vehicle encounters an uncertain environment, such as hitting an ice patch, it is too late to reduce the speed, and the vehicle can lose control. To cope with unforeseen uncertainties in advance, we study a proactive robust adaptive control architecture for autonomous vehicles' lane-keeping control problems. The data center generates a prior environmental uncertainty estimate by combining weather forecasts and measurements from anonymous vehicles through a spatio-temporal filter. The prior estimate contributes to designing a robust heading controller and nominal longitudinal velocity for proactive adaptation to each new condition. The control parameters are updated based on posterior information fusion with on-board measurements.
more » « less
Full Text Available
Design of Command Limiting Control Law Using Exponential Potential Functions

https://doi.org/10.2514/1.G004972

Sun, Donglei; Hovakimyan, Naira; Jafarnejadsani, Hamidreza (February 2021, Journal of Guidance, Control, and Dynamics)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records