skip to main content


Title: Decoupled Data-Based Approach for Learning to Control Nonlinear Dynamical Systems
This paper addresses the problem of learning the optimal control policy for a nonlinear stochastic dynam- ical. This problem is subject to the ‘curse of dimension- ality’ associated with the dynamic programming method. This paper proposes a novel decoupled data-based con- trol (D2C) algorithm that addresses this problem using a decoupled, ‘open-loop - closed-loop’, approach. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Then, closed-loop control is developed around this open-loop trajectory by linearization of the dynamics about this nominal trajectory. By virtue of linearization, a linear quadratic regulator based algorithm can be used for this closed-loop control. We show that the performance of D2C algorithm is approximately optimal. Moreover, simulation performance suggests a significant reduction in training time compared to other state of the art algorithms.  more » « less
Award ID(s):
1850206
NSF-PAR ID:
10315328
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
IEEE Transactions on Automatic Control
ISSN:
0018-9286
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This paper addresses trajectory optimization for hypersonic vehicles under atmospheric and aerodynamic uncertainties using techniques from desensitized optimal control (DOC), wherein open-loop optimal controls are obtained by minimizing the sum of the standard objective function and a first-order penalty on trajectory variations due to parametric uncertainty. The proposed approach is demonstrated via numerical simulations of a minimum-final-time Earth reentry trajectory for an X-33 vehicle with an uncertain atmospheric scale height and drag coefficient. Monte Carlo simulations indicate that dispersions in the final position footprint and the final energy can be significantly reduced without closed-loop control and with little tradeoff in the performance metric set for the trajectory. 
    more » « less
  2. This paper develops a closed-loop approach for ink-jet 3D printing. The control design is based on a distributed model predictive control scheme, which can handle constraints (such as droplet volume) as well as the large-scale nature of the problem. The high resolution of ink-jet 3D printing make centralized methods extremely time-consuming, thus a distributed implementation of the controller is developed. First a graph-based height evolution model that can capture the liquid flow dynamics is proposed. Then, a scalable closed-loop control algorithm is designed based on the model using Distributed MPC, that reduces computation time significantly. The performance and efficiency of the algorithm are shown to outperform open-loop printing and closed-loop printing with existing Centralized MPC methods through simulation results. 
    more » « less
  3. We study an optimal control problem arising from a generalization of rock-paper-scissors in which the number of strategies may be selected from any positive odd number greater than 1 and in which the payoff to the winner is controlled by a control variable \begin{document}$ \gamma $\end{document}. Using the replicator dynamics as the equations of motion, we show that a quasi-linearization of the problem admits a special optimal control form in which explicit dynamics for the controller can be identified. We show that all optimal controls must satisfy a specific second order differential equation parameterized by the number of strategies in the game. We show that as the number of strategies increases, a limiting case admits a closed form for the open-loop optimal control. In performing our analysis we show necessary conditions on an optimal control problem that allow this analytic approach to function.

     
    more » « less
  4. This paper develops a control co-design (CCD) framework to simultaneously optimize the spacecraft’s trajectory and onboard system (rocket engine) and quantify its benefit. An open-loop optimal control problem (two-finite burn Mars missions) is used as the benchmark, and the engine design considers the combustion equilibrium and nozzle geometry. The objective function is the fuel burn. The design variables are the trajectory control parameters (such as burn times, burn directions, and time of flight), initial fuel mass, and engine design parameters (such as throat area, mixture ratio, and chamber pressure). The constraints include the final velocities and positions of spacecraft. Single-point optimizations are conducted for three departure dates in May, July, and September 2020. A multipoint optimization is also performed to balance the engine performance for these dates with 49 design variables and 20 constraints. It is found that the CCD optimizations exhibit 22–28% more fuel burn reduction than the trajectory-only optimization with fixed engine parameters and 16–20% more fuel burn reduction than the decoupled trajectory-engine optimization. The proposed CCD optimization framework can be extended to more spacecraft trajectory control parameters and onboard systems and has the potential to design more efficient spacecraft missions. 
    more » « less
  5. Jin, Shi (Ed.)
    In this paper, we apply the idea of fictitious play to design deep neural networks (DNNs), and develop deep learning theory and algorithms for computing the Nash equilibrium of asymmetric N-player non-zero-sum stochastic differential games, for which we refer as deep fictitious play, a multi-stage learning process. Specifically at each stage, we propose the strategy of letting individual player optimize her own payoff subject to the other players’ previous actions, equivalent to solving N decoupled stochastic control optimization problems, which are approximated by DNNs. Therefore, the fictitious play strategy leads to a structure consisting of N DNNs, which only communicate at the end of each stage. The resulting deep learning algorithm based on fictitious play is scalable, parallel and model-free, i.e., using GPU parallelization, it can be applied to any N-player stochastic differential game with different symmetries and heterogeneities (e.g., existence of major players). We illustrate the performance of the deep learning algorithm by comparing to the closed-form solution of the linear quadratic game. Moreover, we prove the convergence of fictitious play under appropriate assumptions, and verify that the convergent limit forms an open-loop Nash equilibrium. We also discuss the extensions to other strategies designed upon fictitious play and closed-loop Nash equilibrium in the end. 
    more » « less