This paper proposes a novel learning-based adaptive optimal controller design method for a class of continuous-time linear time-delay systems. A key strategy is to exploit the state-of-the-art reinforcement learning (RL) techniques and adaptive dynamic programming (ADP), and propose a data-driven method to learn the near-optimal controller without the precise knowledge of system dynamics. Specifically, a value iteration (VI) algorithm is proposed to solve the infinite-dimensional Riccati equation for the linear quadratic optimal control problem of time-delay systems using finite samples of input-state trajectory data. It is rigorously proved that the proposed VI algorithm converges to the near-optimal solution. Compared with the previous literature, the nice features of the proposed VI algorithm are that it is directly developed for continuous-time systems without discretization and an initial admissible controller is not required for implementing the algorithm. The efficacy of the proposed methodology is demonstrated by two practical examples of metal cutting and autonomous driving.
more »
« less
Hybrid robust minimum-time control for a class of non-exponentially unstable planar systems
This paper deals with robust minimum-time control of a class of asymptotically null-controllable with bounded input planar systems. A hybrid controller is proposed to robustly achieve global finite time stability of a set of points wherein the plant state is zero. The resulting controller provides time optimal response from initial conditions in a certain subset of the state space, and finite time convergence elsewhere. Finally, the effectiveness of the proposed methods is demonstrated in a numerical example.
more »
« less
- Award ID(s):
- 1710621
- PAR ID:
- 10066576
- Date Published:
- Journal Name:
- Proceedings of the 2017 IEEE 56th Annual Conference on Decision and Control (CDC)
- Page Range / eLocation ID:
- 139 to 144
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
. (Ed.)This work is concerned with the use of mobile sensors to approximate and replace the full state feedback controller by static output feedback controllers for a class of PDEs. Assuming the feedback operator associated with the full-state feedback controller admits a kernel representation, the proposed optimization aims to approximate the inner product of the kernel and the full state by a finite sum of weighted scalar outputs provided by the mobile sensors. When the full state feedback operator is time-dependent thus rendering its associated kernel time-varying, the approximation results in moving sensors with time-varying static gains. To calculate the velocity of the mobile sensors within the spatial domain the time-varying kernel is set equal to the sensor density and thus the solution to an associated advection PDE reveals the velocity field of the sensor network. To obtain the speed of the finite number of sensors, a domain decomposition based on a modification of the Centroidal Voronoi Tessellations (µ-CVT) is used to decompose the kernel into a finite number of cells, each of which contains a single sensor. A subsequent application of the µ-CVT on the velocity field provides the individual sensor speeds. The nature of this µ-CVT ensures collision avoidance by the very structure of the kernel decomposition into non-intersecting cells. Numerical simulations are provided to highlight the proposed sensor guidance.more » « less
-
Experimental and Analytical Prescribed-Time Trajectory Tracking Control of a 7-DOF Robot ManipulatorWe present an analytical design and experimental verification of trajectory tracking control of a 7-DOF robot manipulator, which achieves convergence of all tracking errors to the origin within a finite terminal time. A key feature of this control strategy is that this terminal convergence time is explicitly prescribed by the control designer, and is thus independent of the initial conditions of the tracking errors. In order to achieve this beneficial property of the proposed controller, a scaling of the state by a function of time that grows unbounded towards the terminal time is employed. Through Lyapunov analysis, we first demonstrate that the proposed controller achieves regulation of all tracking errors within the prescribed time as well as the uniform boundedness of the joint torques, even in the presence of a matched, non-vanishing disturbance. Then, through both simulation and experiment, we demonstrate that the proposed controller is capable of converging to the desired trajectory within the prescribed time, despite large initial conditions of the tracking errors and a sinusoidal disturbance being applied in each joint.more » « less
-
null (Ed.)We consider abstraction-based design of output-feedback controllers for dynamical systemswith a finite set of inputs and outputs against specifications in linear-time temporal logic. The usual procedure for abstraction-based controller design (ABCD) first constructs a finite-state abstraction of the underlying dynamical system, and second, uses reactive synthesis techniques to compute an abstract state-feedback controller on the abstraction. In this context, our contribution is two-fold: (I) we define a suitable relation between the original systemand its abstractionwhich characterizes the soundness and completeness conditions for an abstract state-feedback controller to be refined to a concrete output-feedback controller for the original system, and (II) we provide an algorithm to compute a sound finite-state abstraction fulfilling this relation. Our relation generalizes feedback-refinement relations fromABCD with state-feedback. Our algorithm for constructing sound finitestate abstractions is inspired by the simultaneous reachability and bisimulation minimization algorithm of Lee and Yannakakis. We lift their idea to the computation of an observation-equivalent system and show how sound abstractions can be obtained by stopping this algorithm at any point. Additionally, our new algorithm produces a realization of the topological closure of the input/output behavior of the original system if it is finite state realizable.more » « less
-
Abstract We present an analytical design and experimental verification of trajectory tracking control of a 7-DOF robot manipulator, which achieves convergence of all tracking errors to the origin within a finite terminal time, also referred to as the “settling time.” A key feature of this control strategy is that the settling time is explicitly assigned by the control designer to a value desired, or “prescribed” by the user and that the settling time is independent of the initial conditions and of the reference signal. In order to achieve this beneficial property with the controller, a scaling of the state by a function of time that grows unbounded toward the terminal time is employed. Through Lyapunov analysis, we first demonstrate that the proposed controller achieves regulation of all tracking errors within the prescribed time as well as the uniform boundedness of the joint torques, even in the presence of a matched, nonvanishing disturbance. Then, through both simulation and experiment, we demonstrate that the proposed controller is capable of converging to the desired trajectory within the prescribed time, despite large distance between the initial conditions and the reference trajectory, i.e., in spite of large initial tracking errors, and in spite of a sinusoidal disturbance being applied in each joint.more » « less
An official website of the United States government

