Continuous trajectory control of fixed-wing unmanned aerial vehicles (UAVs) is complicated when considering hidden dynamics. Due to UAV multi degrees of freedom, tracking methodologies based on conventional control theory, such as Proportional-Integral-Derivative (PID) has limitations in response time and adjustment robustness, while a model based approach that calculates the force and torques based on UAV’s current status is complicated and rigid.We present an actor-critic reinforcement learning framework that controls UAV trajectory through a set of desired waypoints. A deep neural network is constructed to learn the optimal tracking policy and reinforcement learning is developed to optimize the resulting tracking scheme. The experimental results show that our proposed approach can achieve 58.14% less position error, 21.77% less system power consumption and 9:23% faster attainment than the baseline. The actor network consists of only linear operations, hence Field Programmable Gate Arrays (FPGA) based hardware acceleration can easily be designed for energy efficient real-time control.
more »
« less
Autonomous Waypoint Planning, Optimal Trajectory Generation and Nonlinear Tracking Control for Multi-rotor UAVs
A framework for autonomous waypoint planning, trajectory generation through waypoints, and trajectory tracking for multi-rotor unmanned aerial vehicles (UAVs) is proposed in this work. Safe and effective operations of these UAVs is a problem that demands obstacle avoidance strategies
and advanced trajectory planning and control schemes for stability and energy efficiency. To address this problem, a two-level optimization strategy is used for trajectory generation, then the trajectory is tracked in a stable manner. The framework given here consists of the following components:
(a) a deep reinforcement learning (DRL)-based algorithm for optimal waypoint planning while minimizing control energy and avoiding obstacles in a given environment; (b) an optimal, smooth trajectory generation algorithm through waypoints, that minimizes a combinaton of velocity, acceleration, jerk and snap; and (c) a stable tracking control law that determines a control
thrust force for an UAV to track the generated trajectory.
more »
« less
- Award ID(s):
- 1739748
- NSF-PAR ID:
- 10112507
- Date Published:
- Journal Name:
- European Control Conference
- Page Range / eLocation ID:
- 2695 to 2700
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
This paper addresses the problem of generating a position trajectory with pointing direction constraints at given waypoints for underactuated unmanned vehicles. The problem is initially posed on the configuration space ℝ 3 × ℝ 2 and thereafter, upon suitable modifications, is re-posed as a problem on the Lie group SE(3). This is done by determining a vector orthogonal to the pointing direction and using it as the vehicle's thrust direction. This translates to converting reduced attitude constraints to full attitude constraints at the waypoints. For the position trajectory, in addition to position constraints, this modification adds acceleration constraints at the waypoints. For real-time implementation with low computational expenses, a linear-quadratic regulator (LQR) approach is adopted to determine the position trajectory with smoothness upto the fourth time derivative of position (snap). For the attitude trajectory, the thrust direction extracted from the position trajectory is used to first propagate the attitude to the subsequent waypoint and then correct it over time to achieve the desired attitude at this waypoint. Finally, numerical simulation results are presented to validate the trajectory generation scheme.more » « less
-
A discrete time, optimal trajectory planning scheme for position trajectory generation of a vehicle is given here, considering the mission duration as a free variable. The vehicle is actuated in three rotational degrees of freedom and one translational degree of freedom. This model is applicable to vehicles that have a body-fixed thrust vector direction for translational motion control, including fixed-wing and rotorcraft unmanned aerial vehicles (UAVs), unmanned underwater vehicles (UUVs) and spacecraft. The lightweight scheme proposed here generates the trajectory in inertial coordinates, and is intended for real time, on-the-go applications. The unspecified terminal time can be considered as an additional design parameter. This is done by deriving the optimality conditions in a discrete time setting, which results in the discrete transversality condition. The trajectory starts from an initial position and reaches a desired final position in an unspecified final time that ensures the cost on state and control is optimized. The trajectory generated by this scheme can be considered as the desired trajectory for a tracking control scheme. Numerical simulation results validate the performance of this trajectory generation scheme used in conjunction with a nonlinear tracking control scheme.more » « less
-
We consider the problem of trajectory planning for optimal relative orbit determination in the cislunar environment. The recent interest in cislunar space has created a need to develop autonomous tracking technologies that can maintain situational awareness of this dynamically complex regime. Optical sensors provide an ideal observation platform because of their low cost and versatility in tracking both cooperative and non-cooperative space objects. The estimation performance of an optical observer can be significantly enhanced through manuevering. This work develops a trajectory planning tool, compatible with low-thrust propulsion, for tracking one or multiple targets operating in proximity to the observer. We formulate an objective function that is a convex combination of the mutual information between target states and measurements, and the low-thrust control effort. The subsequent optimal control problem is solved via direct collocation using the successive convexification algorithm, which, we argue, is well suited for a potential onboard trajectory planning application. We demonstrate the tool for several relevant scenarios with one and multiple targets operating around periodic orbits in the circular restricted three-body problem. A sequential estimator's performance is evaluated using the Cramer-Rao lower bound and, compared to a purely passive observer, we show that optimizing the observer's trajectory can decrease this bound by up to several orders of magnitude within a planning window. This investigation provides an initial proof-of-concept to future onboard planning technologies for relative tracking in the cislunar domain.more » « less
-
Implicit Trajectory Planning for Feedback Linearizable Systems: A Time-varying Optimization ApproachWe develop an optimization-based framework for joint real-time trajectory planning and feedback control of feedback-linearizable systems. To achieve this goal, we define a target trajectory as the optimal solution of a time-varying optimization problem. In general, however, such trajectory may not be feasible due to , e.g., nonholonomic constraints. To solve this problem, we design a control law that generates feasible trajectories that asymptotically converge to the target trajectory. More precisely, for systems that are (dynamic) full-state linearizable, the proposed control law implicitly transforms the nonlinear system into an optimization algorithm of sufficiently high order. We prove global exponential convergence to the target trajectory for both the optimization algorithm and the original system. We illustrate the effectiveness of our proposed method on multi-target or multi-agent tracking problems with constraints.more » « less