Time-inconsistent stochastic optimal control problems and backward stochastic volterra integral equations
An optimal control problem is considered for a stochastic differential equation with the cost functional determined by a backward stochastic Volterra integral equation (BSVIE, for short). This kind of cost functional can cover the general discounting (including exponential and non-exponential) situations with a recursive feature. It is known that such a problem is time-inconsistent in general. Therefore, instead of finding a global optimal control, we look for a time-consistent locally near optimal equilibrium strategy. With the idea of multi-person differential games, a family of approximate equilibrium strategies is constructed associated with partitions of the time intervals. By sending the mesh size of the time interval partition to zero, an equilibrium Hamilton–Jacobi–Bellman (HJB, for short) equation is derived, through which the equilibrium value function and an equilibrium strategy are obtained. Under certain conditions, a verification theorem is proved and the well-posedness of the equilibrium HJB is established. As a sort of Feynman–Kac formula for the equilibrium HJB equation, a new class of BSVIEs (containing the diagonal value Z ( r , r ) of Z (⋅ , ⋅)) is naturally introduced and the well-posedness of such kind of equations is briefly presented.
- Editors:
- Buttazzo, G.; Casas, E.; de Teresa, L.; Glowinski, R.; Leugering, G.; Trélat, E.; Zhang, X.
- Award ID(s):
- 1812921
- Publication Date:
- NSF-PAR ID:
- 10220031
- Journal Name:
- ESAIM: Control, Optimisation and Calculus of Variations
- Volume:
- 27
- Page Range or eLocation-ID:
- 22
- ISSN:
- 1292-8119
- Sponsoring Org:
- National Science Foundation
More Like this
-
This paper studies an optimal stochastic impulse control problem in a finite time horizon with a decision lag, by which we mean that after an impulse is made, a fixed number units of time has to be elapsed before the next impulse is allowed to be made. The continuity of the value function is proved. A suitable version of dynamic programming principle is established, which takes into account the dependence of state process on the elapsed time. The corresponding Hamilton-Jacobi-Bellman (HJB) equation is derived, which exhibits some special feature of the problem. The value function of this optimal impulse controlmore »
-
In magnetic confinement fusion devices, the equilibrium configuration of a plasma is determined by the balance between the hydrostatic pressure in the fluid and the magnetic forces generated by an array of external coils and the plasma itself. The location of the plasma is not known a priori and must be obtained as the solution to a free boundary problem. The partial differential equation that determines the behavior of the combined magnetic field depends on a set of physical parameters (location of the coils, intensity of the electric currents going through them, magnetic permeability, etc.) that are subject to uncertaintymore »
-
This paper investigates optimal power management of a fuel cell hybrid small unmanned aerial vehicle (sUAV) from the perspective of endurance (time of flight) maximization in a stochastic environment. Stochastic drift counteraction optimal control is exploited to obtain an optimal policy for power management that coordinates the operation of the fuel cell and battery to maximize the expected flight time while accounting for the limits on the rate of change of fuel cell power output and the orientation dependence of fuel cell efficiency. The proposed power management strategy accounts for known statistics in transitions of propeller power and climb anglemore »
-
We consider particles obeying Langevin dynamics while being at known positions and having known velocities at the two end-points of a given interval. Their motion in phase space can be modeled as an Ornstein–Uhlenbeck process conditioned at the two end-points—a generalization of the Brownian bridge. Using standard ideas from stochastic optimal control we construct a stochastic differential equation (SDE) that generates such a bridge that agrees with the statistics of the conditioned process, as a degenerate diffusion. Higher order linear diffusions are also considered. In general, a time-varying drift is sufficient to modify the prior SDE and meet the end-pointmore »
-
Abstract We consider an optimal control problem where the state equations are a coupled hyperbolic–elliptic system. This system arises in elastodynamics with piezoelectric effects—the elastic stress tensor is a function of elastic displacement and electric potential. The electric flux acts as the control variable and bound constraints on the control are considered. We develop a complete analysis for the state equations and the control problem. The requisite regularity on the control, to show the well-posedness of the state equations, is enforced using the cost functional. We rigorously derive the first-order necessary and sufficient conditions using adjoint equations and further studymore »