
Title: Geometric Methods for Adjoint Systems

Adjoint systems are widely used to inform control, optimization, and design in systems described by ordinary differential equations or differential-algebraic equations. In this paper, we explore the geometric properties and develop methods for such adjoint systems. In particular, we utilize symplectic and presymplectic geometry to investigate the properties of adjoint systems associated with ordinary differential equations and differential-algebraic equations, respectively. We show that the adjoint variational quadratic conservation laws, which are key to adjoint sensitivity analysis, arise from (pre)symplecticity of such adjoint systems. We discuss various additional geometric properties of adjoint systems, such as symmetries and variational characterizations. For adjoint systems associated with a differential-algebraic equation, we relate the index of the differential-algebraic equation to the presymplectic constraint algorithm of Gotay et al. (J Math Phys 19(11):2388–2399, 1978). As an application of this geometric framework, we discuss how the adjoint variational quadratic conservation laws can be used to compute sensitivities of terminal or running cost functions. Furthermore, we develop structure-preserving numerical methods for such systems using Galerkin Hamiltonian variational integrators (Leok and Zhang in IMA J. Numer. Anal. 31(4):1497–1532, 2011) which admit discrete analogues of these quadratic conservation laws. We additionally show that such methods are natural, in the sense that reduction, forming the adjoint system, and discretization all commute, for suitable choices of these processes. We utilize this naturality to derive a variational error analysis result for the presymplectic variational integrator that we use to discretize the adjoint DAE system. Finally, we discuss the application of adjoint systems in the context of optimal control problems, where we prove a similar naturality result.
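The quadratic conservation law mentioned in the abstract can be illustrated numerically. For an ODE q' = f(q), the variational equation is δq' = Df(q) δq and the adjoint equation is p' = -Df(q)^T p, so the pairing ⟨p, δq⟩ is constant along solutions. The sketch below is only an illustration under stated assumptions: the vector field `f` (a damped pendulum), its Jacobian `Df`, the initial data, and the step size are all hypothetical choices, and the implicit midpoint rule is used as a simple stand-in for a symplectic scheme, not the Galerkin Hamiltonian variational integrators developed in the paper.

```python
import numpy as np

def f(q):
    # Hypothetical example vector field: damped pendulum, q = (angle, angular velocity).
    return np.array([q[1], -np.sin(q[0]) - 0.1 * q[1]])

def Df(q):
    # Jacobian of f at q.
    return np.array([[0.0, 1.0], [-np.cos(q[0]), -0.1]])

def midpoint_step(q, dq, p, h, iters=50):
    """One implicit midpoint step for the state q, the variation dq, and the adjoint p."""
    # Solve q_next = q + h * f((q + q_next) / 2) by fixed-point iteration.
    q_next = q + h * f(q)
    for _ in range(iters):
        q_next = q + h * f(0.5 * (q + q_next))
    # Linearize at the midpoint; both linear updates share this Jacobian.
    J = Df(0.5 * (q + q_next))
    I = np.eye(len(q))
    # Variational equation: dq' = J dq.
    dq_next = np.linalg.solve(I - 0.5 * h * J, (I + 0.5 * h * J) @ dq)
    # Adjoint equation: p' = -J^T p.
    p_next = np.linalg.solve(I + 0.5 * h * J.T, (I - 0.5 * h * J.T) @ p)
    return q_next, dq_next, p_next

def pairing_drift(steps=200, h=0.05):
    """Return the initial pairing <p, dq> and its largest deviation over the run."""
    q = np.array([1.0, 0.0])
    dq = np.array([0.3, -0.2])
    p = np.array([0.5, 1.5])
    c0 = p @ dq
    worst = 0.0
    for _ in range(steps):
        q, dq, p = midpoint_step(q, dq, p, h)
        worst = max(worst, abs(p @ dq - c0))
    return c0, worst

if __name__ == "__main__":
    c0, worst = pairing_drift()
    print("initial pairing:", c0)
    print("largest drift:", worst)
```

Because the variational and adjoint updates use the same midpoint Jacobian, the discrete pairing is preserved up to floating-point roundoff, even though the underlying ODE is dissipative and nonlinear. A non-symplectic scheme such as explicit Euler would generally not share this property.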

Publisher / Repository:
Springer Science + Business Media
Journal Name:
Journal of Nonlinear Science
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Modern control theory provides us with a spectrum of methods for studying the interconnection of dynamic systems using input-output properties of the interconnected subsystems. Perhaps the most advanced framework for such input-output analysis is the use of Integral Quadratic Constraints (IQCs), which considers the interconnection of a nominal linear system with an unmodelled nonlinear or uncertain subsystem with known input-output properties. Although these methods are widely used for Ordinary Differential Equations (ODEs), there have been fewer attempts to extend IQCs to infinite-dimensional systems. In this paper, we present an IQC-based framework for Partial Differential Equations (PDEs) and Delay Differential Equations (DDEs). First, we introduce infinite-dimensional signal spaces, operators, and feedback interconnections. Next, in the main result, we propose a formulation of hard IQC-based input-output stability conditions, allowing for infinite-dimensional multipliers. We then show how to test hard IQC conditions with infinite-dimensional multipliers on a nominal linear PDE or DDE system via the Partial Integral Equation (PIE) state-space representation using a sufficient version of the Kalman-Yakubovich-Popov lemma (KYP). The results are then illustrated using four example problems with uncertainty and nonlinearity.
  2. The spike variation technique plays a crucial role in deriving Pontryagin-type maximum principles of optimal controls for ordinary differential equations (ODEs), partial differential equations (PDEs), stochastic differential equations (SDEs), and (deterministic forward) Volterra integral equations (FVIEs), when the control domains are not assumed to be convex. It is natural to expect that such a technique could be extended to the case of (forward) stochastic Volterra integral equations (FSVIEs). However, by mimicking the case of SDEs, one encounters an essential difficulty of handling an involved quadratic term. To overcome this difficulty, we introduce an auxiliary process for which one can use Itô's formula, and develop new techniques inspired by stochastic linear-quadratic optimal control problems. Then the suitable representation of the above-mentioned quadratic form is obtained, and the second-order adjoint equations are derived. Consequently, the maximum principle of Pontryagin type is established. Some relevant extensions are investigated as well.
  3. Conservation laws are formulated for systems of differential equations by using symmetries and adjoint symmetries, and an application to systems of evolution equations is made, together with illustrative examples. The formulation does not require the existence of a Lagrangian for a given system, and the presented examples include computations of conserved densities for the heat equation, Burgers' equation and the Korteweg-de Vries equation.
  4. Modern cyber-physical systems (CPS) are often developed in a model-based development (MBD) paradigm. The MBD paradigm involves the construction of different kinds of models: (1) a plant model that encapsulates the physical components of the system (e.g., mechanical, electrical, chemical components) using representations based on differential and algebraic equations, (2) a controller model that encapsulates the embedded software components of the system, and (3) an environment model that encapsulates physical assumptions on the external environment of the CPS application. In order to reason about the correctness of CPS applications, we typically pose the following question: For all possible environment scenarios, does the closed-loop system consisting of the plant and the controller exhibit the desired behavior? Typically, the desired behavior is expressed in terms of properties that specify unsafe behaviors of the closed-loop system. Often, such behaviors are expressed using variants of real-time temporal logics. In this chapter, we will examine formal methods based on bounded-time reachability analysis, simulation-guided reachability analysis, deductive techniques based on safety invariants, and formal, requirement-driven testing techniques. We will review key results in the literature, and discuss the scalability and applicability of such systems to various academic and industrial contexts. We conclude this chapter by discussing the challenge to formal verification and testing techniques posed by newer CPS applications that use AI-based software components. 
  5. Abstract

    Wave front propagation with nontrivial bottom topography is studied within the formalism of hyperbolic long wave models. Evolution of nonsmooth initial data is examined, and, in particular, the splitting of singular points and their short time behavior is described. In the opposite limit of longer times, the local analysis of wave fronts is used to estimate the gradient catastrophe formation and how this is influenced by the topography. The limiting cases when the free surface intersects the bottom boundary, belonging to the so‐called “physical” and “nonphysical” vacuum classes, are examined. Solutions expressed by power series in the spatial variable lead to a hierarchy of ordinary differential equations for the time‐dependent series coefficients, which are shown to reveal basic differences between the two vacuum cases: for nonphysical vacuums, the equations of the hierarchy are recursive and linear past the first two pairs, whereas for physical vacuums, the hierarchy is nonrecursive, fully coupled, and nonlinear. The former case may admit solutions that are free of singularities for nonzero time intervals, whereas the latter is shown to develop nonstandard velocity shocks instantaneously. Polynomial bottom topographies simplify the hierarchy, as they contribute only a finite number of inhomogeneous forcing terms to the equations in the recursion relations. However, we show that truncation to finite‐dimensional systems and polynomial solutions is in general only possible for the case of a quadratic bottom profile. In this case, the system's evolution can reduce to, and is completely described by, a low‐dimensional dynamical system for the time‐dependent coefficients. This system encapsulates all the nonlinear properties of the solution for general power series initial data, and, in particular, governs the loss of regularity in finite times at the dry point. 
    For the special case of parabolic bottom topographies, an exact, self‐similar solution class is introduced and studied to illustrate via closed‐form expressions the general results.
