Neural network approaches for parameterized optimal control

Verma, Deepanshu; Winovich, Nick; Ruthotto, Lars; van_Bloemen_Waanders, Bart

doi:10.3934/fods.2024042

Citation Details

Neural network approaches for parameterized optimal control

We consider numerical approaches for deterministic, finite-dimensional optimal control problems whose dynamics depend on unknown or uncertain parameters. We seek to amortize the solution over a set of relevant parameters in an offline stage to enable rapid decision-making and be able to react to changes in the parameter in the online stage. To tackle the curse of dimensionality arising when the state and/or parameter are high-dimensional, we represent the policy using neural networks. We compare two training paradigms: First, our model-based approach leverages the dynamics and definition of the objective function to learn the value function of the parameterized optimal control problem and obtain the policy using a feedback form. Second, we use actor-critic reinforcement learning to approximate the policy in a data-driven way. Using an example involving a two-dimensional convection-diffusion equation, which features high-dimensional state and parameter spaces, we investigate the accuracy and efficiency of both training paradigms. While both paradigms lead to a reasonable approximation of the policy, the model-based approach is more accurate and considerably reduces the number of PDE solves. more »

Award ID(s):: 2038118 1751636

PAR ID:: 10594480

Author(s) / Creator(s):: Verma, Deepanshu; Winovich, Nick; Ruthotto, Lars; van_Bloemen_Waanders, Bart

Publisher / Repository:: American Institute of Mathematical Sciences

Date Published:: 2025-01-01

Journal Name:: Foundations of Data Science

Volume:: 7

Issue:: 1

ISSN:: 2639-8001

Page Range / eLocation ID:: 363 to 385

Subject(s) / Keyword(s):: Hamilton-Jacobi-Bellman equation Reinforcement Learning high dimensional optimal control PDE constrained optimization neural networks policy optimization actor-critic

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.3934/fods.2024042

More Like this