This content will become publicly available on July 21, 2026

Deductive Synthesis of Reinforcement Learning Agents for Infinite Horizon Tasks
Abstract: We propose a deductive synthesis framework for constructing reinforcement learning (RL) agents that provably satisfy temporal reach-avoid specifications over infinite horizons. Our approach decomposes these temporal specifications into a sequence of finite-horizon subtasks, for which we synthesize individual RL policies. Using formal verification techniques, we ensure that the composition of a finite number of subtask policies guarantees satisfaction of the overall specification over infinite horizons. Experimental results on a suite of benchmarks show that our synthesized agents outperform standard RL methods in both task performance and compliance with safety and temporal requirements.
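To make the compositional execution concrete, here is a minimal sketch under assumed interfaces, not the paper's implementation: each finite-horizon subtask carries a trained policy, a goal test, and a verified horizon bound, and the agent hands control from one subtask policy to the next, cycling forever to cover the infinite horizon. The names `env`, `Subtask`, `unsafe`, and `reached` are hypothetical.

```python
class Subtask:
    """A finite-horizon subtask: its trained policy, goal test, and verified horizon bound."""
    def __init__(self, policy, reached, horizon):
        self.policy = policy      # maps a state to an action
        self.reached = reached    # predicate: is the state inside this subtask's goal region?
        self.horizon = horizon    # assumed verified upper bound on steps needed

def run_composed(env, subtasks, unsafe):
    """Execute subtask policies in sequence, cycling through them forever."""
    state = env.reset()
    i = 0
    while True:                                    # infinite-horizon execution
        task = subtasks[i]
        for _ in range(task.horizon):              # finite-horizon subtask
            if unsafe(state):
                raise RuntimeError("avoid set entered")
            if task.reached(state):
                break                              # subgoal reached within the horizon
            state = env.step(task.policy(state))   # assumed: step returns the next state
        else:
            raise RuntimeError("horizon exhausted before the subgoal")
        i = (i + 1) % len(subtasks)                # hand off to the next subtask policy
```

The two error branches stand in for exactly what the verification step is meant to rule out: entering the avoid set or exhausting a subtask's horizon before its subgoal is reached.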
- PAR ID: 10593428
- Publisher / Repository: 37th International Conference on Computer Aided Verification (CAV)
- Date Published:
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- Goal-conditioned reinforcement learning (RL) is a powerful approach for learning general-purpose skills by reaching diverse goals. However, it has limitations when it comes to task-conditioned policies, where goals are specified by temporally extended instructions written in the Linear Temporal Logic (LTL) formal language. Existing approaches for finding LTL-satisfying policies rely on sampling a large set of LTL instructions during training to adapt to unseen tasks at inference time. However, these approaches do not guarantee generalization to out-of-distribution LTL objectives, which may have increased complexity. In this paper, we propose a novel approach to address this challenge. We show that simple goal-conditioned RL agents can be instructed to follow arbitrary LTL specifications without additional training over the LTL task space. Unlike existing approaches that focus on LTL specifications expressible as regular expressions, our technique is unrestricted and generalizes to ω-regular expressions. Experimental results demonstrate the effectiveness of our approach in adapting goal-conditioned RL agents to satisfy complex temporal logic task specifications zero-shot. (A minimal illustrative sketch of this idea appears after this list.)
- (Endriss, Ulle; Melo, Francisco, Eds.) Alternating-time temporal logic (ATL) extends branching-time logic by enabling quantification over paths that result from the strategic choices made by multiple agents in various coalitions within the system. While classical temporal logics express properties of “closed” systems, ATL can express properties of “open” systems resulting from interactions among several agents. Reinforcement learning (RL) is a sampling-based approach to decision-making in which learning agents, guided by a scalar reward function, discover optimal policies through repeated interactions with the environment. The challenge of translating high-level objectives into scalar rewards for RL has garnered increased interest, particularly following the success of model-free RL algorithms. This paper presents an approach for deploying model-free RL to verify multi-agent systems against ATL specifications. Its key contribution is a Q-learning-based verification procedure for quantitative, non-nested classical ATL properties, demonstrated on a natural subclass of non-nested ATL formulas. (A minimal illustrative sketch appears after this list.)
- Translating Omega-Regular Specifications to Average Objectives for Model-Free Reinforcement Learning (Piotr Faliszewski; Viviana Mascardi, Eds.): Recent success in reinforcement learning (RL) has brought renewed attention to the design of reward functions by which agent behavior is reinforced or deterred. Manually designing reward functions is tedious and error-prone. An alternative approach is to specify a formal, unambiguous logic requirement that is automatically translated into a reward function for learning. Omega-regular languages, which subsume those expressible in Linear Temporal Logic (LTL), are a natural choice for specifying such requirements due to their use in verification and synthesis. However, current techniques based on omega-regular languages learn in an episodic manner whereby the environment is periodically reset to an initial state during learning. In some settings, this assumption is challenging or impossible to satisfy. Instead, in the continuing setting the agent explores the environment without resets over a single lifetime. This is a more natural setting for reasoning about omega-regular specifications defined over infinite traces of agent behavior. Optimizing the average reward rather than the usual discounted reward is also more natural here, since the infinite-horizon objective poses challenges to the convergence of discounted RL solutions. We restrict our attention to the omega-regular languages that correspond to absolute liveness specifications. These specifications cannot be invalidated by any finite prefix of agent behavior, in accordance with the spirit of a continuing problem. We propose a translation from absolute liveness omega-regular languages to an average-reward objective for RL. Our reduction can be done on the fly, without full knowledge of the environment, thereby enabling the use of model-free RL algorithms. Additionally, we propose a reward structure that enables RL without episodic resetting in communicating MDPs, unlike previous approaches. We demonstrate empirically on various benchmarks that our proposed method of using average-reward RL for continuing tasks defined by omega-regular specifications is more effective than competing approaches that leverage discounted RL. (A minimal illustrative sketch appears after this list.)
- This work addresses the problem of learning optimal control policies for a multi-agent system in an adversarial environment. Specifically, we focus on multi-agent systems where the mission objectives are expressed as signal temporal logic (STL) specifications. The agents are classified as either defensive or adversarial. The defensive agents are maximizers, meaning they maximize an objective function that enforces the STL specification; the adversarial agents, on the other hand, are minimizers. The interaction among the agents is modeled as a finite-state team stochastic game with an unknown transition probability function. The synthesis objective is to determine optimal control policies for the defensive agents that implement the STL specification against the best responses of the adversarial agents. A multi-agent deep Q-learning algorithm, an extension of the minimax Q-learning algorithm, is then proposed to learn the optimal policies. The effectiveness of the proposed approach is illustrated through a simulation case study. (A minimal illustrative sketch appears after this list.)
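A minimal sketch of the idea in the first related item above, under strong simplifying assumptions: a pre-trained goal-conditioned policy is steered by a hand-coded two-state automaton for a toy ω-regular task ("visit region A, then region B, forever"). The names `env`, `goal_policy`, and `in_region` are illustrative interfaces, not the paper's API.

```python
# Minimal sketch, not the paper's algorithm: a goal-conditioned policy pi(state, goal)
# is driven through the accepting cycle of a hand-coded automaton.
AUTOMATON = {
    # automaton state -> (goal region the policy should pursue, successor state)
    "q0": ("region_A", "q1"),
    "q1": ("region_B", "q0"),   # cycling q0 -> q1 -> q0 -> ... satisfies the toy task
}

def follow_task(env, goal_policy, in_region, max_steps=10_000):
    q, state = "q0", env.reset()
    for _ in range(max_steps):
        goal, successor = AUTOMATON[q]
        state = env.step(goal_policy(state, goal))  # act toward the current subgoal
        if in_region(state, goal):                  # subgoal achieved ...
            q = successor                           # ... so advance the automaton
    return q, state
```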
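For the second related item, the sketch below illustrates one way model-free learning can estimate the value of a non-nested ATL reachability property ⟨⟨A⟩⟩F goal: tabular Q-learning over joint actions, with the coalition maximizing and its complement minimizing. Taking a max-min over pure strategies is a simplification of minimax Q-learning, and `game`, `coalition_acts`, `opponent_acts`, and `is_goal` are assumed interfaces; this is not the paper's verification procedure.

```python
import random
from collections import defaultdict

def coalition_reach_value(game, coalition_acts, opponent_acts, is_goal,
                          alpha=0.1, gamma=0.99, episodes=10_000):
    """Estimate the value the coalition can guarantee for 'eventually goal'."""
    Q = defaultdict(float)                          # Q[(state, a, b)] over joint actions
    for _ in range(episodes):
        s, done = game.reset(), False
        while not done:
            a = random.choice(coalition_acts)       # exploratory joint action
            b = random.choice(opponent_acts)
            s2, done = game.step(a, b)              # assumed interface
            r = 1.0 if is_goal(s2) else 0.0         # reward only on reaching the goal
            best = 0.0 if done else max(
                min(Q[(s2, a2, b2)] for b2 in opponent_acts) for a2 in coalition_acts)
            Q[(s, a, b)] += alpha * (r + gamma * best - Q[(s, a, b)])
            s = s2
    # max_a min_b Q[(s0, a, b)] at the initial state estimates the coalition's value,
    # which can then be compared against a quantitative threshold p.
    return Q
```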
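For the third related item, the sketch below shows a standard average-reward update (differential Q-learning) in a continuing, reset-free loop, with the reward assumed to be 1 whenever the automaton tracking the specification takes an accepting transition. The paper's actual translation and reward structure are not reproduced; `env` and `actions` are assumed interfaces.

```python
import random
from collections import defaultdict

def differential_q_learning(env, actions, alpha=0.1, eta=0.1, eps=0.1, steps=100_000):
    """Continuing (reset-free) average-reward Q-learning over a single lifetime."""
    Q = defaultdict(float)       # differential action values
    avg = 0.0                    # running estimate of the long-run average reward
    s = env.reset()              # reset is called only once: no episodes
    for _ in range(steps):
        if random.random() < eps:
            a = random.choice(actions)
        else:
            a = max(actions, key=lambda x: Q[(s, x)])
        s2, r = env.step(a)      # assumed: r = 1 on an accepting transition, else 0
        delta = r - avg + max(Q[(s2, x)] for x in actions) - Q[(s, a)]
        Q[(s, a)] += alpha * delta
        avg += eta * alpha * delta   # differential Q-learning's average-reward update
        s = s2
    return Q, avg
```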
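For the last related item, the key ingredient is a scalar objective derived from the STL specification that defensive agents maximize and adversarial agents minimize. The sketch below computes the standard quantitative robustness of a simple reach-avoid formula over a finite trace; `goal_margin` and `obstacle_margin` are illustrative signed-distance functions, and the minimax deep Q-learning algorithm itself is not reproduced.

```python
def robustness_reach_avoid(trace, goal_margin, obstacle_margin):
    """Quantitative robustness of (F goal) AND (G not obstacle) over a finite trace.
    goal_margin / obstacle_margin return a signed distance that is positive inside
    the respective region (illustrative predicates)."""
    reach = max(goal_margin(s) for s in trace)        # eventually: best margin over time
    avoid = min(-obstacle_margin(s) for s in trace)   # always-not: worst margin over time
    return min(reach, avoid)                          # conjunction: the weaker of the two
```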