Instructing Goal-Conditioned Reinforcement Learning Agents with Temporal Logic Objectives

Qiu, Wenjie; Mao, Wensen; Zhu, He

Citation Details

Goal-conditioned reinforcement learning (RL) is a powerful approach for learning general-purpose skills by reaching diverse goals. However, it has limitations when it comes to task-conditioned policies, where goals are specified by temporally extended instructions written in the Linear Temporal Logic (LTL) formal language. Existing approaches for finding LTL-satisfying policies rely on sampling a large set of LTL instructions during training to adapt to unseen tasks at inference time. However, these approaches do not guarantee generalization to out-of-distribution LTL objectives, which may have increased complexity. In this paper, we propose a novel approach to address this challenge. We show that simple goal-conditioned RL agents can be instructed to follow arbitrary LTL specifications without additional training over the LTL task space. Unlike existing approaches that focus on LTL specifications expressible as regular expressions, our technique is unrestricted and generalizes to ω-regular expressions. Experiment results demonstrate the effectiveness of our approach in adapting goal-conditioned RL agents to satisfy complex temporal logic task specifications zero-shot. more »

Award ID(s):: 2007799 2124155

PAR ID:: 10511032

Author(s) / Creator(s):: Qiu, Wenjie; Mao, Wensen; Zhu, He

Publisher / Repository:: NeurIPS Proceedings

Date Published:: 2023-12-10

Journal Name:: Advances in Neural Information Processing Systems (NeurIPS)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this