LTL-Based Non-Markovian Inverse Reinforcement Learning

Afzal, M.; Gambhir, S.; Gupta, A.; Krishna, S.; Trivedi, A.; Velasquez, A.

Citation Details

The successes of reinforcement learning in recent years are underpinned by the characterization of suitable reward functions. However, in settings where such rewards are non-intuitive, difficult to define, or otherwise error-prone in their definition, it is useful to instead learn the reward signal from expert demonstrations. This is the crux of inverse reinforcement learning (IRL). While eliciting learning requirements in the form of scalar reward signals has been shown to be effective, such representations lack explainability and lead to opaque learning. We aim to mitigate this situation by presenting a novel IRL method for eliciting declarative learning requirements in the form of a popular formal logic---Linear Temporal Logic (LTL)---from a set of traces given by the expert policy. more »

Award ID(s):: 2146563

PAR ID:: 10419684

Author(s) / Creator(s):: Afzal, M.; Gambhir, S.; Gupta, A.; Krishna, S.; Trivedi, A.; Velasquez, A.

Date Published:: 2023-05-30

Journal Name:: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023)

Page Range / eLocation ID:: 2857–2859

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this