Title: Learning True Objectives: Linear Algebraic Characterizations of Identifiability in Inverse Reinforcement Learning
Inverse reinforcement learning (IRL) has emerged as a powerful paradigm for extracting expert skills from observed behavior, with applications ranging from autonomous systems to human-robot interaction. However, the identifiability issue within IRL poses a significant challenge, as multiple reward functions can explain the same observed behavior. This paper provides a linear algebraic characterization of several identifiability notions for an entropy-regularized finite-horizon Markov decision process (MDP). Moreover, our approach allows for the seamless integration of prior knowledge, in the form of featurized reward functions, to enhance the identifiability of IRL problems. The results are demonstrated with experiments on a grid world environment.
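The abstract does not spell out the algebraic condition itself, but the flavor of such rank-based tests can be shown with a small, hedged sketch: for a linear reward r(s, a) = phi(s, a)^T theta, an entropy-regularized expert policy determines the reward only up to a per-state constant, so one natural check is whether the feature matrix keeps full column rank after those per-state-constant directions are projected out. The function name, shapes, and the specific condition below are assumptions for illustration, not the paper's exact characterization.

    # Illustrative sketch (not the paper's exact construction): with a linear
    # reward r(s, a) = phi(s, a)^T theta, the entropy-regularized expert policy
    # pins down the reward only up to a per-state constant, so one natural
    # identifiability test is a rank condition on the feature matrix after
    # projecting out those per-state-constant directions.
    import numpy as np

    def reward_identifiable_up_to_state_constants(Phi, n_states, n_actions, tol=1e-9):
        """Phi: (n_states * n_actions, d) feature matrix, rows ordered by (s, a)."""
        d = Phi.shape[1]
        # Projector that subtracts, within each state, the mean over actions;
        # its null space is exactly the set of per-state-constant reward directions.
        P = np.kron(np.eye(n_states),
                    np.eye(n_actions) - np.ones((n_actions, n_actions)) / n_actions)
        # theta is identifiable (modulo state constants) iff P @ Phi has full column rank.
        return np.linalg.matrix_rank(P @ Phi, tol=tol) == d

    # Toy check: full one-hot features over (s, a) pairs fail this test (d = S*A but
    # rank(P) = S*(A-1)), while a lower-dimensional random feature map typically passes.
    rng = np.random.default_rng(0)
    n_states, n_actions, d = 4, 3, 5
    Phi_random = rng.standard_normal((n_states * n_actions, d))
    print(reward_identifiable_up_to_state_constants(Phi_random, n_states, n_actions))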
Award ID(s):
1931982
PAR ID:
10568117
Author(s) / Creator(s):
Publisher / Repository:
Proceedings of the 6th Annual Learning for Dynamics & Control Conference, PMLR 242:1266-1277
Date Published:
Format(s):
Medium: X
Location:
https://proceedings.mlr.press/v242/shehab24a.html
Sponsoring Org:
National Science Foundation
More Like this
  1. We study the problem of inverse reinforcement learning (IRL), where the learning agent recovers a reward function using expert demonstrations. Most of the existing IRL techniques make the often unrealistic assumption that the agent has access to full information about the environment. We remove this assumption by developing an algorithm for IRL in partially observable Markov decision processes (POMDPs). The algorithm addresses several limitations of existing techniques that do not take the information asymmetry between the expert and the learner into account. First, it adopts causal entropy as the measure of the likelihood of the expert demonstrations as opposed to entropy in most existing IRL techniques, and avoids a common source of algorithmic complexity. Second, it incorporates task specifications expressed in temporal logic into IRL. Such specifications may be interpreted as side information available to the learner a priori in addition to the demonstrations and may reduce the information asymmetry. Nevertheless, the resulting formulation is still nonconvex due to the intrinsic nonconvexity of the so-called forward problem, i.e., computing an optimal policy given a reward function, in POMDPs. We address this nonconvexity through sequential convex programming and introduce several extensions to solve the forward problem in a scalable manner. This scalability allows computing policies that incorporate memory at the expense of added computational cost yet also outperform memoryless policies. We demonstrate that, even with severely limited data, the algorithm learns reward functions and policies that satisfy the task and induce a similar behavior to the expert by leveraging the side information and incorporating memory into the policy.
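The POMDP formulation above (causal entropy, temporal-logic side information, sequential convex programming) is too involved for a snippet; as a hedged illustration of the causal-entropy ingredient alone, the sketch below shows a maximum-causal-entropy IRL gradient step in the simpler, fully observable finite-horizon MDP setting with a linear reward. Every name and shape is illustrative rather than taken from the paper.

    # Hedged sketch: maximum-causal-entropy IRL gradient step for a fully
    # observable finite-horizon MDP with linear reward r(s, a) = phi(s, a)^T theta.
    import numpy as np

    def soft_policies(r, P, T):
        """Finite-horizon soft value iteration. r: (S, A), P: (A, S, S)."""
        S, A = r.shape
        V, pis = np.zeros(S), []
        for _ in range(T):
            Q = r + np.einsum('aij,j->ia', P, V)   # Q_t(s, a) with next-step value
            V = np.log(np.exp(Q).sum(axis=1))      # soft (log-sum-exp) backup
            pis.append(np.exp(Q - V[:, None]))     # causal-entropy-optimal policy
        return pis[::-1]                           # ordered pi_0, ..., pi_{T-1}

    def feature_expectations(pis, P, phi, mu0):
        """Expected feature counts under the time-varying policy. phi: (S, A, d)."""
        mu, out = mu0.copy(), np.zeros(phi.shape[-1])
        for pi in pis:
            out += np.einsum('s,sa,sad->d', mu, pi, phi)
            mu = np.einsum('s,sa,asj->j', mu, pi, P)   # propagate state distribution
        return out

    def mce_irl_step(theta, phi, P, mu0, T, expert_features, lr=0.1):
        r = phi @ theta                            # linear reward r(s, a)
        pis = soft_policies(r, P, T)
        # Gradient of the causal-entropy likelihood: match expert feature counts.
        grad = expert_features - feature_expectations(pis, P, phi, mu0)
        return theta + lr * grad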
  2. Many imitation learning (IL) algorithms use inverse reinforcement learning (IRL) to infer a reward function that aligns with the demonstration. However, the inferred reward functions often fail to capture the underlying task objectives. In this paper, we propose a novel framework for IRL-based IL that prioritizes task alignment over conventional data alignment. Our framework is a semi-supervised approach that leverages expert demonstrations as weak supervision to derive a set of candidate reward functions that align with the task rather than only with the data. It then adopts an adversarial mechanism to train a policy with this set of reward functions to gain a collective validation of the policy's ability to accomplish the task. We provide theoretical insights into this framework's ability to mitigate task-reward misalignment and present a practical implementation. Our experimental results show that our framework outperforms conventional IL baselines in complex and transfer learning scenarios. 
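As a hedged reading of the "collective validation" idea above, the toy sketch below scores a rollout against every candidate reward and trains on the worst case, so a policy improves only if it satisfies the entire candidate set; the names and the linear-reward assumption are illustrative, not the paper's implementation.

    # Hedged sketch of training against a set of candidate rewards rather than
    # a single inferred one: the policy's score is its worst-case return over
    # the candidate set.
    import numpy as np

    def worst_case_return(trajectory_features, candidate_thetas):
        """trajectory_features: (d,) summed features of one rollout.
        candidate_thetas: (k, d) candidate linear reward parameters."""
        returns = candidate_thetas @ trajectory_features  # return under each candidate
        return returns.min()                              # adversarial (worst-case) score

    # A policy-gradient method would then maximize this worst-case score, while an
    # adversary re-fits or re-weights the candidate set from the expert demonstrations.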
  3. Human gaze behavior prediction is important for behavioral vision and for computer vision applications. Most models mainly focus on predicting free-viewing behavior using saliency maps, but do not generalize to goal-directed behavior, such as when a person searches for a visual target object. We propose the first inverse reinforcement learning (IRL) model to learn the internal reward function and policy used by humans during visual search. We modeled the viewer's internal belief states as dynamic contextual belief maps of object locations. These maps were learned and then used to predict behavioral scanpaths for multiple target categories. To train and evaluate our IRL model we created COCO-Search18, which is now the largest dataset of high-quality search fixations in existence. COCO-Search18 has 10 participants searching for each of 18 target-object categories in 6202 images, making about 300,000 goal-directed fixations. When trained and evaluated on COCO-Search18, the IRL model outperformed baseline models in predicting search fixation scanpaths, both in terms of similarity to human search behavior and search efficiency. Finally, reward maps recovered by the IRL model reveal distinctive target-dependent patterns of object prioritization, which we interpret as a learned object context.
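The model above rolls out a learned policy over dynamic contextual belief maps; purely as a toy illustration of how a per-location reward or priority map can be read out as a fixation sequence, the sketch below picks the current maximum and suppresses visited locations (inhibition of return). It is not the authors' method.

    # Toy illustration only: greedy scanpath readout from a learned priority map.
    import numpy as np

    def greedy_scanpath(priority_map, n_fixations=6, suppress_radius=2):
        pm, path = priority_map.astype(float).copy(), []
        for _ in range(n_fixations):
            y, x = np.unravel_index(np.argmax(pm), pm.shape)  # most rewarding location
            path.append((int(y), int(x)))
            y0, y1 = max(0, y - suppress_radius), y + suppress_radius + 1
            x0, x1 = max(0, x - suppress_radius), x + suppress_radius + 1
            pm[y0:y1, x0:x1] = -np.inf                        # inhibition of return
        return path

    print(greedy_scanpath(np.random.default_rng(1).random((20, 30))))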
  4. Estimating the unknown reward functions driving agents' behavior is a central challenge in inverse games and reinforcement learning. This paper introduces a unified framework for reward function recovery in two-player zero-sum matrix games and Markov games with entropy regularization. Given observed player strategies and actions, we aim to reconstruct the underlying reward functions. This task is challenging due to the inherent ambiguity of inverse problems, the non-uniqueness of feasible rewards, and limited observational data coverage. To address these challenges, we establish reward function identifiability using the quantal response equilibrium (QRE) under linear assumptions. Building on this theoretical foundation, we propose an algorithm to learn reward from observed actions, designed to capture all plausible reward parameters by constructing confidence sets. Our algorithm works in both static and dynamic settings and is adaptable to incorporate other methods, such as Maximum Likelihood Estimation (MLE). We provide strong theoretical guarantees for the reliability and sample-efficiency of our algorithm. Empirical results demonstrate the framework’s effectiveness in accurately recovering reward functions across various scenarios, offering new insights into decision-making in competitive environments. 
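The paper above studies the inverse direction (recovering rewards from observed QRE strategies, with confidence sets); as a hedged sketch of the forward map that such identifiability arguments reason about, the code below computes a quantal response equilibrium of an entropy-regularized zero-sum matrix game with damped smoothed best responses, which typically converge for moderate regularization. All names and the solver choice are illustrative.

    # Hedged sketch: QRE of a zero-sum matrix game A via damped smoothed best
    # responses; eta plays the role of the inverse temperature (regularization).
    import numpy as np

    def softmax(z):
        z = z - z.max()
        e = np.exp(z)
        return e / e.sum()

    def zero_sum_qre(A, eta=1.0, step=0.1, iters=5000, tol=1e-10):
        """Row player maximizes x^T A y, column player minimizes."""
        m, n = A.shape
        x, y = np.ones(m) / m, np.ones(n) / n
        for _ in range(iters):
            x_new = (1 - step) * x + step * softmax(eta * (A @ y))
            y_new = (1 - step) * y + step * softmax(-eta * (A.T @ x))
            done = max(np.abs(x_new - x).max(), np.abs(y_new - y).max()) < tol
            x, y = x_new, y_new
            if done:
                break
        return x, y

    x, y = zero_sum_qre(np.array([[1.0, -1.0], [-1.0, 1.0]]))  # matching pennies
    print(x, y)  # uniform strategies, as expected for this symmetric game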
  5. This paper presents a framework to learn the reward function underlying high-level sequential tasks from demonstrations. The purpose of reward learning, in the context of learning from demonstration (LfD), is to generate policies that mimic the demonstrator's policies, thereby enabling imitation learning. We focus on a human-robot interaction (HRI) domain where the goal is to learn and model structured interactions between a human and a robot. Such interactions can be modeled as a partially observable Markov decision process (POMDP) where the partial observability is caused by uncertainties associated with the ways humans respond to different stimuli. The key challenge in finding a good policy in such a POMDP is determining the reward function being optimized by the demonstrator. Existing inverse reinforcement learning (IRL) methods for POMDPs are computationally very expensive and the problem is not well understood. In comparison, IRL algorithms for Markov decision processes (MDPs) are well defined and computationally efficient. We propose an approach of reward function learning for high-level sequential tasks from human demonstrations where the core idea is to reduce the underlying POMDP to an MDP and apply any efficient MDP-IRL algorithm. Our extensive experiments suggest that the reward function learned this way generates POMDP policies that mimic the policies of the demonstrator well.
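A hedged sketch of the core reduction idea in the item above: map POMDP histories to belief states with Bayes' rule, then treat the (possibly discretized) beliefs as MDP states so any standard MDP-IRL algorithm can be run on the transformed demonstrations. The tensors T and O and the function names are illustrative assumptions, not the paper's exact pipeline.

    # Hedged sketch: turn POMDP demonstrations into belief-state trajectories.
    import numpy as np

    def belief_update(b, a, o, T, O):
        """b: (S,) belief, a: action index, o: observation index.
        T[a, s, s']: transition probs, O[a, s', o]: observation probs."""
        b_pred = b @ T[a]             # predict: sum_s b(s) T(s' | s, a)
        b_new = b_pred * O[a][:, o]   # correct: weight by P(o | s', a)
        return b_new / b_new.sum()    # normalize

    def trajectory_to_belief_states(actions, observations, b0, T, O):
        """Map one POMDP demonstration to a sequence of belief 'states'."""
        beliefs, b = [b0], b0
        for a, o in zip(actions, observations):
            b = belief_update(b, a, o, T, O)
            beliefs.append(b)
        return beliefs                # hand these to an MDP-IRL routine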