Min-Max Entropy Inverse RL of Multiple Tasks

Arora, Saurabh; Doshi, Prashant; Banerjee, Bikramjit

Citation Details

Multi-task IRL recognizes that expert(s) could be switching between multiple ways of solving the same problem, or interleaving demonstrations of multiple tasks. The learner aims to learn the reward functions that individually guide these distinct ways. We present a new method for multi-task IRL that generalizes the well-known maximum entropy approach by combining it with a Dirichlet process based minimum entropy clustering of the observed data. This yields a single nonlinear optimization problem, called MinMaxEnt Multi-task IRL (MME-MTIRL), which can be solved using the Lagrangian relaxation and gradient descent methods. We evaluate MME- MTIRL on the robotic task of sorting onions on a processing line where the expert utilizes multiple ways of detecting and removing blemished onions. The method is able to learn the underlying reward functions to a high level of accuracy and it improves on the previous approaches. more »

Award ID(s):: 1830421

PAR ID:: 10293575

Author(s) / Creator(s):: Arora, Saurabh; Doshi, Prashant; Banerjee, Bikramjit

Date Published:: 2021-07-01

Journal Name:: IEEE International Conference on Robotics and Automation

ISSN:: 2379-9544

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this