Multiagent teams have been shown to be effective in many domains that require coordination among team members. However, finding valuable joint actions becomes increasingly difficult in tightly coupled domains where each agent's performance depends on the actions of many other agents. Reward shaping partially addresses this challenge by deriving more "tuned" rewards that give agents additional feedback, but this approach still relies on agents randomly discovering suitable joint actions. In this work, we introduce Counterfactual Agent Suggestions (CAS) as a method for injecting knowledge into an agent's learning process within the confines of existing reward structures. We show that CAS enables agent teams to converge towards desired behaviors more reliably. We also show that the improvement in team performance in the presence of suggestions extends to large teams and tightly coupled domains.
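As a rough illustration of the counterfactual flavor of such suggestions (not the paper's actual definition of CAS), the sketch below computes a difference-style signal by re-evaluating the team reward with the agent's action swapped for a suggested one. The helper names and the exact form of the counterfactual are assumptions.

```python
# Illustrative sketch only: one plausible way to score a suggestion with a
# counterfactual, difference-style term on top of an existing team reward G.
# team_reward and the counterfactual form are assumptions, not the paper's CAS.
from typing import Callable, Dict

def counterfactual_suggestion_signal(
    joint_action: Dict[str, int],
    agent_id: str,
    suggested_action: int,
    team_reward: Callable[[Dict[str, int]], float],
) -> float:
    """How much better the team would have done if this agent had followed
    the suggestion instead of its actual action."""
    counterfactual = dict(joint_action)
    counterfactual[agent_id] = suggested_action
    # Positive when the suggestion would have improved the team outcome,
    # nudging the agent toward the suggested joint behavior without
    # modifying the underlying reward structure.
    return team_reward(counterfactual) - team_reward(joint_action)
```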
Gaussian Processes as Multiagent Reward Models
In multiagent problems that require complex joint actions, reward shaping methods yield good behavior by incentivizing the agents' potentially valuable actions. However, reward shaping often requires access to the functional form of the reward function and the global state of the system. In this work, we introduce the Exploratory Gaussian Reward (EGR), a new reward model that creates optimistic stepping-stone rewards linking an agent's potentially good actions to the desired joint action. EGR models the system reward as a Gaussian process, leveraging the inherent uncertainty in the reward estimates to push agents to explore unobserved regions of the state space. In the tightly coupled rover coordination problem, we show that EGR significantly outperforms a neural network approximation baseline and is comparable to a system with access to the functional form of the global reward. Finally, we demonstrate how EGR improves performance over other reward shaping methods by forcing agents to explore and escape local optima.
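A minimal sketch of an optimistic GP-based reward model in the spirit of EGR is shown below. The UCB-style exploration bonus (predicted mean plus scaled standard deviation), the RBF kernel, and the scikit-learn implementation are assumptions rather than the paper's exact formulation.

```python
# Minimal sketch of an optimistic Gaussian-process reward model.
# The mean + beta * std bonus and the RBF kernel are illustrative assumptions.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

class OptimisticGPReward:
    def __init__(self, beta: float = 2.0):
        self.beta = beta  # weight on predictive uncertainty (exploration)
        self.gp = GaussianProcessRegressor(kernel=RBF(length_scale=1.0),
                                           normalize_y=True)

    def fit(self, states: np.ndarray, observed_rewards: np.ndarray) -> None:
        # states: (n_samples, state_dim) visited states with observed rewards
        self.gp.fit(states, observed_rewards)

    def reward(self, state: np.ndarray) -> float:
        # Optimistic estimate: unvisited regions have high predictive std,
        # so agents are rewarded for exploring them.
        mean, std = self.gp.predict(state.reshape(1, -1), return_std=True)
        return float(mean[0] + self.beta * std[0])
```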
- Award ID(s): 1815886
- PAR ID: 10197809
- Date Published:
- Journal Name: AAMAS Conference proceedings
- ISSN: 2523-5699
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
Interactive reinforcement learning (IRL) agents use human feedback or instruction to help them learn in complex environments. Often, this feedback comes in the form of a discrete signal that is either positive or negative. While informative, such a signal can be difficult to generalize on its own. In this work, we explore how natural language advice can provide a richer feedback signal to a reinforcement learning agent by extending policy shaping, a well-known IRL technique. Policy shaping typically employs a human feedback policy to help an agent learn how to achieve its goal; here, we replace this human feedback policy with a policy generated from natural language advice. We aim to examine whether generated natural language reasoning can support a deep RL agent in choosing its actions successfully in a given environment. Our model therefore consists of three networks: an experience-driven network, an advice generator, and an advice-driven network. While the experience-driven RL agent chooses its actions based on the environmental reward, the advice-driven network, using the feedback produced by the advice generator for each new state, selects actions that assist the RL agent through improved policy shaping.
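For concreteness, the sketch below shows the standard policy-shaping combination rule (multiplying the agent's action distribution by a feedback distribution and renormalizing). How the advice generator and advice-driven network turn language into that distribution is abstracted away here and replaced with an already-computed probability vector.

```python
# Minimal sketch of the standard policy-shaping combination rule.
# The advice distribution is assumed to be given; in the described model it
# would come from the advice-driven network conditioned on generated advice.
import numpy as np

def shaped_policy(agent_probs: np.ndarray, advice_probs: np.ndarray) -> np.ndarray:
    """Combine the experience-driven policy with the advice-driven policy."""
    combined = agent_probs * advice_probs        # elementwise product
    total = combined.sum()
    if total == 0.0:                             # advice contradicts everything
        return agent_probs                       # fall back to the agent's policy
    return combined / total                      # renormalize to a distribution

# Example: advice strongly favors action 2
agent_probs = np.array([0.25, 0.25, 0.25, 0.25])
advice_probs = np.array([0.05, 0.05, 0.80, 0.10])
print(shaped_policy(agent_probs, advice_probs))
```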
Coevolving teams of agents promises effective solutions for many coordination tasks such as search and rescue missions or deep ocean exploration. Good team performance in such domains generally relies on agents discovering complex joint policies, which is particularly difficult when the fitness functions are sparse (many joint policies return the same, or even zero, fitness value). In this paper, we introduce Novelty Seeking Multiagent Evolutionary Reinforcement Learning (NS-MERL), which enables agents to more efficiently explore their joint strategy space. The key insight of NS-MERL is to promote good exploratory behaviors for individual agents using a dense, novelty-based fitness function. Though the overall team-level performance is still evaluated via a sparse fitness function, agents using NS-MERL more efficiently explore their joint action space and more readily discover good joint policies. Our results in complex coordination tasks show that teams of agents trained with NS-MERL perform significantly better than agents trained solely with task-specific fitnesses.
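The dense, novelty-based fitness could resemble the classic novelty-search score sketched below: the average distance from a behavior descriptor to its k nearest neighbors in an archive of previously seen behaviors. The descriptor, distance metric, and value of k are assumptions, not details taken from the paper.

```python
# Minimal sketch of a novelty score over behavior descriptors.
# The behavior representation and k are illustrative assumptions.
import numpy as np

def novelty_score(behavior: np.ndarray, archive: np.ndarray, k: int = 15) -> float:
    """Higher when the behavior is far from anything seen before."""
    if len(archive) == 0:
        return float("inf")  # everything is novel at the start
    dists = np.linalg.norm(archive - behavior, axis=1)
    nearest = np.sort(dists)[: min(k, len(dists))]
    return float(nearest.mean())
```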
Learning in tightly coupled multiagent settings with sparse rewards is challenging because multiple agents must reach the goal state simultaneously for the team to receive a reward. This is even more challenging under temporal coupling constraints, where agents must complete different components of a task sequentially, in a particular order. Here, a single local reward is inadequate for learning an effective policy. We introduce MADyS, Multiagent Learning via Dynamic Skill Selection, a bi-level optimization framework that learns to dynamically switch between multiple local skills to optimize sparse team objectives. MADyS uses fast policy gradients to learn local skills from local rewards and an evolutionary algorithm to optimize the sparse team objective by recruiting the most suitable skill at any given time. This eliminates the need to generate a single dense reward via reward shaping or other mixing functions. In environments with both spatial and temporal coupling requirements, MADyS outperforms prior methods and provides intuitive visualizations of its skill-switching strategy.
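The sketch below is a heavily simplified, assumption-laden illustration of the bi-level idea: an evolutionary outer loop chooses which pre-trained local skill each agent runs, scored only by the sparse team fitness, while the policy-gradient training of the skills themselves is left as a stub. The real method switches skills dynamically within an episode; this sketch reduces that to a static assignment for brevity, and all names (evaluate_team, population sizes, mutation scheme) are illustrative.

```python
# High-level, illustrative outer loop: evolve per-agent skill assignments
# against a sparse team fitness. Skills are assumed to be trained separately
# with policy gradients on dense local rewards.
import random

N_AGENTS, N_SKILLS, POP_SIZE, GENERATIONS = 4, 3, 20, 100

def evaluate_team(skill_assignment):
    """Placeholder rollout: run the episode with each agent executing its
    assigned skill and return the sparse team fitness."""
    # Toy stand-in so the loop runs; a real system would simulate the task.
    return sum(1.0 for s in skill_assignment if s == 0)

def mutate(assignment):
    child = list(assignment)
    child[random.randrange(N_AGENTS)] = random.randrange(N_SKILLS)
    return child

population = [[random.randrange(N_SKILLS) for _ in range(N_AGENTS)]
              for _ in range(POP_SIZE)]
for _ in range(GENERATIONS):
    scored = sorted(population, key=evaluate_team, reverse=True)
    elites = scored[: POP_SIZE // 2]                  # keep the best assignments
    population = elites + [mutate(random.choice(elites))
                           for _ in range(POP_SIZE - len(elites))]
```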
In distributed optimization schemes consisting of a group of agents connected to a central coordinator, the optimization algorithm often involves the agents solving private local sub-problems and exchanging data frequently with the coordinator to solve the global distributed problem. In those cases, the query-response mechanism usually imposes excessive communication costs on the system, necessitating communication reduction in scenarios where communication is costly. Integrating Gaussian processes (GP) as a learning component into the Alternating Direction Method of Multipliers (ADMM) has proven effective in learning each agent's local proximal operator to reduce the required communication exchange. A key element of integrating GP into the ADMM algorithm is the querying mechanism by which the coordinator decides when communication with an agent is required. In this paper, we formulate a general querying decision framework as an optimization problem that balances reducing the communication cost and decreasing the prediction error. Under this framework, we propose a joint query strategy that takes into account the joint statistics of the query and ADMM variables and the total communication cost of all agents in the presence of uncertainty caused by the GP regression. In addition, we derive three different decision mechanisms that simplify the general framework by making the communication decision for each agent individually. We integrate multiple measures to quantify the trade-off between the communication cost reduction and the optimization solution's accuracy/optimality. The proposed methods can achieve significant communication reduction and good optimization solution accuracy for distributed optimization, as demonstrated by extensive simulations of a distributed sharing problem.
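One plausible, deliberately simplified querying rule is sketched below: the coordinator keeps a GP model of each agent's response and queries the agent only when the GP's predictive uncertainty, weighted against the communication cost, is too large. The specific trade-off rule and parameter names are assumptions, not the paper's formulation.

```python
# Minimal sketch of a per-agent query decision under a fitted GP model of the
# agent's local response. The uncertainty-vs-cost rule is an assumption.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def should_query(gp: GaussianProcessRegressor,
                 admm_state: np.ndarray,
                 comm_cost: float,
                 risk_weight: float = 1.0) -> bool:
    """Query the agent only when the expected prediction error is too costly;
    otherwise the coordinator substitutes the GP prediction."""
    _, std = gp.predict(admm_state.reshape(1, -1), return_std=True)
    return risk_weight * float(std[0]) > comm_cost
```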