Multiagent teams have been shown to be effective in many domains that require coordination among team members. However, finding valuable joint-actions becomes increasingly difficult in tightly-coupled domains where each agent’s performance depends on the actions of many other agents. Reward shaping partially addresses this challenge by deriving more “tuned” rewards to provide agents with additional feedback, but this approach still relies on agents randomly discovering suitable joint-actions. In this work, we introduce Counterfactual Agent Suggestions (CAS) as a method for injecting knowledge into an agent’s learning process within the confines of existing reward structures. We show that CAS enables agent teams to converge towards desired behaviors more reliably. We also show that the improvement in team performance in the presence of suggestions extends to large teams and tightly-coupled domains.
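The abstract does not spell out the suggestion mechanism, so the sketch below only illustrates the underlying counterfactual idea in a tightly-coupled reward; all names (`counterfactual_suggestion_value`, `team_reward`) are hypothetical and not from the paper.

```python
def counterfactual_suggestion_value(team_reward, joint_action, agent_idx,
                                    suggested_action):
    """Score a suggestion for one agent as the counterfactual change in
    team reward if that agent alone had followed the suggestion while
    all teammates kept their actions fixed."""
    counterfactual = list(joint_action)
    counterfactual[agent_idx] = suggested_action
    return team_reward(counterfactual) - team_reward(joint_action)


# Toy tightly-coupled reward: the team scores only if every agent
# performs the coupled task simultaneously.
def team_reward(joint_action):
    return 1.0 if all(a == "observe" for a in joint_action) else 0.0


# Suggesting "observe" to the lone idle agent reveals a valuable
# joint-action that random exploration would rarely stumble on.
value = counterfactual_suggestion_value(
    team_reward, ["idle", "observe", "observe"], 0, "observe")
```

Under this toy reward, `value` is 1.0: the suggestion alone turns a zero-reward joint-action into the maximal one, which is exactly the kind of feedback that random discovery struggles to produce in tightly-coupled domains.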
Multi-level Fitness Critics for Cooperative Coevolution
In many multiagent domains, and particularly in tightly coupled domains, teasing apart an agent’s contribution to the system performance based on a single episodic return is difficult. This well-known difficulty hits state-to-action mapping approaches, such as neural networks trained by evolutionary algorithms, particularly hard. This paper introduces fitness critics, which leverage the expected fitness to evaluate an agent’s performance. This approach turns a sparse performance metric (policy evaluation) into a dense performance metric (state-action evaluation) by relating the episodic feedback to the state-action pairs experienced during the execution of that policy. In the tightly-coupled multi-rover domain (where multiple rovers must perform a particular task simultaneously), only teams using fitness critics demonstrated effective learning on tasks with tight coupling, while other coevolved teams were unable to learn at all.
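The core idea of relating episodic feedback to experienced state-action pairs can be sketched in tabular form. This is a simplified illustration, not the paper's implementation (which uses function approximators); the class name and interface are hypothetical.

```python
from collections import defaultdict


class FitnessCritic:
    """Tabular sketch of a fitness critic: maps each (state, action)
    pair to the running mean of the episodic fitness observed for
    trajectories that contained it."""

    def __init__(self):
        self.totals = defaultdict(float)
        self.counts = defaultdict(int)

    def update(self, trajectory, episodic_fitness):
        # Relate the single episodic return to every state-action
        # pair experienced while executing the policy.
        for sa in trajectory:
            self.totals[sa] += episodic_fitness
            self.counts[sa] += 1

    def evaluate(self, state, action):
        # Dense feedback: the expected fitness of one state-action
        # pair, rather than a single sparse end-of-episode scalar.
        sa = (state, action)
        if self.counts[sa] == 0:
            return 0.0
        return self.totals[sa] / self.counts[sa]
```

After a few episodes, `evaluate` estimates the expected fitness of each visited state-action pair, turning the sparse policy-level signal into a dense state-action-level one.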
- Award ID(s): 1815886
- PAR ID: 10197808
- Date Published:
- Journal Name: AAMAS Conference proceedings
- ISSN: 2523-5699
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- How to select between policies and value functions produced by different training algorithms in offline reinforcement learning (RL)—which is crucial for hyperparameter tuning—is an important open question. Existing approaches based on off-policy evaluation (OPE) often require additional function approximation and hence hyperparameters, creating a chicken-and-egg situation. In this paper, we design hyperparameter-free algorithms for policy selection based on BVFT [XJ21], a recent theoretical advance in value-function selection, and demonstrate their effectiveness in discrete-action benchmarks such as Atari. To address performance degradation due to poor critics in continuous-action domains, we further combine BVFT with OPE to get the best of both worlds, and obtain a hyperparameter-tuning method for Q-function based OPE with theoretical guarantees as a side product.
- Silva, S.; Paquete, L. (Eds.) Coevolving teams of agents promises effective solutions for many coordination tasks such as search and rescue missions or deep ocean exploration. Good team performance in such domains generally relies on agents discovering complex joint policies, which is particularly difficult when the fitness functions are sparse (where many joint policies return the same or even zero fitness values). In this paper, we introduce Novelty Seeking Multiagent Evolutionary Reinforcement Learning (NS-MERL), which enables agents to more efficiently explore their joint strategy space. The key insight of NS-MERL is to promote good exploratory behaviors for individual agents using a dense, novelty-based fitness function. Though the overall team-level performance is still evaluated via a sparse fitness function, agents using NS-MERL more efficiently explore their joint action space and more readily discover good joint policies. Our results in complex coordination tasks show that teams of agents trained with NS-MERL perform significantly better than agents trained solely with task-specific fitnesses.
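A common form of novelty-based fitness (used in novelty search generally; the abstract does not give NS-MERL's exact formulation) scores a behavior by its mean distance to the k nearest neighbors in an archive of previously seen behaviors. The function name and behavior encoding below are illustrative.

```python
def novelty_score(behavior, archive, k=3):
    """Dense, novelty-based fitness sketch: mean Euclidean distance
    from an agent's behavior characterization (a tuple of floats) to
    its k nearest neighbors in an archive of past behaviors."""
    if not archive:
        return 0.0

    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    # Unlike a sparse task fitness, this signal varies smoothly with
    # how different the behavior is from what has been seen before.
    distances = sorted(dist(behavior, other) for other in archive)
    nearest = distances[:k]
    return sum(nearest) / len(nearest)
```

A behavior close to the archive scores low while an unexplored one scores high, giving individual agents a dense gradient toward exploration even when the team-level fitness is flat.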
- In partially observable reinforcement learning, offline training gives access to latent information which is not available during online training and/or execution, such as the system state. Asymmetric actor-critic methods exploit such information by training a history-based policy via a state-based critic. However, many asymmetric methods lack theoretical foundation, and are only evaluated on limited domains. We examine the theory of asymmetric actor-critic methods which use state-based critics, and expose fundamental issues which undermine the validity of a common variant, and limit its ability to address partial observability. We propose an unbiased asymmetric actor-critic variant which is able to exploit state information while remaining theoretically sound, maintaining the validity of the policy gradient theorem, and introducing no bias and relatively low variance into the training process. An empirical evaluation performed on domains which exhibit significant partial observability confirms our analysis, demonstrating that unbiased asymmetric actor-critic converges to better policies and/or faster than symmetric and biased asymmetric baselines.
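The asymmetric structure (history-based actor, state-based critic) can be sketched with a tabular stand-in. All classes and interfaces here are hypothetical simplifications, not the paper's method, and this sketch does not address the bias issues the paper analyzes.

```python
class StateCritic:
    """Tabular state-value critic; conditions on the latent state
    that is available only during offline training."""
    def __init__(self):
        self.v = {}

    def value(self, state):
        return self.v.get(state, 0.0)

    def step(self, state, target, lr):
        self.v[state] = self.value(state) + lr * (target - self.value(state))


class HistoryPolicy:
    """Tabular action preferences conditioned on observation history,
    since the actor never sees the latent state at execution time."""
    def __init__(self):
        self.pref = {}

    def step(self, history, action, advantage, lr):
        key = (history, action)
        self.pref[key] = self.pref.get(key, 0.0) + lr * advantage


def asymmetric_update(policy, critic, transition, gamma=0.9, lr=0.1):
    """One asymmetric actor-critic step: the state-based critic
    supplies a TD advantage that updates the history-based actor."""
    history, state, action, reward, next_state = transition
    td_target = reward + gamma * critic.value(next_state)
    advantage = td_target - critic.value(state)
    policy.step(history, action, advantage, lr)
    critic.step(state, td_target, lr)
```

The point of the asymmetry is that the critic can use privileged state information for lower-variance value estimates, while the actor remains executable from observations alone.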
- During the 1990s and early 2000s, the affirmative action context in the United States changed. Affirmative action in higher education was banned in several states, and the Supreme Court ruled in Grutter (2003) that affirmative action, while constitutional, should be implemented via holistic evaluation of applicants. In this article, we use two datasets to examine how affirmative action context relates to academic outcomes at selective colleges and universities in the United States before and after the Grutter decision and in states with and without bans on affirmative action. Underrepresented minority students earned higher grades in the period after the Grutter decision than before it, indicating that the holistic evaluation method required by Grutter may enhance educational outcomes for these students. In contrast, we find no support for the idea, proposed by critics of the policy, that banning affirmative action leads to better collegiate outcomes for Black and Latino students at selective institutions.