This content will become publicly available on May 19, 2026

Title: Learned Shields for Multi-Agent Reinforcement Learning
Shielding is an effective method for ensuring safety in multi-agent domains; however, its applicability has previously been limited to environments for which an approximate discrete model and safety specification are known in advance. We present a method for learning shields in cooperative, fully observable multi-agent environments where neither a model nor a safety specification is provided, using architectural constraints to realize several important properties of a shield. We show through a series of experiments that our learned shielding method significantly reduces safety violations while largely preserving the underlying reinforcement learning agent's ability to optimize for reward.
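The shielding pattern the abstract builds on can be illustrated with a minimal sketch: a shield intercepts each action an agent proposes and substitutes a safe fallback when the action would violate safety. All names here (is_safe, fallback_action, the toy grid-world predicate) are illustrative assumptions, not the paper's learned architecture.

```python
def is_safe(state, action):
    """Toy safety predicate: the agent must stay inside a 5x5 grid."""
    moves = {"up": (0, 1), "down": (0, -1), "left": (-1, 0), "right": (1, 0)}
    dx, dy = moves[action]
    x, y = state[0] + dx, state[1] + dy
    return 0 <= x < 5 and 0 <= y < 5

def fallback_action(state):
    """Return any action the safety predicate accepts (assumed to exist)."""
    for action in ("up", "down", "left", "right"):
        if is_safe(state, action):
            return action
    raise RuntimeError("no safe action available")

def shielded_step(state, proposed_action):
    """Pass safe actions through unchanged; override unsafe ones."""
    if is_safe(state, proposed_action):
        return proposed_action
    return fallback_action(state)

print(shielded_step((0, 0), "right"))  # safe: passes through unchanged
print(shielded_step((0, 0), "left"))   # unsafe: replaced by a safe action
```

In the learned setting the paper describes, the hand-written predicate above would instead be realized by a trained component; the interception pattern is the same.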
Award ID(s):
2319500
PAR ID:
10614500
Author(s) / Creator(s):
; ;
Publisher / Repository:
https://ala-workshop.github.io/
Date Published:
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  2. Multi-Agent Reinforcement Learning can be used to learn solutions for a wide variety of tasks, but there are few safety guarantees about the policies that the agents learn. My research addresses the challenge of ensuring safety in communication-free multi-agent environments, using shielding as the primary tool. We introduce methods to completely prevent safety violations in domains for which a model is available, in both fully observable and partially observable environments. We present ongoing research on maximizing safety in environments for which no model is available, utilizing a centralized training, decentralized execution framework, and discuss future lines of research. 
  3. While Deep Reinforcement Learning (DRL) has achieved remarkable success across various domains, it remains vulnerable to occasional catastrophic failures without additional safeguards. An effective solution to prevent these failures is to use a shield that validates and adjusts the agent's actions to ensure compliance with a provided set of safety specifications. For real-world robotic domains, it is essential to define safety specifications over continuous state and action spaces to accurately account for system dynamics and compute new actions that minimally deviate from the agent's original decision. In this paper, we present the first shielding approach specifically designed to ensure the satisfaction of safety requirements in continuous state and action spaces, making it suitable for practical robotic applications. Our method builds upon realizability, an essential property that confirms the shield will always be able to generate a safe action for any state in the environment. We formally prove that realizability can be verified for stateful shields, enabling the incorporation of non-Markovian safety requirements, such as loop avoidance. Finally, we demonstrate the effectiveness of our approach in ensuring safety without compromising the policy's success rate by applying it to a navigation problem and a multi-agent particle environment.
     Keywords: Shielding, Reinforcement Learning, Safety, Robotics
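The "minimal deviation" idea for continuous actions can be sketched as projecting the proposed action onto an interval of actions that keep the next state safe. The 1-D linear dynamics and bounds below are illustrative assumptions, not the verified construction from the paper.

```python
def safe_action_interval(x, dt=0.1, x_max=1.0):
    """For toy dynamics x' = x + dt * a with constraint |x'| <= x_max,
    return the interval of actions whose successor state is safe."""
    lo = (-x_max - x) / dt
    hi = (x_max - x) / dt
    return lo, hi

def shield(x, proposed_a):
    """Project the proposed action onto the safe interval: this is the
    minimally deviating safe action for an interval constraint."""
    lo, hi = safe_action_interval(x)
    return min(max(proposed_a, lo), hi)

print(shield(0.0, 1.0))   # already safe: returned unchanged
print(shield(0.95, 2.0))  # clipped so the next state stays within bounds
```

For interval constraints, clipping is exactly the closest safe action; richer safe sets would require a more general projection.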
  4. We introduce VELM, a reinforcement learning (RL) framework grounded in verification principles for safe exploration in unknown environments. VELM ensures that an RL agent systematically explores its environment, adhering to safety properties throughout the learning process. VELM learns environment models as symbolic formulas and conducts formal reachability analysis over the learned models for safety verification. An online shielding layer is then constructed to confine the RL agent’s exploration solely within a state space verified as safe in the learned model, thereby bolstering the overall safety profile of the RL system. Our experimental results demonstrate the efficacy of VELM across diverse RL environments, highlighting its capacity to significantly reduce safety violations in comparison to existing safe learning techniques, all without compromising the RL agent’s reward performance. 
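The reachability-based check that this style of shielding relies on can be sketched as follows: given a learned one-step model with a bounded prediction error, an action is admitted only if the entire over-approximated reachable set stays inside the safe region. The linear model and error bound are illustrative assumptions, not VELM's learned symbolic formulas.

```python
SAFE_LO, SAFE_HI = -1.0, 1.0
MODEL_ERR = 0.05  # assumed bound on the learned model's prediction error

def predict(x, a, dt=0.1):
    """Learned (here: hand-written) one-step dynamics model."""
    return x + dt * a

def action_verified_safe(x, a):
    """Over-approximate the reachable next states as an interval around
    the model's prediction, and admit the action only if that whole
    interval lies inside the safe region."""
    nxt = predict(x, a)
    lo, hi = nxt - MODEL_ERR, nxt + MODEL_ERR  # reachable interval
    return SAFE_LO <= lo and hi <= SAFE_HI

print(action_verified_safe(0.9, 0.4))  # True: reachable interval stays inside
print(action_verified_safe(0.9, 1.0))  # False: interval exits the safe region
```

Checking the whole interval rather than the point prediction is what makes the guarantee robust to the learned model's error, at the cost of some conservatism.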
  5. Learning safe solutions is an important but challenging problem in multi-agent reinforcement learning (MARL). Shielded reinforcement learning is one approach for preventing agents from choosing unsafe actions. Current shielded reinforcement learning methods for MARL make strong assumptions about communication and full observability. In this work, we extend the formalization of the shielded reinforcement learning problem to a decentralized multi-agent setting. We then present an algorithm for decomposition of a centralized shield, allowing shields to be used in such decentralized, communication-free environments. Our results show that agents equipped with decentralized shields perform comparably to agents with centralized shields in several tasks, allowing shielding to be used in environments with decentralized training and execution for the first time. 