Realizing a multiagent system involves implementing member agents who interact based on a protocol while making decisions in a decentralized manner. Current programming models for agents offer poor abstractions for decision making and fail to adequately bridge an agent’s internal decision logic with its public decisions. We present Kiko, a protocol-based programming model for agents. To implement an agent, a programmer writes one or more decision makers, each of which chooses from among a set of valid decisions and makes mutually compatible decisions on what messages to send. By completely abstracting away the underlying communication service and by supporting practical decision-making patterns, Kiko enables agent developers to focus on business logic. We provide an operational semantics for Kiko and establish that Kiko agents are protocol compliant and able to realize any protocol enactment.
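The Kiko abstract above centers on decision makers that choose mutually compatible decisions from a protocol-valid set. As a loose illustration only, here is a minimal Python sketch of that shape; the Decision type and the select() hook are my assumptions, not Kiko's actual API.

```python
# Hypothetical sketch of a decision maker in the style the Kiko abstract
# describes. The Decision type and select() hook are illustrative
# assumptions, not Kiko's actual API.
from dataclasses import dataclass

@dataclass(frozen=True)
class Decision:
    message: str  # message type permitted by the protocol
    price: float  # example payload field

class PriceQuoter:
    """Chooses mutually compatible decisions from the protocol-valid set,
    leaving transport and protocol bookkeeping to the runtime."""

    def select(self, valid: set[Decision]) -> set[Decision]:
        # Business logic only: respond to a quoting opportunity
        # with the single lowest-price quote.
        quotes = [d for d in valid if d.message == "Quote"]
        return {min(quotes, key=lambda d: d.price)} if quotes else set()
```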
Formal Ethical Obligations in Reinforcement Learning Agents: Verification and Policy Updates
When designing agents for operation in uncertain environments, designers need tools to automatically reason about what agents ought to do, how that conflicts with what is actually happening, and how a policy might be modified to remove the conflict. These obligations include ethical and social obligations, permissions, and prohibitions, which constrain how the agent achieves its mission and executes its policy. We propose a new deontic logic, Expected Act Utilitarian deontic logic, for enabling this reasoning at design time: for specifying and verifying the agent's strategic obligations, then modifying its policy from a reference policy to meet those obligations. Unlike approaches that work at the reward level, working at the logical level increases the transparency of the trade-offs. We introduce two algorithms: one for model-checking whether an RL agent has the right strategic obligations, and one for modifying a reference decision policy to make it meet obligations expressed in our logic. We illustrate our algorithms on DAC-MDPs, which accurately abstract neural decision policies, and on toy gridworld environments.
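As a rough illustration of the verification side, the sketch below checks an act-utilitarian obligation against a fixed policy in a small finite MDP: an action is treated as obligatory in a state iff it maximizes expected utility (its Q-value). The value-iteration setup, the data layout, and the tolerance eps are my assumptions, not the paper's algorithm.

```python
# Minimal sketch (not the paper's algorithm): check an act-utilitarian
# obligation against a fixed policy in a finite MDP.
import numpy as np

def q_values(P, R, gamma=0.95, iters=500):
    """P[s][a]: dict mapping next state -> probability; R[s][a]: reward."""
    nS, nA = len(P), len(P[0])
    V = np.zeros(nS)
    Q = np.zeros((nS, nA))
    for _ in range(iters):
        Q = np.array([[R[s][a] + gamma * sum(p * V[t] for t, p in P[s][a].items())
                       for a in range(nA)] for s in range(nS)])
        V = Q.max(axis=1)
    return Q

def obligation_violations(policy, Q, eps=1e-6):
    """States where the policy's action is not expected-utility-maximal,
    i.e., where the agent fails this (simplified) act-utilitarian obligation."""
    return [s for s in range(len(Q)) if Q[s][policy[s]] < Q[s].max() - eps]
```

An empty violation list means the policy meets this simplified notion of its obligations; the paper's DAC-MDP setting would run such a check over the abstract states standing in for a neural policy.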
- Award ID(s): 2145291
- PAR ID: 10612074
- Publisher / Repository: AAAI
- Date Published:
- Journal Name: Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society
- Volume: 7
- ISSN: 3065-8365
- Page Range / eLocation ID: 1368 to 1378
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
This paper studies algorithmic decision-making in the presence of strategic individual behaviors, where an ML model is used to make decisions about human agents and the latter can adapt their behavior strategically to improve their future data. Existing results on strategic learning have largely focused on the linear setting, where agents with linear labeling functions best respond to a (noisy) linear decision policy. Instead, this work focuses on general non-linear settings where agents respond to the decision policy with only "local information" about the policy. Moreover, we simultaneously consider the objectives of maximizing decision-maker welfare (model prediction accuracy), social welfare (agent improvement caused by strategic behaviors), and agent welfare (the extent to which ML underestimates the agents). We first generalize the agent best-response model of previous works to the non-linear setting, then reveal the compatibility of the welfare objectives. We show that the three welfare objectives can attain their optima simultaneously only under restrictive conditions that are challenging to achieve in non-linear settings. The theoretical results imply that existing works solely maximizing the welfare of a subset of parties inevitably diminish the welfare of the others. We thus claim the necessity of balancing the welfare of each party in non-linear settings and propose an irreducible optimization algorithm suitable for general strategic learning. Experiments on synthetic and real data validate the proposed algorithm.
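As a toy rendering of the "local information" best response in a non-linear setting, the sketch below has an agent climb a finite-difference estimate of the policy's local gradient while paying a quadratic cost for moving; the step size, cost weight, and gradient estimator are all my assumptions, not this paper's model.

```python
# Illustrative sketch (my construction, not the paper's model): an agent
# best responds to a non-linear decision policy f using only local
# information about f, namely its gradient at the current features.
import numpy as np

def local_best_response(x, f, step=0.1, cost=1.0, n_steps=10, h=1e-4):
    """Move features x to raise the decision score f(x), paying a
    quadratic cost for deviating from the original features x0."""
    x = x.astype(float)
    x0 = x.copy()
    for _ in range(n_steps):
        # Central finite-difference estimate of the local gradient of f.
        grad = np.array([(f(x + h * e) - f(x - h * e)) / (2 * h)
                         for e in np.eye(len(x))])
        # Ascend f(x) - cost * ||x - x0||^2 via its gradient.
        x = x + step * (grad - 2 * cost * (x - x0))
    return x
```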
-
Endriss, Ulle; Melo, Francisco (Ed.)
Alternating-time temporal logic (ATL) extends branching-time logic by enabling quantification over paths that result from the strategic choices made by multiple agents in various coalitions within the system. While classical temporal logics express properties of "closed" systems, ATL can express properties of "open" systems resulting from interactions among several agents. Reinforcement learning (RL) is a sampling-based approach to decision-making where learning agents, guided by a scalar reward function, discover optimal policies through repeated interactions with the environment. The challenge of translating high-level objectives into scalar rewards for RL has garnered increased interest, particularly following the success of model-free RL algorithms. This paper presents an approach for deploying model-free RL to verify multi-agent systems against ATL specifications. The key contribution of this paper is a verification procedure for model-free RL of quantitative and non-nested classic ATL properties, based on Q-learning, demonstrated on a natural subclass of non-nested ATL formulas.
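A rough sketch of one way such a check could look, under my own assumptions rather than the paper's procedure: tabular minimax Q-learning over a turn-based game in which the coalition maximizes and the remaining agents minimize, with the learned value at the initial state compared against a threshold. The env interface (reset, actions, step, coalition_turn) is hypothetical.

```python
# Sketch under stated assumptions, not the paper's verification procedure.
import random
from collections import defaultdict

def minimax_q(env, episodes=5000, alpha=0.1, gamma=0.99, eps=0.2):
    """Tabular minimax Q-learning for a turn-based game: the coalition
    maximizes, the remaining agents minimize."""
    Q = defaultdict(float)  # key: (state, action)

    def greedy(s, acts):
        pick = max if env.coalition_turn(s) else min
        return pick(acts, key=lambda b: Q[(s, b)])

    for _ in range(episodes):
        s, done = env.reset(), False
        while not done:
            acts = env.actions(s)
            a = random.choice(acts) if random.random() < eps else greedy(s, acts)
            s2, r, done = env.step(s, a)
            target = r if done else r + gamma * Q[(s2, greedy(s2, env.actions(s2)))]
            Q[(s, a)] += alpha * (target - Q[(s, a)])
            s = s2
    return Q

def coalition_can_enforce(env, Q, threshold):
    """Decide a quantitative, non-nested ATL-style property by comparing
    the learned game value at the initial state with a threshold."""
    s0 = env.reset()
    return max(Q[(s0, b)] for b in env.actions(s0)) >= threshold
```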
-
van Berkel, Kees; Ciabattoni, Agata; Horty, John (Ed.)
Markov Decision Processes (MDPs) are the most common model for decision making under uncertainty in the Machine Learning community. An MDP captures nondeterminism, probabilistic uncertainty, and an explicit model of action. A Reinforcement Learning (RL) agent learns to act in an MDP by maximizing a utility function. This paper considers the problem of learning a decision policy that maximizes utility subject to satisfying a constraint expressed in deontic logic. In this setup, the utility captures the agent's mission, such as going quickly from A to B. The deontic formula represents (ethical, social, situational) constraints on how the agent might achieve its mission by prohibiting classes of behaviors. We use the logic of Expected Act Utilitarianism, a probabilistic stit logic that can be interpreted over controlled MDPs. We develop a variation on policy improvement and show that it reaches a constrained local maximum of the mission utility. Given that in stit logic an agent's duty is derived from value maximization, this can be seen as a way of acting to simultaneously maximize two value functions, one of which is implicit, in a bi-level structure. We illustrate these results with experiments on sample MDPs.
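A simplified sketch of the core idea, as I read it (not the paper's algorithm): greedy policy improvement restricted to actions a deontic constraint permits, with the constraint encoded as a predicate that prohibits classes of behaviors.

```python
# Simplified sketch, my rendering of the idea rather than the paper's
# algorithm: policy improvement over permitted actions only.
def constrained_improvement(policy, Q, actions, permitted):
    """Greedy improvement restricted to permitted actions per state.

    policy: dict state -> action; Q: dict state -> dict action -> value;
    actions(s): available actions; permitted(s, a): deontic predicate.
    """
    new_policy = dict(policy)
    for s in policy:
        allowed = [a for a in actions(s) if permitted(s, a)]
        if allowed:  # keep the old action if everything is prohibited
            new_policy[s] = max(allowed, key=lambda a: Q[s][a])
    return new_policy
```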
-
Dynamic max-min fair allocation (DMMF) is a simple and popular mechanism for the repeated allocation of a shared resource among competing agents: in each round, each agent can choose whether to request the resource, which is then allocated to the requesting agent with the least number of allocations received so far. Recent work has shown that under DMMF, a simple threshold-based request policy enjoys surprisingly strong robustness properties, wherein each agent can realize a significant fraction of her optimal utility irrespective of how other agents behave. While this goes some way in mitigating the possibility of a 'tragedy of the commons' outcome, the robust policies require that an agent defend against arbitrary (possibly adversarial) behavior by other agents. This, however, may be far from optimal in real-world settings, where other agents are selfish optimizers rather than adversaries. Robust guarantees therefore give no insight into how agents behave in an equilibrium, and whether outcomes are improved under one. Our work aims to bridge this gap by studying the existence and properties of equilibria under DMMF. To this end, we first show that despite the strong robustness guarantees of the threshold-based strategies, no Nash equilibrium exists when agents participate in DMMF, each using some fixed threshold-based policy. On the positive side, however, we show that for the symmetric case, a simple data-driven request policy guarantees that no agent benefits from deviating to a different fixed threshold policy. In our proposed policy, agents aim to match the historical allocation rate, with a vanishing drift towards the rate optimizing overall welfare for all users. Furthermore, the resulting equilibrium outcome can be significantly better than what follows from the robustness guarantees. Our results are built on a complete characterization of the steady-state distribution under DMMF, as well as new techniques for analyzing strategic agent outcomes under dynamic allocation mechanisms; we hope these may prove of independent interest in related problems.
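To make the mechanism concrete, here is a toy simulation of DMMF with threshold-based request policies; the i.i.d. uniform value draws, the tie-breaking rule, and all parameters are my assumptions, not the paper's setup.

```python
# Toy DMMF simulation under stated assumptions (not the paper's setup):
# each round, agent i requests iff her value draw exceeds thresholds[i];
# the resource goes to the requester with the fewest allocations so far.
import random

def simulate_dmmf(thresholds, rounds=10_000, seed=0):
    rng = random.Random(seed)
    n = len(thresholds)
    allocs = [0] * n
    utility = [0.0] * n
    for _ in range(rounds):
        values = [rng.random() for _ in range(n)]
        requesters = [i for i in range(n) if values[i] >= thresholds[i]]
        if requesters:
            # Max-min fair allocation: fewest past allocations wins,
            # with random tie-breaking.
            winner = min(requesters, key=lambda i: (allocs[i], rng.random()))
            allocs[winner] += 1
            utility[winner] += values[winner]
    return allocs, utility
```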