Verification and Validation of AI Systems Using Explanations

Mahmud, Saaduddin; Saisubramanian, Sandhya; Zilberstein, Shlomo

doi:10.1609/aaaiss.v4i1.31774

Citation Details

Verification and validation of AI systems, particularly learning-enabled systems, is hard because these systems often lack formal specifications and rely instead on incomplete data and subjective human feedback. Aligning the behavior of such systems with the intended objectives and values of human designers and stakeholders is very challenging, and deploying misaligned AI systems can be risky. We propose to use both existing and new forms of explanations to improve the verification and validation of AI systems. Toward that goal, we present a framework in which the agent explains its behavior and a critic signals whether the explanation passes a test. In cases where the explanation fails, the agent offers alternative explanations to gather feedback, which is then used to improve the system's alignment. We discuss examples of this approach that proved to be effective, and how to extend the scope of explanations and minimize the human effort involved in this process.

Award ID(s): 2416459

PAR ID: 10599148

Author(s) / Creator(s): Mahmud, Saaduddin; Saisubramanian, Sandhya; Zilberstein, Shlomo

Publisher / Repository: AAAI

Date Published: 2024-11-08

Journal Name: Proceedings of the AAAI Symposium Series

Volume: 4

Issue: 1

ISSN: 2994-4317

Page Range / eLocation ID: 76 to 80

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.1609/aaaiss.v4i1.31774
