Title: Generative AI models should include detection mechanisms as a condition for public release
Abstract: The new wave of ‘foundation models’—general-purpose generative AI models, for production of text (e.g., ChatGPT) or images (e.g., MidJourney)—represents a dramatic advance in the state of the art for AI. But their use also introduces a range of new risks, which has prompted an ongoing conversation about possible regulatory mechanisms. Here we propose a specific principle that should be incorporated into legislation: that any organization developing a foundation model intended for public use must demonstrate a reliable detection mechanism for the content it generates, as a condition of its public release. The detection mechanism should be made publicly available in a tool that allows users to query, for an arbitrary item of content, whether the item was generated (wholly or partly) by the model. In this paper, we argue that this requirement is technically feasible and would play an important role in reducing certain risks from new AI models in many domains. We also outline a number of options for the tool’s design, and summarize a number of points where further input from policymakers and researchers would be required.
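One way to make the proposed query interface concrete is a provider-side registry that records a fingerprint of every output the model generates and answers public membership queries. This is a minimal sketch of one possible design; the class and method names are hypothetical, and hash lookup is only one of several mechanisms (e.g., watermarking) a provider could use:

```python
import hashlib


class GenerationRegistry:
    """Illustrative sketch of a provider-side detection tool: store a
    hash of every generated output and answer membership queries.
    All names here are invented for illustration."""

    def __init__(self):
        self._fingerprints = set()

    def _fingerprint(self, content: str) -> str:
        return hashlib.sha256(content.encode("utf-8")).hexdigest()

    def record(self, content: str) -> None:
        # Called by the provider each time the model emits content.
        self._fingerprints.add(self._fingerprint(content))

    def was_generated(self, content: str) -> bool:
        # Public query: was this exact content produced by the model?
        return self._fingerprint(content) in self._fingerprints
```

Exact-match hashing is brittle (it fails on edited or partial content), so a deployed tool would need robust fingerprints or watermark detection; the sketch only illustrates the shape of the query interface the abstract calls for.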
Award ID(s):
2229876
PAR ID:
10524776
Author(s) / Creator(s):
Publisher / Repository:
Ethics and Information Technology
Date Published:
Journal Name:
Ethics and Information Technology
Volume:
25
Issue:
4
ISSN:
1388-1957
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Social service providers play a vital role in the developmental outcomes of underprivileged youth as they transition into adulthood. Educators, mental health professionals, juvenile justice officers, and child welfare caseworkers often have first-hand knowledge of the trials uniquely faced by these vulnerable youth and are charged with mitigating harmful risks, such as mental health challenges, child abuse, drug use, and sex trafficking. Yet, less is known about whether or how social service providers assess and mitigate the online risk experiences of youth under their care. Therefore, as part of the National Science Foundation (NSF) I-Corps program, we conducted interviews with 37 social service providers (SSPs) who work with underprivileged youth to determine what (if any) online risks are most concerning to them given their role in youth protection, how they assess or become aware of these online risk experiences, and whether they see value in the possibility of using artificial intelligence (AI) as a potential solution for online risk detection. Overall, online sexual risks (e.g., sexual grooming and abuse) and cyberbullying were the most salient concern across all social service domains, especially when these experiences crossed the boundary between the digital and the physical worlds. Yet, SSPs had to rely heavily on youth self-reports to know whether and when online risks occurred, which required building a trusting relationship with youth; otherwise, SSPs became aware only after a formal investigation had been launched. Therefore, most SSPs found value in the potential for using AI as an early detection system and to monitor youth, but they were concerned that such a solution would not be feasible due to a lack of resources to adequately respond to online incidences, access to the necessary digital trace data (e.g., social media), context, and concerns about violating the trust relationships they built with youth. 
Thus, such automated risk detection systems should be designed and deployed with caution, as their implementation could cause youth to mistrust adults, thereby limiting the receipt of necessary guidance and support. We add to the bodies of research on adolescent online safety and the benefits and challenges of leveraging algorithmic systems in the public sector. 
  2. Abstract To date, many AI initiatives (eg, AI4K12, CS for All) developed standards and frameworks as guidance for educators to create accessible and engaging Artificial Intelligence (AI) learning experiences for K‐12 students. These efforts revealed a significant need to prepare youth to gain a fundamental understanding of how intelligence is created, applied, and its potential to perpetuate bias and unfairness. This study contributes to the growing interest in K‐12 AI education by examining student learning of modelling real‐world text data. Four students from an Advanced Placement computer science classroom at a public high school participated in this study. Our qualitative analysis reveals that the students developed nuanced and in‐depth understandings of how text classification models—a type of AI application—are trained. Specifically, we found that in modelling texts, students: (1) drew on their social experiences and cultural knowledge to create predictive features, (2) engineered predictive features to address model errors, (3) described model learning patterns from training data and (4) reasoned about noisy features when comparing models. This study contributes to an initial understanding of student learning of modelling unstructured data and offers implications for scaffolding in‐depth reasoning about model decision making. 
Practitioner notes

What is already known about this topic
- Scholarly attention has turned to examining Artificial Intelligence (AI) literacy in K‐12 to help students understand the working mechanism of AI technologies and critically evaluate automated decisions made by computer models.
- While efforts have been made to engage students in understanding AI through building machine learning models with data, few of them go in‐depth into teaching and learning of feature engineering, a critical concept in modelling data.
- There is a need for research to examine students' data modelling processes, particularly in the little‐researched realm of unstructured data.

What this paper adds
- Results show that students developed nuanced understandings of models learning patterns in data for automated decision making.
- Results demonstrate that students drew on prior experience and knowledge in creating features from unstructured data in the learning task of building text classification models.
- Students needed support in performing feature engineering practices, reasoning about noisy features and exploring features in rich social contexts that the data set is situated in.

Implications for practice and/or policy
- It is important for schools to provide hands‐on model building experiences for students to understand and evaluate automated decisions from AI technologies.
- Students should be empowered to draw on their cultural and social backgrounds as they create models and evaluate data sources.
- To extend this work, educators should consider opportunities to integrate AI learning in other disciplinary subjects (ie, outside of computer science classes).
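The feature-engineering practice this study examines (students hand-crafting predictive features from raw text, then training a classifier on them) can be sketched in a few lines. The features and the toy spam/not-spam examples below are invented for illustration, and a simple perceptron stands in for whatever model the classroom used:

```python
def extract_features(text: str) -> dict:
    """Hand-engineered binary features of the kind students might design
    by drawing on their own social experience (features are invented)."""
    words = text.lower().split()
    return {
        "has_exclaim": int("!" in text),
        "mentions_free": int("free" in words),
        "mentions_win": int("win" in words or "won" in words),
    }


def predict(w: dict, b: float, text: str) -> int:
    """Score the text with the current weights; +1 = spam, -1 = not spam."""
    feats = extract_features(text)
    score = b + sum(w.get(k, 0.0) * v for k, v in feats.items())
    return 1 if score >= 0 else -1


def train_perceptron(examples, epochs: int = 20):
    """Tiny perceptron: nudge the weights on each misclassified example,
    so the model 'learns patterns' from the engineered features."""
    w, b = {}, 0.0
    for _ in range(epochs):
        for text, label in examples:  # label is +1 or -1
            if predict(w, b, text) != label:
                for name, value in extract_features(text).items():
                    w[name] = w.get(name, 0.0) + label * value
                b += label
    return w, b
```

Running this on a handful of labelled sentences shows the mechanism students reasoned about: a feature like `mentions_free` acquires positive weight because it separates the classes, while an uninformative ("noisy") feature would stay near zero.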
  3. Responsible AI (RAI) is the science and practice of ensuring the design, development, use, and oversight of AI are socially sustainable: benefiting diverse stakeholders while controlling the risks. Achieving this goal requires active engagement and participation from the broader public. This paper introduces We are AI: Taking Control of Technology, a public education course that brings the topics of AI and RAI to the general audience in a peer-learning setting. We outline the goals behind the course's development, discuss the multi-year iterative process that shaped its creation, and summarize its content. We also discuss two offerings of We are AI to an active and engaged group of librarians and professional staff at New York University, highlighting successes and areas for improvement. The course materials, including a multilingual comic book series by the same name, are publicly available and can be used independently. By sharing our experience in creating and teaching We are AI, we aim to introduce these resources to the community of AI educators, researchers, and practitioners, supporting their public education efforts.
  4. Abstract: Recent advances in generative artificial intelligence (AI) and multimodal learning analytics (MMLA) have allowed for new and creative ways of leveraging AI to support K‐12 students' collaborative learning in STEM+C domains. To date, there is little evidence of AI methods supporting students' collaboration in complex, open‐ended environments. AI systems are known to underperform humans in (1) interpreting students' emotions in learning contexts, (2) grasping the nuances of social interactions and (3) understanding domain‐specific information that was not well‐represented in the training data. As such, combined human and AI (ie, hybrid) approaches are needed to overcome the current limitations of AI systems. In this paper, we take a first step towards investigating how a human‐AI collaboration between teachers and researchers using an AI‐generated multimodal timeline can guide and support teachers' feedback while addressing students' STEM+C difficulties as they work collaboratively to build computational models and solve problems. In doing so, we present a framework characterizing the human component of our human‐AI partnership as a collaboration between teachers and researchers. To evaluate our approach, we present our timeline to a high school teacher and discuss the key insights gleaned from our discussions. Our case study analysis reveals the effectiveness of an iterative approach to using human‐AI collaboration to address students' STEM+C challenges: the teacher can use the AI‐generated timeline to guide formative feedback for students, and the researchers can leverage the teacher's feedback to help improve the multimodal timeline. Additionally, we characterize our findings with respect to two events of interest to the teacher: (1) when the students cross a difficulty threshold, and (2) the point of intervention, that is, when the teacher (or system) should intervene to provide effective feedback. It is important to note that the teacher explained that there should be a lag between (1) and (2) to give students a chance to resolve their own difficulties. Typically, such a lag is not implemented in computer‐based learning environments that provide feedback.

Practitioner notes

What is already known about this topic
- Collaborative, open‐ended learning environments enhance students' STEM+C conceptual understanding and practice, but they introduce additional complexities when students learn concepts spanning multiple domains.
- Recent advances in generative AI and MMLA allow for integrating multiple datastreams to derive holistic views of students' states, which can support more informed feedback mechanisms to address students' difficulties in complex STEM+C environments.
- Hybrid human‐AI approaches can help address collaborating students' STEM+C difficulties by combining the domain knowledge, emotional intelligence and social awareness of human experts with the general knowledge and efficiency of AI.

What this paper adds
- We extend a previous human‐AI collaboration framework using a hybrid intelligence approach to characterize the human component of the partnership as a researcher‐teacher partnership and present our approach as a teacher‐researcher‐AI collaboration.
- We adapt an AI‐generated multimodal timeline to actualize our human‐AI collaboration by pairing the timeline with videos of students encountering difficulties, engaging in active discussions with a high school teacher while watching the videos to discern the timeline's utility in the classroom.
- From our discussions with the teacher, we define two types of inflection points to address students' STEM+C difficulties—the difficulty threshold and the intervention point—and discuss how the feedback latency interval separating them can inform educator interventions.
- We discuss two ways in which our teacher‐researcher‐AI collaboration can help teachers support students encountering STEM+C difficulties: (1) teachers using the multimodal timeline to guide feedback for students, and (2) researchers using teachers' input to iteratively refine the multimodal timeline.

Implications for practice and/or policy
- Our case study suggests that timeline gaps (ie, disengaged behaviour identified by off‐screen students, pauses in discourse and lulls in environment actions) are particularly important for identifying inflection points and formulating formative feedback.
- Human‐AI collaboration exists on a dynamic spectrum and requires varying degrees of human control and AI automation depending on the context of the learning task and students' work in the environment.
- Our analysis of this human‐AI collaboration using a multimodal timeline can be extended in the future to support students and teachers in additional ways, for example, designing pedagogical agents that interact directly with students, developing intervention and reflection tools for teachers, helping teachers craft daily lesson plans and aiding teachers and administrators in designing curricula.
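The two inflection points and the lag between them can be stated directly in code. This is a toy sketch; the per-step difficulty scores, the threshold, and the lag unit are hypothetical stand-ins for whatever the multimodal timeline actually measures:

```python
def plan_intervention(difficulty_scores, threshold, lag):
    """Return (difficulty_threshold_step, intervention_step) for a sequence
    of per-step difficulty scores, or None if the threshold is never crossed.
    The lag implements the teacher-suggested feedback latency interval:
    intervening only some steps after the difficulty threshold is crossed,
    giving students a chance to resolve difficulties on their own."""
    for step, score in enumerate(difficulty_scores):
        if score >= threshold:
            return step, step + lag
    return None
```

For example, with scores [0.1, 0.3, 0.8, 0.9], a threshold of 0.7 and a lag of 2 steps, the difficulty threshold is crossed at step 2 and the intervention point falls at step 4, matching the lag the teacher described.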
  5. Summary: The iconic, palmately compound leaves of Cannabis have attracted significant attention in the past. However, investigations into the genetic basis of leaf shape or its connections to phytochemical composition have yielded inconclusive results. This is partly due to prominent changes in leaflet number within a single plant during development, which has so far prevented the proper use of common morphometric techniques.

Here, we present a new method that overcomes the challenge of nonhomologous landmarks in palmate, pinnate, and lobed leaves, using Cannabis as an example. We model corresponding pseudo‐landmarks for each leaflet as angle‐radius coordinates and model them as a function of leaflet to create continuous polynomial models, bypassing the problems associated with variable number of leaflets between leaves.

We analyze 341 leaves from 24 individuals from nine Cannabis accessions. Using 3591 pseudo‐landmarks in modeled leaves, we accurately predict accession identity, leaflet number, and relative node number.

Intra‐leaf modeling offers a rapid, cost‐effective means of identifying Cannabis accessions, making it a valuable tool for future taxonomic studies, cultivar recognition, and possibly chemical content analysis and sex identification, in addition to permitting the morphometric analysis of leaves in any species with variable numbers of leaflets or lobes.
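The core modelling move described above (fitting, for each pseudo-landmark, a polynomial in normalized leaflet position, so leaves with different leaflet counts share one continuous model) can be sketched with NumPy. The data layout, polynomial degree, and function names here are assumptions for illustration, not the paper's exact pipeline:

```python
import numpy as np


def fit_intra_leaf_model(leaflets, degree=2):
    """Sketch: each leaflet is a list of (angle, radius) pseudo-landmarks,
    with the same number of landmarks per leaflet. Fit one polynomial per
    landmark, in normalized leaflet position on [0, 1], so leaves with
    different leaflet counts map onto a common continuous model."""
    positions = np.linspace(0.0, 1.0, len(leaflets))
    radii = np.array([[r for _, r in leaflet] for leaflet in leaflets])
    return [np.polyfit(positions, radii[:, j], degree)
            for j in range(radii.shape[1])]


def predicted_radius(model, position, landmark_index):
    """Evaluate the fitted model at any fractional leaflet position,
    bypassing the variable-leaflet-number problem."""
    return float(np.polyval(model[landmark_index], position))
```

Because the model is continuous in leaflet position, leaves with five, seven, or nine leaflets all yield comparable coefficient vectors, which is what makes downstream tasks like accession classification possible.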