Learning Optimal Advantage from Preferences and Mistaking it for Reward. | NSF Public Access Repository

Citation Details

Learning Optimal Advantage from Preferences and Mistaking it for Reward.

Award ID(s): 2125858

PAR ID: 10536880

Author(s) / Creator(s): Knox, WB; Hatgis-Kessell, S; Adalgeirsson, SO; Booth, S; Dragan, A; Stone, P; Niekum, S

Publisher / Repository: AAAI

Date Published: 2024-02-01

Journal Name: Annual AAAI Conference

ISSN: ####-####

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
The DOI is not currently available.