Title: Goals as reward-producing programs
People are remarkably capable of generating their own goals, beginning with child’s play and continuing into adulthood. Despite considerable empirical and computational work on goals and goal-oriented behaviour, models are still far from capturing the richness of everyday human goals. Here we bridge this gap by collecting a dataset of human-generated playful goals (in the form of scorable, single-player games), modelling them as reward-producing programs and generating novel human-like goals through program synthesis. Reward-producing programs capture the rich semantics of goals through symbolic operations that compose, add temporal constraints and allow program execution on behavioural traces to evaluate progress. To build a generative model of goals, we learn a fitness function over the infinite set of possible goal programs and sample novel goals with a quality-diversity algorithm. Human evaluators found that model-generated goals, when sampled from partitions of program space occupied by human examples, were indistinguishable from human-created games. We also discovered that our model’s internal fitness scores predict games that are evaluated as more fun to play and more human-like.
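To make the idea concrete, here is a minimal, hypothetical sketch (in plain Python rather than the paper's domain-specific language) of a goal as a reward-producing program: a program that is executed over a behavioural trace, checks a temporal constraint, and returns a score. All names below (State, goal_program, the event strings) are illustrative assumptions, not the paper's actual DSL.

```python
# Loose illustration of a "goal as reward-producing program": score 1 point
# each time a ball is thrown and subsequently lands in the bin -- a temporal
# constraint evaluated by running the program over a behavioural trace.
from dataclasses import dataclass

@dataclass
class State:
    event: str          # hypothetical event labels, e.g. "throw"
    held_object: str    # e.g. "dodgeball"

def goal_program(trace: list[State]) -> int:
    score, pending_throw = 0, False
    for state in trace:
        if state.event == "throw" and state.held_object == "dodgeball":
            pending_throw = True                      # throw observed...
        elif state.event == "landed_in_bin" and pending_throw:
            score += 1                                # ...then the landing
            pending_throw = False
    return score

# Example trace: throw -> lands in bin -> second throw that never lands.
trace = [State("throw", "dodgeball"), State("landed_in_bin", ""),
         State("throw", "dodgeball")]
print(goal_program(trace))  # 1
```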
Award ID(s):
2121102
PAR ID:
10576869
Author(s) / Creator(s):
Publisher / Repository:
Nature Machine Intelligence
Date Published:
Journal Name:
Nature Machine Intelligence
Volume:
7
Issue:
2
ISSN:
2522-5839
Page Range / eLocation ID:
205 to 220
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like This
  1. Summerfield, Christopher (Ed.)
    When observing the outcome of a choice, people are sensitive to the choice’s context, such that the experienced value of an option depends on the alternatives: getting $1 when the possibilities were 0 or 1 feels much better than when the possibilities were 1 or 10. Context-sensitive valuation has been documented within reinforcement learning (RL) tasks, in which values are learned from experience through trial and error. Range adaptation, wherein options are rescaled according to the range of values yielded by available options, has been proposed to account for this phenomenon. However, we propose that other mechanisms—reflecting a different theoretical viewpoint—may also explain this phenomenon. Specifically, we theorize that internally defined goals play a crucial role in shaping the subjective value attributed to any given option. Motivated by this theory, we develop a new “intrinsically enhanced” RL model, which combines extrinsically provided rewards with internally generated signals of goal achievement as a teaching signal. Across 7 different studies (including previously published data sets as well as a novel, preregistered experiment with replication and control studies), we show that the intrinsically enhanced model can explain context-sensitive valuation as well as, or better than, range adaptation. Our findings indicate a more prominent role of intrinsic, goal-dependent rewards than previously recognized within formal models of human RL. By integrating internally generated signals of reward, standard RL theories should better account for human behavior, in context-sensitive valuation and beyond.
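A minimal sketch of the core modelling idea, assuming a simple bandit-style value update; the goal_achieved predicate and the mixing weight W are illustrative assumptions, not the authors' exact model.

```python
# Hedged sketch: a teaching signal that mixes the extrinsic outcome with an
# internally generated goal-achievement bonus. Constants are arbitrary.
from collections import defaultdict

ALPHA, W = 0.1, 0.5          # learning rate; weight on the intrinsic signal
Q = defaultdict(float)       # value estimates indexed by option

def goal_achieved(reward: float, context_best: float) -> float:
    """Intrinsic signal: 1 if the outcome met the internally set goal
    (here assumed to be matching the best outcome available in context)."""
    return 1.0 if reward >= context_best else 0.0

def update(option, extrinsic_reward, context_best):
    teaching_signal = extrinsic_reward + W * goal_achieved(extrinsic_reward,
                                                           context_best)
    Q[option] += ALPHA * (teaching_signal - Q[option])

# Getting $1 when the alternatives were {0, 1} vs. {1, 10}: the intrinsic
# bonus fires only in the first context, so identical $ outcomes are
# valued differently.
update("A", 1.0, context_best=1.0)    # goal met  -> larger update
update("B", 1.0, context_best=10.0)   # goal missed -> smaller update
print(Q["A"], Q["B"])                 # Q["A"] > Q["B"]
```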
  2. We propose tutorial generation for games as an AI problem: automatically generating tutorials that teach players how to play a game. This problem can be approached in several ways, including generating natural language descriptions of game rules, generating instructive game levels, and generating demonstrations of how to play using agents that behave in a human-like manner. We further argue that the General Video Game AI framework provides a useful testbed for addressing this problem.
  3. Devising models that reliably recognize player goals is a key challenge in creating player-adaptive games. Player goal recognition is the task of automatically recognizing the intent of a player from a sequence of observed player actions in a game environment. In open-world digital games, players often undertake suboptimal and varied sequences of actions to achieve goals, and the high degree of freedom afforded to players makes it challenging to identify sequential patterns that lead toward specific goals. To address these issues, we present a player goal recognition framework that utilizes a fine-tuned T5 language model, which incorporates our novel attention mechanism called Temporal Contrary Attention (TCA). The T5 language model enables the framework to exploit correlations between observations through non-sequential self-attention within input sequences, while TCA enables the framework to learn to eliminate goal hypotheses by considering counterevidence within a temporal window. We evaluate our approach using game trace data collected from 144 players' interactions with an open-world educational game. Specifically, we investigate the predictive capacity of our approach to recognize player goals as well as player plans represented as abstract actions. Results show that our approach outperforms non-linguistic machine learning approaches as well as T5 without TCA. We discuss the implications of these findings for the design and development of player goal recognition models to create player-adaptive games. 
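As a loose illustration only (not the paper's transformer architecture), the elimination idea behind TCA can be sketched as down-weighting a goal hypothesis whenever counterevidence for it appears within a recent window of observed actions; the window size, decay factor, and action names below are arbitrary assumptions.

```python
# Toy sketch of hypothesis elimination via counterevidence in a temporal
# window, loosely inspired by the TCA idea described above.
def rank_goals(actions: list[str],
               counterevidence: dict[str, set[str]],
               window: int = 3) -> dict[str, float]:
    scores = {goal: 1.0 for goal in counterevidence}
    recent = actions[-window:]                     # the temporal window
    for goal, contrary in counterevidence.items():
        # Each contrary action observed recently suppresses the hypothesis.
        hits = sum(a in contrary for a in recent)
        scores[goal] *= 0.5 ** hits
    return scores

# Hypothetical example: selling the potion is counterevidence for the
# goal of brewing one.
counter = {"brew_potion": {"sell_potion"}, "cure_patient": set()}
print(rank_goals(["gather_herbs", "sell_potion", "talk_npc"], counter))
# {'brew_potion': 0.5, 'cure_patient': 1.0}
```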
  4. Adaptation Based Programming (ABP) allows programmers to employ "choice points" at program locations where they are uncertain about how to best code the program logic. Reinforcement learning (RL) is then used to automatically learn to make choice-point decisions that optimize the reward achieved by the program. In this paper, we consider a new approach to explaining the learned decisions of adaptive programs. The key idea is to include simple program annotations that define multiple semantically meaningful reward types, which compose to define the overall reward signal used for learning. Using these reward types, we define the notion of reward difference explanations (RDXs), which aim to explain why, at a choice point, an alternative A was selected over another alternative B. An RDX gives the difference in the predicted future reward of each type when selecting A versus B and then continuing to run the adaptive program. Significant differences can provide insight into why A was or was not preferred to B. We describe a SARSA-style learning algorithm for learning to optimize the choices at each choice point, while also learning side information for producing RDXs. We demonstrate this explanation approach through a case study in a synthetic domain, which shows the general promise of the approach and highlights future research questions.
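A hedged sketch of the decomposed-value idea: one value estimate per annotated reward type, with the RDX computed as the per-type difference between two alternatives at a choice point. The reward-type names and tabular SARSA details are illustrative assumptions, not the paper's exact algorithm.

```python
# One Q-function per annotated reward type; the RDX for alternatives A vs. B
# at a choice point is the per-type difference in predicted future reward.
from collections import defaultdict

REWARD_TYPES = ["progress", "energy_cost", "safety"]   # hypothetical types
# Q[reward_type][(choice_point, alternative)] -> predicted future reward
Q = {t: defaultdict(float) for t in REWARD_TYPES}

def sarsa_update(t, s, a, r_t, s2, a2, alpha=0.1, gamma=0.95):
    """Per-type SARSA update: each reward type keeps its own estimate."""
    Q[t][(s, a)] += alpha * (r_t + gamma * Q[t][(s2, a2)] - Q[t][(s, a)])

def rdx(choice_point, a, b):
    """Why was A selected over B? The per-type Q-value difference."""
    return {t: Q[t][(choice_point, a)] - Q[t][(choice_point, b)]
            for t in REWARD_TYPES}

# After training, a large positive "progress" entry alongside a small
# negative "energy_cost" entry would explain a preference for A over B.
Q["progress"][("cp1", "A")] = 4.0
Q["energy_cost"][("cp1", "A")] = -1.0
Q["progress"][("cp1", "B")] = 1.0
print(rdx("cp1", "A", "B"))
# {'progress': 3.0, 'energy_cost': -1.0, 'safety': 0.0}
```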
  5. Madkour, Abdelrahman; Otto, Jasmine; Ferreira, Lucas N; Johnson-Bey, Shi (Eds.)
    Player goals in games are often framed in terms of achieving something in the game world, but this framing can fail to capture goals centered on the player’s own mental model, such as seeking the answers to questions about the game world. We use a least-commitment model of interactive narrative to characterize these knowledge goals and the problem of knowledge goal recognition. As a first attempt to solve the knowledge goal recognition problem, we adapt a classical goal recognition paradigm, but in our empirical evaluation the approach suffers from a high rate of incorrectly rejecting a synthetic player’s true goals; we discuss how handling of player goals could be made more robust in practice. 
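For concreteness, here is a toy sketch of the kind of classical, cost-based goal recognition paradigm being adapted (in the spirit of plan recognition as planning); the cost model, threshold, and action names are stand-in assumptions, and the threshold sensitivity mirrors the over-eager rejection of true goals noted above.

```python
# Toy cost-difference goal recognition: a goal stays plausible when the
# observed actions do not make reaching it much costlier than acting
# optimally for that goal.
def recognize(goals, observations, cost, threshold=1.0):
    plausible = []
    for g in goals:
        delta = cost(g, observations) - cost(g, [])
        if delta <= threshold:
            plausible.append(g)
    return plausible

# Hypothetical cost model: each observed action off a goal's optimal path
# adds 1 to the cost of still achieving that goal.
def toy_cost(goal, obs):
    detours = {"learn_secret": {"open_chest"}, "find_key": set()}
    return len([a for a in obs if a in detours.get(goal, set())])

print(recognize(["learn_secret", "find_key"], ["open_chest"], toy_cost, 0.5))
# ['find_key'] -- with threshold 0.5, a single off-path observation already
# eliminates "learn_secret": the kind of over-eager rejection discussed above.
```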