Learning to Edit Visual Programs with Self-Supervision

Jones, R Kenny; Zhang, Renhao; Ganeshan, Aditya; Ritchie, Daniel


This content will become publicly available on December 10, 2025

We design a system that learns how to edit visual programs. Our edit network consumes a complete input program and a visual target. From this input, we task our network with predicting a local edit operation that could be applied to the input program to improve its similarity to the target. To apply this scheme to domains that lack program annotations, we develop a self-supervised learning approach that integrates this edit network into a bootstrapped finetuning loop along with a network that predicts entire programs in one shot. Our joint finetuning scheme, when coupled with an inference procedure that initializes a population from the one-shot model and evolves members of this population with the edit network, helps to infer more accurate visual programs. Across multiple domains, we experimentally compare our method against the alternative of using only the one-shot model and find that, even under equal search-time budgets, our editing-based paradigm provides significant advantages.
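
The inference procedure described in the abstract (initialize a population from the one-shot model, then evolve its members with the edit network) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the authors' implementation: a "program" is a list of integers, `similarity` is a negative L1 distance, and `one_shot_sample` and `propose_edit` are hypothetical stand-ins for the two learned networks. In particular, the stand-in edit function reads the target directly, whereas the real edit network would only see a visual rendering of it.

```python
import random
from typing import List, Sequence

Program = List[int]  # toy stand-in for a visual program (assumption)


def similarity(program: Sequence[int], target: Sequence[int]) -> float:
    """Toy score: negative L1 distance to the target (higher is better)."""
    return -float(sum(abs(p - t) for p, t in zip(program, target)))


def one_shot_sample(target: Sequence[int], rng: random.Random) -> Program:
    """Hypothetical stand-in for the one-shot model: a noisy guess at the target."""
    return [t + rng.randint(-3, 3) for t in target]


def propose_edit(program: Sequence[int], target: Sequence[int],
                 rng: random.Random) -> Program:
    """Hypothetical stand-in for the edit network: apply one local edit.

    Picks one position at random and moves it one step toward the target.
    """
    edited = list(program)
    i = rng.randrange(len(edited))
    if edited[i] < target[i]:
        edited[i] += 1
    elif edited[i] > target[i]:
        edited[i] -= 1
    return edited


def infer(target: Sequence[int], population_size: int = 8,
          rounds: int = 20, seed: int = 0) -> Program:
    """Initialize a population from the one-shot stand-in, then evolve it."""
    rng = random.Random(seed)
    population = [one_shot_sample(target, rng) for _ in range(population_size)]
    for _ in range(rounds):
        # Each member proposes one edited child.
        children = [propose_edit(p, target, rng) for p in population]
        # Keep the best candidates among parents and children.
        pool = population + children
        pool.sort(key=lambda p: similarity(p, target), reverse=True)
        population = pool[:population_size]
    return max(population, key=lambda p: similarity(p, target))


if __name__ == "__main__":
    target = [4, 0, 7, 2]
    best = infer(target)
    print(best, similarity(best, target))
```

Because parents stay in the selection pool, the best score in this sketch never decreases from one round to the next. This is a design choice of the toy example, not a claim about the paper's search procedure.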

Award ID(s): 1941808

PAR ID: 10580887

Author(s) / Creator(s): Jones, R Kenny; Zhang, Renhao; Ganeshan, Aditya; Ritchie, Daniel

Publisher / Repository: NeurIPS '24: 2024 Conference on Neural Information Processing Systems

Date Published: 2024-12-10

Format(s): Medium: X

Location: Vancouver, BC, Canada

Sponsoring Org: National Science Foundation

Conference Paper: The DOI is not currently available.
