Learning to infer generative template programs for visual concepts

Jones, R Kenny; Chaudhuri, Siddhartha; Ritchie, Daniel

People grasp flexible visual concepts from a few examples. We explore a neurosymbolic system that learns how to infer programs that capture visual concepts in a domain-general fashion. We introduce Template Programs: programmatic expressions from a domain-specific language that specify structural and parametric patterns common to an input concept. Our framework supports multiple concept-related tasks, including few-shot generation and co-segmentation through parsing. We develop a learning paradigm that allows us to train networks that infer Template Programs directly from visual datasets that contain concept groupings. We run experiments across multiple visual domains: 2D layouts, Omniglot characters, and 3D shapes. We find that our method outperforms task-specific alternatives, and performs competitively against domain-specific approaches for the limited domains where they exist.

Award ID(s): 1941808

PAR ID: 10580883

Author(s) / Creator(s): Jones, R Kenny; Chaudhuri, Siddhartha; Ritchie, Daniel

Publisher / Repository: ICML'24: Proceedings of the 41st International Conference on Machine Learning

Date Published: 2024-07-21

Location: Vienna, Austria

Sponsoring Org: National Science Foundation
