NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SYNAPSE: SYmbolic Neural-Aided Preference Synthesis Engine

https://doi.org/10.1609/aaai.v39i26.34965

Modak, Sadanand; Patton, Noah Tobias; Dillig, Isil; Biswas, Joydeep (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

This paper addresses the problem of preference learning, which aims to align robot behaviors through learning user-specific preferences (e.g. “good pull-over location”) from visual demonstrations. Despite its similarity to learning factualconcepts (e.g. “red door”), preference learning is a fundamentally harder problem due to its subjective nature and the paucity of person-specific training data. We address this problem using a novel framework called SYNAPSE, which is aneuro-symbolic approach designed to efficiently learn preferential concepts from limited data. SYNAPSE represents preferences as neuro-symbolic programs – facilitating inspection of individual parts for alignment – in a domain-specificlanguage (DSL) that operates over images and leverages a novel combination of visual parsing, large language models, and program synthesis to learn programs representing individual preferences. We perform extensive evaluations on various preferential concepts as well as user case studies demonstrating its ability to align well with dissimilar user preferences. Our method significantly outperforms baselines, especially when it comes to out-of-distribution generalization. We show the importance of the design choices in the framework through multiple ablation studies.
more » « less
Free, publicly-accessible full text available April 11, 2026
Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

Banerjee, Arko; Rahmani, Kia; Biswas, Joydeep; Dillig, Isil (December 2024, Advances in Neural Information Processing Systems (Neurips))

Free, publicly-accessible full text available December 12, 2025
Programmatic Imitation Learning From Unlabeled and Noisy Demonstrations

https://doi.org/10.1109/LRA.2024.3385691

Xin, Jimmy; Zheng, Linus; Rahmani, Kia; Wei, Jiayi; Holtz, Jarrett; Dillig, Isil; Biswas, Joydeep (June 2024, IEEE Robotics and Automation Letters)

Full Text Available
Programming-by-Demonstration for Long-Horizon Robot Tasks

https://doi.org/10.1145/3632860

Patton, Noah; Rahmani, Kia; Missula, Meghana; Biswas, Joydeep; Dillig, Işıl (January 2024, Proceedings of the ACM on Programming Languages)

The goal ofprogrammatic Learning from Demonstration (LfD)is to learn a policy in a programming language that can be used to control a robot’s behavior from a set of user demonstrations. This paper presents a new programmatic LfD algorithm that targetslong-horizon robot taskswhich require synthesizing programs with complex control flow structures, including nested loops with multiple conditionals. Our proposed method first learns a program sketch that captures the target program’s control flow and then completes this sketch using an LLM-guided search procedure that incorporates a novel technique for proving unrealizability of programming-by-demonstration problems. We have implemented our approach in a new tool calledprolexand present the results of a comprehensive experimental evaluation on 120 benchmarks involving complex tasks and environments. We show that, given a 120 second time limit,prolexcan find a program consistent with the demonstrations in 80% of the cases. Furthermore, for 81% of the tasks for which a solution is returned,prolexis able to find the ground truth program with just one demonstration. In comparison, CVC5, a syntaxguided synthesis tool, is only able to solve 25% of the caseseven when given the ground truth program sketch, and an LLM-based approach, GPT-Synth, is unable to solve any of the tasks due to the environment complexity.
more » « less

Search for: All records