NSF PAR Search | NSF Public Access Repository


IRIS: LLM-Assisted Static Analysis for Detecting Security Vulnerabilities

Li, Ziyang; Dutta, Saikat; Naik, Mayur (April 2025, ICLR 2025)

Free, publicly-accessible full text available April 24, 2026
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision

Huang, Jiani; Li, Ziyang; Naik, Mayur; Lim, Ser Nam (April 2025, ICLR 2025)

Free, publicly-accessible full text available April 1, 2026
Data-Efficient Learning with Neural Programs

Solko-Breslin, Alaia; Choi, Seewon; Li, Ziyang; Velingker, Neelay; Alur, Rajeev; Naik, Mayur; Wong, Eric (December 2024, Neural Information Processing Systems Foundation (NIPS Foundation))

Many computational tasks can be naturally expressed as a composition of a DNN followed by a program written in a traditional programming language or an API call to an LLM. We call such composites "neural programs" and focus on the problem of learning the DNN parameters when the training data consist of end-to-end input-output labels for the composite. When the program is written in a differentiable logic programming language, techniques from neurosymbolic learning are applicable, but in general, the learning for neural programs requires estimating the gradients of black-box components. We present an algorithm for learning neural programs, called ISED, that only relies on input-output samples of black-box components. For evaluation, we introduce new benchmarks that involve calls to modern LLMs such as GPT-4 and also consider benchmarks from the neurosymbolic learning literature. Our evaluation shows that for the latter benchmarks, ISED has comparable performance to state-of-the-art neurosymbolic frameworks. For the former, we use adaptations of prior work on gradient approximations of black-box components as a baseline, and show that ISED achieves comparable accuracy but in a more data- and sample-efficient manner.
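The idea of learning from only input-output samples of a black-box component can be illustrated with a small sketch. This is not the ISED algorithm from the paper. It only shows, under stated assumptions, one way to summarize sampled runs of a black-box program into an output distribution whose weights depend on the predicted input probabilities. The function name `estimate_output_distribution` and its parameters are hypothetical.

```python
import random
from collections import defaultdict


def estimate_output_distribution(input_dists, program, num_samples, seed=0):
    """Summarize a black-box `program` using only sampled input-output pairs.

    input_dists: one dict per program input, mapping symbol -> probability
                 (standing in for a DNN's softmax output; an assumption).
    program:     any callable; it is never inspected, only called.
    """
    rng = random.Random(seed)
    seen = set()
    scores = defaultdict(float)
    for _ in range(num_samples):
        # Draw one symbol per input according to its predicted distribution.
        symbols = tuple(
            rng.choices(list(d), weights=list(d.values()))[0] for d in input_dists
        )
        if symbols in seen:
            continue  # count each distinct input combination once
        seen.add(symbols)
        # Weight the sampled combination by the product of its probabilities.
        weight = 1.0
        for dist, symbol in zip(input_dists, symbols):
            weight *= dist[symbol]
        scores[program(*symbols)] += weight
    # Normalize over the outputs that were actually observed.
    total = sum(scores.values())
    return {out: w / total for out, w in scores.items()}
```

For example, `estimate_output_distribution([{3: 1.0}, {4: 1.0}], lambda a, b: a + b, num_samples=5)` returns `{7: 1.0}`. In a learning setting the weights would be computed with an automatic differentiation library so that a loss on the estimated distribution can update the DNN parameters.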
Full Text Available
TyGr: Type Inference on Stripped Binaries using Graph Neural Networks

Zhu, Chang; Li, Ziyang; Xue, Anton; Bajaj, Ati Priya; Gibbs, Wil; Liu, Yibo; Alur, Rajeev; Bao, Tiffany; Dai, Hanjun; Doupé, Adam; et al (August 2024, USENIX)

Full Text Available
TYGR: type inference on stripped binaries using graph neural networks

Zhu, Chang; Li, Ziyang; Xue, Anton; Bajaj, Ati Priya; Gibbs, Wil; Liu, Yibo; Alur, Rajeev; Bao, Tiffany; Dai, Hanjun; Doupé, Adam; et al (August 2024, Proceedings of the 33rd USENIX Conference on Security Symposium)

Full Text Available
Relational Programming with Foundation Models

https://doi.org/10.1609/aaai.v38i9.28934

Li, Ziyang; Huang, Jiani; Liu, Jason; Zhu, Felix; Zhao, Eric; Dodds, William; Velingker, Neelay; Alur, Rajeev; Naik, Mayur (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

Foundation models have vast potential to enable diverse AI applications. The powerful yet incomplete nature of these models has spurred a wide range of mechanisms to augment them with capabilities such as in-context learning, information retrieval, and code interpreting. We propose Vieira, a declarative framework that unifies these mechanisms in a general solution for programming with foundation models. Vieira follows a probabilistic relational paradigm and treats foundation models as stateless functions with relational inputs and outputs. It supports neuro-symbolic applications by enabling the seamless combination of such models with logic programs, as well as complex, multi-modal applications by streamlining the composition of diverse sub-models. We implement Vieira by extending the Scallop compiler with a foreign interface that supports foundation models as plugins. We implement plugins for 12 foundation models including GPT, CLIP, and SAM. We evaluate Vieira on 9 challenging tasks that span language, vision, and structured and vector databases. Our evaluation shows that programs in Vieira are concise, can incorporate modern foundation models, and have comparable or better accuracy than competitive baselines.
Full Text Available
Scallop: A Language for Neurosymbolic Programming

https://doi.org/10.1145/3591280

Li, Ziyang; Huang, Jiani; Naik, Mayur (June 2023, Proceedings of the ACM on Programming Languages)

We present Scallop, a language which combines the benefits of deep learning and logical reasoning. Scallop enables users to write a wide range of neurosymbolic applications and train them in a data- and compute-efficient manner. It achieves these goals through three key features: 1) a flexible symbolic representation that is based on the relational data model; 2) a declarative logic programming language that is based on Datalog and supports recursion, aggregation, and negation; and 3) a framework for automatic and efficient differentiable reasoning that is based on the theory of provenance semirings. We evaluate Scallop on a suite of eight neurosymbolic applications from the literature. Our evaluation demonstrates that Scallop is capable of expressing algorithmic reasoning in diverse and challenging AI tasks, provides a succinct interface for machine learning programmers to integrate logical domain knowledge, and yields solutions that are comparable or superior to state-of-the-art models in terms of accuracy. Furthermore, Scallop's solutions outperform these models in aspects such as runtime and data efficiency, interpretability, and generalizability.
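The combination of a recursive Datalog rule with provenance semirings can be sketched in a few lines of Python. This is not Scallop's implementation or syntax. It is a minimal illustration, assuming the standard two-rule reachability program `path(a, b) :- edge(a, b)` and `path(a, c) :- path(a, b), edge(b, c)`, in which each fact carries a tag, joined facts combine tags with the semiring's "times", and alternative derivations combine with its "plus". The names `Semiring`, `MAX_TIMES`, `BOOLEAN`, and `transitive_closure` are hypothetical.

```python
from typing import Any, Callable, NamedTuple


class Semiring(NamedTuple):
    zero: Any                        # tag of a fact with no derivation
    plus: Callable[[Any, Any], Any]  # merges alternative derivations
    times: Callable[[Any, Any], Any] # merges facts joined in one rule body


# Tag = probability of the single most likely derivation.
MAX_TIMES = Semiring(0.0, max, lambda a, b: a * b)
# Tag = whether the fact is derivable at all (plain Datalog).
BOOLEAN = Semiring(False, lambda a, b: a or b, lambda a, b: a and b)


def transitive_closure(edges, sr):
    """Naive fixed-point evaluation of the two reachability rules.

    edges: dict mapping (src, dst) -> tag. The loop terminates for the two
    semirings above; other semirings may not converge on cyclic graphs.
    """
    path = dict(edges)  # path(a, b) :- edge(a, b)
    changed = True
    while changed:
        changed = False
        for (a, b), path_tag in list(path.items()):
            for (b2, c), edge_tag in edges.items():
                if b != b2:
                    continue
                # path(a, c) :- path(a, b), edge(b, c)
                derived = sr.times(path_tag, edge_tag)
                old = path.get((a, c), sr.zero)
                merged = sr.plus(old, derived)
                if merged != old:
                    path[(a, c)] = merged
                    changed = True
    return path
```

Swapping the semiring changes what the same rules compute: with `BOOLEAN` the result is ordinary reachability, while with `MAX_TIMES` each `path` fact is tagged with the probability of its best derivation. Differentiable reasoning uses tags that also track gradients with respect to the input facts.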
Full Text Available
Scallop: From Probabilistic Deductive Databases to Scalable Differentiable Reasoning

Huang, Jiani; Li, Ziyang; Chen, Binghong; Samel, Karan; Naik, Mayur; Song, Le; Si, Xujie (December 2021, Advances in Neural Information Processing Systems)

Deep learning and symbolic reasoning are complementary techniques for an intelligent system. However, principled combinations of these techniques are typically limited in scalability, rendering them ill-suited for real-world applications. We propose Scallop, a system that builds upon probabilistic deductive databases, to bridge this gap. The key insight underlying Scallop is a provenance framework that introduces a tunable parameter to specify the level of reasoning granularity. Scallop thereby i) generalizes exact probabilistic reasoning, ii) asymptotically reduces computational cost, and iii) provides relative accuracy guarantees. On synthetic tasks involving mathematical and logical reasoning, Scallop scales significantly better without sacrificing accuracy compared to DeepProbLog, a principled neural logic programming approach. Scallop also scales to a newly created real-world Visual Question Answering (VQA) benchmark that requires multi-hop reasoning, achieving 84.22% accuracy and outperforming two VQA-tailored models based on Neural Module Networks and transformers by 12.42% and 21.66% respectively.
Full Text Available
Arbitrar: User-Guided API Misuse Detection

https://doi.org/10.1109/SP40001.2021.00090

Li, Ziyang; Machiry, Aravind; Chen, Binghong; Wang, Ke; Naik, Mayur; Song, Le (May 2021, IEEE Symposium on Security and Privacy)

Software APIs exhibit rich diversity and complexity which not only renders them a common source of programming errors but also hinders program analysis tools for checking them. Such tools either expect a precise API specification, which requires program analysis expertise, or presume that correct API usages follow simple idioms that can be automatically mined from code, which suffers from poor accuracy. We propose a new approach that allows regular programmers to find API misuses. Our approach interacts with the user to classify valid and invalid usages of each target API method. It minimizes user burden by employing an active learning algorithm that ranks API usages by their likelihood of being invalid. We implemented our approach in a tool called ARBITRAR for C/C++ programs, and applied it to check the uses of 18 API methods in 21 large real-world programs, including OpenSSL and the Linux kernel. Within just 3 rounds of user interaction on average per API method, ARBITRAR found 40 new bugs, with patches accepted for 18 of them. Moreover, ARBITRAR finds all known bugs reported by a state-of-the-art tool, APISAN, in a benchmark suite comprising 92 bugs, with a false positive rate of only 51.5% compared to APISAN’s 87.9%.
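The interaction loop described above (rank usages by likelihood of being invalid, ask the user about the top one, update) can be sketched as follows. This is not ARBITRAR's algorithm or feature encoding. It is a minimal illustration that assumes each API usage is already a numeric feature vector and scores it with a Gaussian kernel against the usages labeled so far. The names `kernel`, `invalid_score`, `active_learning`, and the `oracle` callback standing in for the user are all hypothetical.

```python
import math


def kernel(x, y, bandwidth=1.0):
    """Gaussian similarity between two feature vectors."""
    dist_sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-dist_sq / (2 * bandwidth ** 2))


def invalid_score(x, labeled):
    """Higher when x resembles usages labeled invalid and differs from valid ones.

    labeled: list of (features, is_invalid) pairs collected from the user.
    """
    return sum(kernel(x, f) * (1 if bad else -1) for f, bad in labeled)


def active_learning(usages, oracle, rounds):
    """Query the user about the highest-ranked unlabeled usage each round.

    usages: list of feature vectors, one per API usage (assumed encoding).
    oracle: callable index -> bool, True if the user marks the usage invalid.
    Returns the indices the user confirmed as invalid, in query order.
    """
    labeled = []
    unlabeled = list(range(len(usages)))
    found = []
    for _ in range(min(rounds, len(usages))):
        best = max(unlabeled, key=lambda i: invalid_score(usages[i], labeled))
        unlabeled.remove(best)
        is_bad = oracle(best)
        labeled.append((usages[best], is_bad))
        if is_bad:
            found.append(best)
    return found
```

Each answer reshapes the ranking: a usage marked valid pushes its near neighbors down, and a usage marked invalid pulls similar ones to the top, so a small number of rounds can surface a cluster of related misuses.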
Full Text Available
