NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Neural Network Verification with Branch-and-Bound for General Nonlinearities

Shi, Zhouxing; Jin, Qirui; Kolter, Zico; Jana, Suman; Hsieh, Cho-Jui; Zhang, Huan (May 2025, 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems)

Free, publicly-accessible full text available May 3, 2026
PatchCURE: Improving Certifiable Robustness, Model Utility, and Computation Efficiency of Adversarial Patch Defenses

Xiang, Chong; Wu, Tong; Dai, Sihui; Petit, Jonathan; Jana, Suman; Mittal, Prateek (August 2024, USENIX Security)

Full Text Available
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain

Min, Marcus J; Ding, Yangruibo; Buratti, Luca; Pujar, Saurabh; Kaiser, Gail; Jana, Suman; Ray, Baishakhi (April 2024, OpenReview)

Code Large Language Models (Code LLMs) are being increasingly employed in real-life applications, so evaluating them is critical. While the conventional accuracy evaluates the performance of Code LLMs on a set of individual tasks, their self-consistency across different tasks is overlooked. Intuitively, a trustworthy model should be self-consistent when generating natural language specifications for its own code and generating code for its own specifications. Failure to preserve self-consistency reveals a lack of understanding of the shared semantics underlying natural language and programming language, and therefore undermines the trustworthiness of a model. In this paper, we first formally define the self-consistency of Code LLMs and then design a framework, IdentityChain, which effectively and efficiently evaluates the self-consistency and conventional accuracy of a model at the same time. We study eleven Code LLMs and show that they fail to preserve self-consistency, which is indeed a distinct aspect from conventional accuracy. Furthermore, we show that IdentityChain can be used as a model debugging tool to expose weaknesses of Code LLMs by demonstrating three major weaknesses that we identify in current models using IdentityChain. Our code is available at https://github.com/marcusm117/IdentityChain.
more » « less
Full Text Available
FreePart: Hardening Data Processing Software via Framework-based Partitioning and Isolation

https://doi.org/10.1145/3623278.3624760

Ahad, Ali; Wang, Gang; Kim, Chung Hwan; Jana, Suman; Lin, Zhiqiang; Kwon, Yonghwi (March 2024, ASPLOS)

Data processing oriented software, especially machine learning applications, are heavily dependent on standard frameworks/libraries such as TensorFlow and OpenCV. As those frameworks have gained significant popularity, the exploitation of vulnerabilities in the frameworks has become a critical security concern. While software isolation can minimize the impact of exploitation, existing approaches suffer from difficulty analyzing complex program dependencies or excessive overhead, making them ineffective in practice. We propose FreePart, a framework-focused software partitioning technique specialized for data processing applications. It is based on an observation that the execution of a data processing application, including data flows and usage of critical data, is closely related to the invocations of framework APIs. Hence, we conduct a temporal partitioning of the host application’s execution based on the invocations of framework APIs and the data objects used by the APIs. By focusing on data accesses at runtime instead of static program code, it provides effective and practical isolation from the perspective of data. Our evaluation on 23 applications using popular frameworks (e.g., OpenCV, Caffe, PyTorch, and TensorFlow) shows that FreePart is effective against all attacks composed of 18 real-world vulnerabilities with a low overhead (3.68%).
more » « less
Full Text Available
Learning Approximate Execution Semantics From Traces for Binary Function Similarity

https://doi.org/10.1109/TSE.2022.3231621

Pei, Kexin; Xuan, Zhou; Yang, Junfeng; Jana, Suman; Ray, Baishakhi (April 2023, IEEE Transactions on Software Engineering)

Full Text Available
NeuDep: neural binary memory dependence analysis

https://doi.org/10.1145/3540250.3549147

Pei, Kexin; She, Dongdong; Wang, Michael; Geng, Scott; Xuan, Zhou; David, Yaniv; Yang, Junfeng; Jana, Suman; Ray, Baishakhi (November 2022, ESEC/FSE 2022: Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering)

Full Text Available
A Branch and Bound Framework for Stronger Adversarial Attacks of ReLU Networks

Zhang, Huan; Wang, Shiqi; Xu, Kaidi; Wang, Yihan; Jana, Suman; Hsieh, Cho-Jui; Kolter, Zico (January 2022, International Conference on Machine Learning (ICML))

Full Text Available
General Cutting Planes for Bound-Propagation-Based Neural Network Verification

Zhang, Huan; Wang, Shiqi; Xu, Kaidi; Li, Linyi; Li, Bo; Jana, Suman; Hsieh, Cho-Jui; Kolter, Zico (January 2022, Advances in neural information processing systems)

Full Text Available
DistAI: Data-Driven Automated Invariant Learning for Distributed Protocols

Yao, Jianan; Tao, Runzhou; Gu, Ronghui; Nieh, Jason; Jana, Suman; Ryan, Gabriel. (July 2021, Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2021))

Distributed systems are notoriously hard to implement correctly due to non-determinism. Finding the inductive invariant of the distributed protocol is a critical step in verifying the correctness of distributed systems, but takes a long time to do even for simple protocols. We present DistAI, a data-driven automated system for learning inductive invariants for distributed protocols. DistAI generates data by simulating the distributed protocol at different instance sizes and recording states as samples. Based on the observation that invariants are often concise in practice, DistAI starts with small invariant formulas and enumerates all strongest possible invariants that hold for all samples. It then feeds those invariants and the desired safety properties to an SMT solver to check if the conjunction of the invariants and the safety properties is inductive. Starting with small invariant formulas and strongest possible invariants avoids large SMT queries, improving SMT solver performance. Because DistAI starts with the strongest possible invariants, if the SMT solver fails, DistAI does not need to discard failed invariants, but knows to monotonically weaken them and try again with the solver, repeating the process until it eventually succeeds. We prove that DistAI is guaranteed to find the∃-free inductive invariant that proves the desired safety properties in finite time, if one exists. Our evaluation shows that DistAI successfully verifies 13 common distributed protocols automatically and outperforms alternative methods both in the number of protocols it verifies and the speed at which it does so, in some cases by more than two orders of magnitude.
more » « less
Full Text Available
Position: TrustLLM: Trustworthiness in Large Language Models

Huang, Yue; Sun, Lichao; Wang, Haoran; Wu, Siyuan; Zhang, Qihui; Li, Yuan; Gao, Chujie; Huang, Yixin; Lyu, Wenhan; Zhang, Yixuan; et al (July 2024, PMLR)

Full Text Available

« Prev Next »

Search for: All records