NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Revisiting Physical-World Adversarial Attack on Traffic Sign Recognition: A Commercial Systems Perspective

Wang, Ningfei; Xie, Shaoyuan; Sato, Takami; Luo, Yunpeng; Xu, Kaidi; Chen, Qi Alfred (February 2025, ISOC Network and Distributed System Security Symposium (NDSS))

Free, publicly-accessible full text available February 24, 2026
GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations

Duan, Jinhao; Zhang, Renming; Diffenderfer, James; Kailkhura, Bhavya; Sun, Lichao; Stengel-Eskin, Elias; Bansal, Mohit; Chen, Tianlong; Xu, Kaidi (December 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

As Large Language Models (LLMs) are integrated into critical real-world applications, their strategic and logical reasoning abilities are increasingly crucial. This paper evaluates LLMs' reasoning abilities in competitive environments through game-theoretic tasks, e.g., board and card games that require pure logic and strategic reasoning to compete with opponents. We first propose GTBench, a language-driven environment composing 10 widely-recognized tasks, across a comprehensive game taxonomy: complete versus incomplete information, dynamic versus static, and probabilistic versus deterministic scenarios. Then, we (1) Characterize the game-theoretic reasoning of LLMs; and (2) Perform LLM-vs.-LLM competitions as reasoning evaluation. We observe that (1) LLMs have distinct behaviors regarding various gaming scenarios; for example, LLMs fail in complete and deterministic games yet they are competitive in probabilistic gaming scenarios; (2) Most open-source LLMs, e.g., CodeLlama-34b-Instruct and Llama-2-70b-chat, are less competitive than commercial LLMs, e.g., GPT-4, in complex games, yet the recently released Llama-3-70b-Instruct makes up for this shortcoming. In addition, code-pretraining greatly benefits strategic reasoning, while advanced reasoning methods such as Chain-of-Thought (CoT) and Tree-of-Thought (ToT) do not always help. We further characterize the game-theoretic properties of LLMs, such as equilibrium and Pareto Efficiency in repeated games. Detailed error profiles are provided for a better understanding of LLMs' behavior. We hope our research provides standardized protocols and serves as a foundation to spur further explorations in the strategic reasoning of LLMs.
more » « less
Full Text Available
GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing

https://doi.org/10.18653/v1/2025.naacl-long.287

Duan, Jinhao; Zhao, Xinyu; Zhang, Zhuoxuan; Ko, Eunhye Grace; Boddy, Lily; Wang, Chenan; Li, Tianhao; Rasgon, Alexander; Hong, Junyuan; Lee, Min Kyung; et al (January 2025, Association for Computational Linguistics)

Full Text Available
A Survey on Large Language Model (LLM) Security and Privacy: The Good, The Bad, and The Ugly

https://doi.org/10.1016/j.hcc.2024.100211

Yao, Yifan; Duan, Jinhao; Xu, Kaidi; Cai, Yuanfang; Sun, Zhibo; Zhang, Yue (June 2024, High-Confidence Computing)

Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized natural language understanding and generation. They possess deep language comprehension, human-like text generation capabilities, contextual awareness, and robust problem-solving skills, making them invaluable in various domains (e.g., search engines, customer support, translation). In the meantime, LLMs have also gained traction in the security community, revealing security vulnerabilities and showcasing their potential in security-related tasks. This paper explores the intersection of LLMs with security and privacy. Specifically, we investigate how LLMs positively impact security and privacy, potential risks and threats associated with their use, and inherent vulnerabilities within LLMs. Through a comprehensive literature review, the paper categorizes the papers into “The Good” (beneficial LLM applications), “The Bad” (offensive applications), and “The Ugly” (vulnerabilities of LLMs and their defenses). We have some interesting findings. For example, LLMs have proven to enhance code security (code vulnerability detection) and data privacy (data confidentiality protection), outperforming traditional methods. However, they can also be harnessed for various attacks (particularly user-level attacks) due to their human-like reasoning abilities. We have identified areas that require further research efforts. For example, Research on model and parameter extraction attacks is limited and often theoretical, hindered by LLM parameter scale and confidentiality. Safe instruction tuning, a recent development, requires more exploration. We hope that our work can shed light on the LLMs’ potential to both bolster and jeopardize cybersecurity.
more » « less
Full Text Available
E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data

https://doi.org/10.1109/CVPRW63382.2024.00437

Azizpour, Aref; Nguyen, Tai D; Shrestha, Manil; Xu, Kaidi; Kim, Edward; Stamm, Matthew C (June 2024, IEEE)

Full Text Available
Stable Unlearnable Example: Enhancing the Robustness of Unlearnable Examples via Stable Error-Minimizing Noise

https://doi.org/10.1609/aaai.v38i4.28169

Liu, Yixin; Xu, Kaidi; Chen, Xun; Sun, Lichao (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

The open sourcing of large amounts of image data promotes the development of deep learning techniques. Along with this comes the privacy risk of these image datasets being exploited by unauthorized third parties to train deep learning models for commercial or illegal purposes. To avoid the abuse of data, a poisoning-based technique, unlearnable example, has been proposed to significantly degrade the generalization performance of models by adding imperceptible noise to the data. To further enhance its robustness against adversarial training, existing works leverage iterative adversarial training on both the defensive noise and the surrogate model. However, it still remains unknown whether the robustness of unlearnable examples primarily comes from the effect of enhancement in the surrogate model or the defensive noise. Observing that simply removing the adversarial perturbation on the training process of the defensive noise can improve the performance of robust unlearnable examples, we identify that solely the surrogate model's robustness contributes to the performance. Furthermore, we found a negative correlation exists between the robustness of defensive noise and the protection performance, indicating defensive noise's instability issue. Motivated by this, to further boost the robust unlearnable example, we introduce Stable Error-Minimizing noise (SEM), which trains the defensive noise against random perturbation instead of the time-consuming adversarial perturbation to improve the stability of defensive noise. Through comprehensive experiments, we demonstrate that SEM achieves a new state-of-the-art performance on CIFAR-10, CIFAR-100, and ImageNet Subset regarding both effectiveness and efficiency.
more » « less
Full Text Available
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Hong, Junyuan; Duan, Jinhao; Zhang, Chenhui; Li, Zhangheng; Xie, Chulin; Lieberman, Kelsey; Diffenderfer, James; Bartoldson, Brian; Jaiswal, Ajay; Xu, Kaidi; et al (July 2024, International Conference on Machine Learning (ICML 2024))

Full Text Available
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Hong, Junyuan; Duan, Jinhao; Zhang, Chenhui; Li, Zhangheng; Xie, Chulin; Lieberman, Kelsey; Diffenderfer, James; Bartoldson, Brian; Jaiswal, Ajay; Xu, Kaidi; et al (July 2024, International Conference on Machine Learning (ICML))

Full Text Available
ReTA: Recursively Thinking Ahead to Improve the Strategic Reasoning of Large Language Models

https://doi.org/10.18653/v1/2024.naacl-long.123

Duan, Jinhao; Wang, Shiqi; Diffenderfer, James; Sun, Lichao; Chen, Tianlong; Kailkhura, Bhavya; Xu, Kaidi (January 2024, Association for Computational Linguistics)

Full Text Available
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Hong, Junyuan; Duan, Jinhao; Zhang, Chenhui; Li, Zhangheng; Xie, Chulin; Lieberman, Kelsey; Diffenderfer, James; Bartoldson, Brian R; Jaiswal, Ajay Kumar; Xu, Kaidi; et al (July 2024, Proceedings of Machine Learning Research)

Compressing high-capability Large Language Models (LLMs) has emerged as a favored strategy for resource-efficient inferences. While state-of-the-art (SoTA) compression methods boast impressive advancements in preserving benign task performance, the potential risks of compression in terms of safety and trustworthiness have been largely neglected. This study conducts the first, thorough evaluation of three (3) leading LLMs using five (5) SoTA compression techniques across eight (8) trustworthiness dimensions. Our experiments highlight the intricate interplay between compression and trustworthiness, revealing some interesting patterns. We find that quantization is currently a more effective approach than pruning in achieving efficiency and trustworthiness simultaneously. For instance, a 4-bit quantized model retains the trustworthiness of its original counterpart, but model pruning significantly degrades trustworthiness, even at 50% sparsity. Moreover, employing quantization within a moderate bit range could unexpectedly improve certain trustworthiness dimensions such as ethics and fairness. Conversely, extreme quantization to very low bit levels (3 bits) tends to reduce trustworthiness significantly. This increased risk cannot be uncovered by looking at benign performance alone, in turn, mandating comprehensive trustworthiness evaluation in practice. These findings culminate in practical recommendations for simultaneously achieving high utility, efficiency, and trustworthiness in LLMs.
more » « less
Full Text Available

« Prev Next »

Search for: All records