NSF PAR Search | NSF Public Access Repository

Don't listen to me: understanding and exploring jailbreak prompts of large language models

Yu, Zhiyuan; Liu, Xiaogeng; Liang, Shunning; Cameron, Zach; Xiao, Chaowei; Zhang, Ning (August 2025, IEEE security privacy)

Free, publicly-accessible full text available August 14, 2026
PhySense: Defending Physically Realizable Attacks for Autonomous Systems via Consistency Reasoning

https://doi.org/10.1145/3658644.3690236

Yu, Zhiyuan; Li, Ao; Wen, Ruoyao; Chen, Yijia; Zhang, Ning (December 2024, ACM)

Full Text Available
Leveraging mathematical modeling framework to guide regimen strategy for phage therapy

https://doi.org/10.1371/journal.pcsy.0000015

Yu, Zhiyuan; Luong, Tiffany; Banuelos, Selenne; Sue, Andrew; Ryu, Hwayeon; Segal, Rebecca; Roach, Dwayne R; Huang, Qimin (November 2024, PLOS Complex Systems)
Kim, Jaehee (Ed.)
Bacteriophage (phage) cocktail therapy is increasingly relied upon to treat antibiotic-resistant infections. Understanding the complex kinetics among phages, target bacteria, and the emergence of phage resistance remains a hurdle to successful clinical outcomes. Building upon previous mathematical concepts, we develop biologically-motivated nonlinear ordinary differential equation models to explore single, cocktail, and sequential phage treatment modalities. While the optimal pairwise phage treatment strategy was the double simultaneous administration of two highly potent and asymmetrically binding phage strains, it appears unable to prevent the evolution of resistance. This treatment regimen did have a greater lysis efficiency, promoted higher phage population sizes, reduced bacterial density the most, and suppressed the evolution of resistance the longest compared to all other treatment strategies tested. Conversely, the combination of phages with polar potencies allows the more efficiently replicating phages to monopolize susceptible host cells, thereby quickly negating the intended compounding effect of cocktails. Together, we demonstrate that a biologically-motivated modeling-based framework can be leveraged to quantify the effects of each phage’s properties to more precisely predict treatment responses.
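For readers unfamiliar with this class of model, the sketch below shows what a nonlinear bacteria-phage ODE system can look like. It is a generic single-phage textbook form (logistic bacterial growth, mass-action infection, phage burst and decay) integrated with forward Euler; it is not the model from the paper, and every parameter name and value here is a hypothetical placeholder.

```python
def simulate(bacteria, phage, steps=1000, dt=0.01,
             growth=1.0, capacity=1e6, adsorption=1e-6,
             burst=50.0, decay=0.5):
    """Forward-Euler integration of a generic bacteria-phage system.

    All rates are illustrative assumptions, not values from the paper.
    """
    for _ in range(steps):
        # Mass-action infection term: the nonlinear coupling between populations.
        infection = adsorption * bacteria * phage
        d_bacteria = growth * bacteria * (1.0 - bacteria / capacity) - infection
        d_phage = burst * infection - decay * phage
        # Clamp at zero so a coarse Euler step cannot produce negative populations.
        bacteria = max(bacteria + dt * d_bacteria, 0.0)
        phage = max(phage + dt * d_phage, 0.0)
    return bacteria, phage


no_phage = simulate(1e3, 0.0)
with_phage = simulate(1e3, 1e4)
```

Comparing `no_phage` with `with_phage` gives a rough sense of how a treatment term changes the bacterial trajectory. A cocktail or resistance model would add further state variables, which this sketch leaves out.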
Full Text Available
Please Tell Me More: Privacy Impact of Explainability through the Lens of Membership Inference Attack

Liu, Han; Wu, Yuhao; Yu, Zhiyuan; Zhang, Ning (May 2024, 2024 IEEE Symposium on Security and Privacy (SP))

Explainability is increasingly recognized as an enabling technology for the broader adoption of machine learning (ML), particularly for safety-critical applications. This has given rise to explainable ML, which seeks to enhance the explainability of neural networks through the use of explanators. Yet, the pursuit of better explainability inadvertently leads to increased security and privacy risks. While there has been considerable research into the security risks of explainable ML, its potential privacy risks remain under-explored. To bridge this gap, we present a systematic study of privacy risks in explainable ML through the lens of membership inference. Building on the observation that, besides the accuracy of the model, robustness also exhibits observable differences between member samples and non-member samples, we develop a new membership inference attack. This attack extracts additional membership features from changes in model confidence under different levels of perturbation, guided by the importance highlighted by the attribution maps in the explanators. Intuitively, perturbing important features generally results in a bigger loss in confidence for member samples. Using the member/non-member differences in both model performance and robustness, an attack model is trained to infer membership. We evaluated our approach with seven popular explanators across various benchmark models and datasets. Our attack demonstrates that there is non-trivial privacy leakage in current explainable ML methods. Furthermore, this leakage persists even if the attacker lacks knowledge of the training datasets or target model architectures. Lastly, we also found that existing model and output-based defense mechanisms are not effective in mitigating this new attack.
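The feature-extraction idea described in this abstract can be illustrated with a small sketch: rank features by attribution, perturb the top-k at several levels, and record the resulting confidence drops. This is only an illustration under stated assumptions, not the authors' implementation. The function names, the zero-masking perturbation, the choice of levels, and the toy linear model are all hypothetical.

```python
from typing import Callable, List, Sequence


def confidence_drop_features(
    predict_confidence: Callable[[Sequence[float]], float],
    sample: Sequence[float],
    attribution: Sequence[float],
    levels: Sequence[int] = (1, 2, 3),
) -> List[float]:
    """Confidence drop after masking the top-k attributed features, for each k."""
    base = predict_confidence(sample)
    # Rank feature indices from most to least important by absolute attribution.
    ranked = sorted(range(len(sample)),
                    key=lambda i: abs(attribution[i]), reverse=True)
    drops = []
    for k in levels:
        perturbed = list(sample)
        for i in ranked[:k]:
            perturbed[i] = 0.0  # assumption: zero-masking stands in for the perturbation
        drops.append(base - predict_confidence(perturbed))
    return drops


def toy_confidence(x: Sequence[float]) -> float:
    """Hypothetical stand-in for a target model's confidence output."""
    weights = [0.5, 0.3, 0.2]
    score = sum(w * v for w, v in zip(weights, x))
    return max(0.0, min(1.0, score))


features = confidence_drop_features(
    toy_confidence, [1.0, 1.0, 1.0], attribution=[0.5, 0.3, 0.2]
)
```

In an attack of the kind the abstract describes, vectors like `features` would be computed for known members and non-members and then passed to a separately trained classifier. That training step is omitted here.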
Full Text Available
AntiFake: Using Adversarial Audio to Prevent Unauthorized Speech Synthesis

https://doi.org/10.1145/3576915.3623209

Yu, Zhiyuan; Zhai, Shixuan; Zhang, Ning (November 2023, Proceedings of the ACM Conference on Computer and Communications Security)

The rapid development of deep neural networks and generative AI has catalyzed growth in realistic speech synthesis. While this technology has great potential to improve lives, it also leads to the emergence of "DeepFake" where synthesized speech can be misused to deceive humans and machines for nefarious purposes. In response to this evolving threat, there has been significant interest in mitigating it through DeepFake detection. Complementary to the existing work, we propose to take a preventative approach and introduce AntiFake, a defense mechanism that relies on adversarial examples to prevent unauthorized speech synthesis. To ensure transferability to attackers' unknown synthesis models, an ensemble learning approach is adopted to improve the generalizability of the optimization process. To validate the efficacy of the proposed system, we evaluated AntiFake against five state-of-the-art synthesizers using real-world DeepFake speech samples. The experiments indicated that AntiFake achieved over 95% protection rate even against unknown black-box models. We have also conducted usability tests involving 24 human participants to ensure the solution is accessible to diverse populations.
CODEIPPROMPT: intellectual property infringement assessment of code language models

Yu, Zhiyuan; Wu, Yuhao; Zhang, Ning; Wang, Chenguang; Vorobeychik, Yevgeniy; Xiao, Chaowei (July 2023, Proceedings of the 40th International Conference on Machine Learning)

Recent advances in large language models (LMs) have facilitated their ability to synthesize programming code. However, they have also raised concerns about intellectual property (IP) rights violations. Despite the significance of this issue, it remains relatively unexplored. In this paper, we aim to bridge the gap by presenting CODEIPPROMPT, a platform for automatic evaluation of the extent to which code language models may reproduce licensed programs. It comprises two key components: prompts constructed from a licensed code database to elicit LMs to generate IP-violating code, and a measurement tool to evaluate the extent of IP violation of code LMs. We conducted an extensive evaluation of existing open-source code LMs and commercial products, and revealed the prevalence of IP violations in all these models. We further identified that the root cause is the substantial proportion of the training corpus subject to restrictive licenses, resulting from both intentional inclusion and inconsistent licensing practices in the real world. To address this issue, we also explored potential mitigation strategies, including fine-tuning and dynamic token filtering. Our study provides a testbed for evaluating the IP violation issues of existing code generation platforms and stresses the need for a better mitigation strategy.
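The abstract does not say how the measurement tool scores overlap between generated and licensed code. As a rough illustration of the general idea only, the sketch below uses token-set Jaccard similarity as a stand-in. The function name and the tokenization rule are assumptions, and a real assessment would likely need a much more robust code-similarity method.

```python
import re


def token_jaccard(generated: str, reference: str) -> float:
    """Jaccard similarity between the token sets of two code snippets.

    A deliberately simple stand-in, not the paper's measurement tool.
    """
    def tokenize(source: str) -> set:
        # Identifiers, integer literals, and single punctuation characters.
        return set(re.findall(r"[A-Za-z_]\w*|\d+|\S", source))

    a, b = tokenize(generated), tokenize(reference)
    if not a and not b:
        return 1.0  # two empty snippets are treated as identical
    return len(a & b) / len(a | b)


same = token_jaccard("x = 1", "x = 1")
different = token_jaccard("alpha", "beta")
```

A score near 1 from such a measure would only flag a candidate for closer inspection. Set-based overlap ignores token order and structure, so it cannot by itself establish that code was reproduced.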
Full Text Available
SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples

https://doi.org/10.1109/CVPR52729.2023.00498

Liu, Han; Wu, Yuhao; Yu, Zhiyuan; Vorobeychik, Yevgeniy; Zhang, Ning (June 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
PowerTouch: A Security Objective-Guided Automation Framework for Generating Wired Ghost Touch Attacks on Touchscreens

https://doi.org/10.1145/3508352.3549395

Zhu, Huifeng; Yu, Zhiyuan; Cao, Weidong; Zhang, Ning; Zhang, Xuan (October 2022, Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design)

Full Text Available
