NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

You Can't Steal Nothing: Mitigating Prompt Leakages in LLMs via System Vectors

https://doi.org/10.1145/3719027.3765124

Cao, Bochuan; Li, Changjiang; Cao, Yuanpu; Ge, Yameng; Wang, Ting; Chen, Jinghui (November 2025, ACM)

Free, publicly-accessible full text available November 19, 2026
RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models.

Jiang, T; Li, C; Ma, F; Wang, T (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction

Jiang, T; Wang, Z; Liang, J; Li, C; Wang, Y; Wang, T (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction

Jiang, T; Wang, Z; Liang, J; Li, C; Wang, Y; Wang, T (January 2025, International Conference on Learning Representations)

Full Text Available
RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models

Jiang, T; Li, C; Ma, F; Wang, T (January 2025, International Conference on Learning Representations)

Full Text Available
Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models

Li, C; Pang, R; Cao, B; Chen, J; Ma, F; Ji, S; Wang, T (January 2025, USENIX Security Symposium (Security’25))

Full Text Available
Shadow-Activated Backdoor Attacks on Multimodal Large Language Models

https://doi.org/10.18653/v1/2025.findings-acl.248

Yin, Ziyi; Ye, Muchao; Cao, Yuanpu; Wang, Jiaqi; Chang, Aofei; Liu, Han; Chen, Jinghui; Wang, Ting; Ma, Fenglong (January 2025, Association for Computational Linguistics)

Full Text Available
Model Extraction Attacks Revisited

https://doi.org/10.1145/3634737.3657002

Liang, Jiacheng; Pang, Ren Pang; Li, Changjiang; Wang, Ting (July 2024, ACM)

Full Text Available
Generative AI in the Wild: Prospects, Challenges, and Strategies

https://doi.org/10.1145/3613904.3642160

Sun, Yuan; Jang, Eunchae; Ma, Fenglong; Wang, Ting (May 2024, CHI '24: Proceedings of the CHI Conference on Human Factors in Computing Systems)

Full Text Available
VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models

https://doi.org/10.1609/aaai.v38i7.28499

Yin, Ziyi; Ye, Muchao; Zhang, Tianrong; Wang, Jiaqi; Liu, Han; Chen, Jinghui; Wang, Ting; Ma, Fenglong (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

Visual Question Answering (VQA) is a fundamental task in computer vision and natural language process fields. Although the “pre-training & finetuning” learning paradigm significantly improves the VQA performance, the adversarial robustness of such a learning paradigm has not been explored. In this paper, we delve into a new problem: using a pre-trained multimodal source model to create adversarial image-text pairs and then transferring them to attack the target VQA models. Correspondingly, we propose a novel VQATTACK model, which can iteratively generate both im- age and text perturbations with the designed modules: the large language model (LLM)-enhanced image attack and the cross-modal joint attack module. At each iteration, the LLM-enhanced image attack module first optimizes the latent representation-based loss to generate feature-level image perturbations. Then it incorporates an LLM to further enhance the image perturbations by optimizing the designed masked answer anti-recovery loss. The cross-modal joint attack module will be triggered at a specific iteration, which updates the image and text perturbations sequentially. Notably, the text perturbation updates are based on both the learned gradients in the word embedding space and word synonym-based substitution. Experimental results on two VQA datasets with five validated models demonstrate the effectiveness of the proposed VQATTACK in the transferable attack setting, compared with state-of-the-art baselines. This work revealsa significant blind spot in the “pre-training & fine-tuning” paradigm on VQA tasks. The source code can be found in the link https://github.com/ericyinyzy/VQAttack.
more » « less
Full Text Available

« Prev Next »

Search for: All records