NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multi-Instance Adversarial Attack on GNN-Based Malicious Domain Detection

https://doi.org/10.1109/SP54263.2024.00006

Nazzal, Mahmoud; Khalil, Issa; Khreishah, Abdallah; Phan, NhatHai; Ma, Yao (May 2024, IEEE)

Full Text Available
Lifelong DP: Consistently Bounded Differential Privacy in Lifelong Machine Learning

Lai, Phung; Hu, Han; Phan, Hai; Jin, Ruoming; Thai, My; Chen, An (August 2023, Proceedings of The 1st Conference on Lifelong Learning Agents)

Full Text Available
XRand: Differentially Private Defense against Explanation-Guided Attacks

https://doi.org/10.1609/aaai.v37i10.26401

Nguyen, Truc; Lai, Phung; Phan, Hai; Thai, My T. (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Recent development in the field of explainable artificial intelligence (XAI) has helped improve trust in Machine-Learning-as-a-Service (MLaaS) systems, in which an explanation is provided together with the model prediction in response to each query. However, XAI also opens a door for adversaries to gain insights into the black-box models in MLaaS, thereby making the models more vulnerable to several attacks. For example, feature-based explanations (e.g., SHAP) could expose the top important features that a black-box model focuses on. Such disclosure has been exploited to craft effective backdoor triggers against malware classifiers. To address this trade-off, we introduce a new concept of achieving local differential privacy (LDP) in the explanations, and from that we establish a defense, called XRand, against such attacks. We show that our mechanism restricts the information that the adversary can learn about the top important features, while maintaining the faithfulness of the explanations.
more » « less
Full Text Available
Active Membership Inference Attack under Local Differential Privacy in Federated Learning

Nguyen, Truc; Lai, Phung; Tran, Khang; Phan, NhatHai; Thai, My (April 2023, AISTATS)

Full Text Available
Heterogeneous Randomized Response for Differential Privacy in Graph Neural Networks

https://doi.org/10.1109/BigData55660.2022.10020501

Tran, Khang; Lai, Phung; Phan, NhatHai; Khalil, Issa; Ma, Yao; Khreishah, Abdallah; Thai, My T.; Wu, Xintao (December 2022, 2022 IEEE International Conference on Big Data (Big Data))

Full Text Available
A Synergetic Attack against Neural Network Classifiers combining Backdoor and Adversarial Examples

https://doi.org/10.1109/BigData52589.2021.9671964

Liu, Guanxiong; Khalil, Issa; Khreishah, Abdallah; Phan, NhatHai (October 2021, IEEE International Conference on Big Data)

The pervasiveness of neural networks (NNs) in critical computer vision and image processing applications makes them very attractive for adversarial manipulation. A large body of existing research thoroughly investigates two broad categories of attacks targeting the integrity of NN models. The first category of attacks, commonly called Adversarial Examples, perturbs the model's inference by carefully adding noise into input examples. In the second category of attacks, adversaries try to manipulate the model during the training process by implanting Trojan backdoors. Researchers show that such attacks pose severe threats to the growing applications of NNs and propose several defenses against each attack type individually. However, such one-sided defense approaches leave potentially unknown risks in real-world scenarios when an adversary can unify different attacks to create new and more lethal ones bypassing existing defenses. In this work, we show how to jointly exploit adversarial perturbation and model poisoning vulnerabilities to practically launch a new stealthy attack, dubbed AdvTrojan. AdvTrojan is stealthy because it can be activated only when: 1) a carefully crafted adversarial perturbation is injected into the input examples during inference, and 2) a Trojan backdoor is implanted during the training process of the model. We leverage adversarial noise in the input space to move Trojan-infected examples across the model decision boundary, making it difficult to detect. The stealthiness behavior of AdvTrojan fools the users into accidentally trusting the infected model as a robust classifier against adversarial examples. AdvTrojan can be implemented by only poisoning the training data similar to conventional Trojan backdoor attacks. Our thorough analysis and extensive experiments on several benchmark datasets show that AdvTrojan can bypass existing defenses with a success rate close to 100% in most of our experimental scenarios and can be extended to attack federated learning as well as high-resolution images.
more » « less
Full Text Available
Continual Learning with Differential Privacy

https://doi.org/10.1007/978-3-030-92310-5_39

Desai, Pradnya; Lai, Phung; Phan, NhatHai; Thai, My T. (October 2021, International Conference on Neural Information Processing)

In this paper, we focus on preserving differential privacy (DP) in continual learning (CL), in which we train ML models to learn a sequence of new tasks while memorizing previous tasks. We first introduce a notion of continual adjacent databases to bound the sensitivity of any data record participating in the training process of CL. Based upon that, we develop a new DP-preserving algorithm for CL with a data sampling strategy to quantify the privacy risk of training data in the well-known Averaged Gradient Episodic Memory (A-GEM) approach by applying a moments accountant. Our algorithm provides formal guarantees of privacy for data records across tasks in CL. Preliminary theoretical analysis and evaluations show that our mechanism tightens the privacy loss while maintaining a promising model utility.
more » « less
Full Text Available

Search for: All records