DART: A Principled Approach to Adversarially Robust Unsupervised Domain Adaptation

Wang, Yunjuan; Hazimeh, Hussein; Ponomareva, Natalia; Kurakin, Alexey; Hammoud, Ibrahim; Arora, Raman

Citation Details

This content will become publicly available on April 1, 2026

DART: A Principled Approach to Adversarially Robust Unsupervised Domain Adaptation

In this work, we consider a setting where the goal is to achieve adversarial robustness on a target task, given only unlabeled training data from the task distribution, by leveraging a labeled training data from a different yet related source task distribution. The absence of the labels on training data for the target task poses a unique challenge as conventional adversarial robustness defenses cannot be directly applied. To address this challenge, we first bound the adversarial population 0-1 robust loss on the target task in terms of (i) empirical 0-1 loss on the source task, (ii) joint loss on source and target tasks of an ideal classifier, and (iii) a measure of worst-case domain divergence. Motivated by this bound, we develop a novel unified defense framework called Divergence-Aware adveRsarial Training (DART), which can be used in conjunction with a variety of standard UDA methods; e.g., DANN. DART is applicable to general threat models, including the popular \ell_p-norm model, and does not require heuristic regularizers or architectural changes. We also release DomainRobust, a testbed for evaluating robustness of UDA models to adversarial attacks. DomainRobust consists of 4 multidomain benchmark datasets (with 46 source-target pairs) and 7 meta-algorithms with a total of 11 variants. Our large-scale experiments demonstrate that, on average, DART significantly enhances model robustness on all benchmarks compared to the state of the art, while maintaining competitive standard accuracy. The relative improvement in robustness from DART reaches up to 29.2% on the source-target domain pairs considered. more »

Award ID(s):: 1943251

PAR ID:: 10572986

Author(s) / Creator(s):: Wang, Yunjuan; Hazimeh, Hussein; Ponomareva, Natalia; Kurakin, Alexey; Hammoud, Ibrahim; Arora, Raman

Publisher / Repository:: 3rd IEEE Conference on Secure and Trustworthy Machine Learning (2025)

Date Published:: 2025-04-01

ISSN:: 2169-3536

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on April 1, 2026
Conference Paper:
The DOI is not currently available.

More Like this