Learning Robust and Privacy-Preserving Representations via Information Theory

Zhang, Binghui; Noorbakhsh, Sayedeh Leila; Dong, Yun; Hong, Yuan; Wang, Binghui

doi:10.1609/AAAI.V39I21.34392

Citation Details

This content will become publicly available on April 11, 2026

Learning Robust and Privacy-Preserving Representations via Information Theory

Machine learning models are vulnerable to both security attacks (e.g., adversarial examples) and privacy attacks (e.g., private attribute inference). We take the first step to mitigate both the security and privacy attacks, and maintain task utility as well. Particularly, we propose an information-theoretic framework to achieve the goals through the lens of representation learning, i.e., learning representations that are robust to both adversarial examples and attribute inference adversaries. We also derive novel theoretical results under our framework, e.g., the inherent trade-off between adversarial robustness/utility and attribute privacy, and guaranteed attribute privacy leakage against attribute inference adversaries. more »

Award ID(s):: 2326341 2308730 2302689

PAR ID:: 10631551

Author(s) / Creator(s):: Zhang, Binghui; Noorbakhsh, Sayedeh Leila; Dong, Yun; Hong, Yuan; Wang, Binghui

Publisher / Repository:: AAAI

Date Published:: 2025-04-11

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 39

Issue:: 21

ISSN:: 2159-5399

Page Range / eLocation ID:: 22363 to 22371

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on April 11, 2026
Journal Article:
https://doi.org/10.1609/AAAI.V39I21.34392

More Like this