Flexible, Efficient, and Stable Adversarial Attacks on Machine Unlearning

Zhou, Zihan; Zhou, Yang; Zhang, Zijie; Lyu, Lingjuan; Yan, Da; Jin, Ruoming; Dou, Dejing

Citation Details

This content will become publicly available on July 15, 2026

Flexible, Efficient, and Stable Adversarial Attacks on Machine Unlearning

Machine unlearning (MU) aims to remove the influence of specific data points from trained models, enhancing compliance with privacy regulations. However, the vulnerability of basic MU models to malicious unlearning requests in adversarial learning environments has been largely overlooked. Existing adversarial MU attacks suffer from three key limitations: inflexibility due to pre-defined attack targets, inefficiency in handling multiple attack requests, and instability caused by non-convex loss functions. To address these challenges, we propose a Flexible, Efficient, and Stable Attack (DDPA). First, leveraging Carathéodory's theorem, we introduce a convex polyhedral approximation to identify points in the loss landscape where convexity approximately holds, ensuring stable attack performance. Second, inspired by simplex theory and John's theorem, we develop a regular simplex detection technique that maximizes coverage over the parameter space, improving attack flexibility and efficiency. We theoretically derive the proportion of the effective parameter space occupied by the constructed simplex. We evaluate the attack success rate of our DDPA method on real datasets against state-of-the-art machine unlearning attack methods. Our source code is available at https://github.com/zzz0134/DDPA. more »

Award ID(s):: 2414474

PAR ID:: 10631552

Author(s) / Creator(s):: Zhou, Zihan; Zhou, Yang; Zhang, Zijie; Lyu, Lingjuan; Yan, Da; Jin, Ruoming; Dou, Dejing

Publisher / Repository:: Proceedings of the 42nd International Conference on Machine Learning (ICML'25)

Date Published:: 2025-07-15

Format(s):: Medium: X

Location:: Vancouver, Canada

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on July 15, 2026
Conference Paper:
The DOI is not currently available.

More Like this