DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning

Bao, Wenxuan; Pittaluga, Francesco; Kumar B G, Vijay; Bindschaedler, Vincent

Citation Details

Data augmentation techniques, such as simple image transformations and combinations, are highly effective at improving the generalization of computer vision models, especially when training data is limited. However, such techniques are fundamentally incompatible with differentially private learning approaches, due to the latter's built-in assumption that each training image's contribution to the learned model is bounded. In this paper, we investigate why naive applications of multi-sample data augmentation techniques, such as mixup, fail to achieve good performance and propose two novel data augmentation techniques specifically designed for the constraints of differentially private learning. Our first technique, DP-Mix_Self, achieves SoTA classification performance across a range of datasets and settings by performing mixup on self-augmented data. Our second technique, DP-Mix_Diff, further improves performance by incorporating synthetic data from a pre-trained diffusion model into the mixup process. more »

Award ID(s):: 2055123

PAR ID:: 10481576

Author(s) / Creator(s):: Bao, Wenxuan; Pittaluga, Francesco; Kumar B G, Vijay; Bindschaedler, Vincent

Publisher / Repository:: OpenReview

Date Published:: 2023-12-11

Journal Name:: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

Subject(s) / Keyword(s):: differential privacy deep learning data augmentation synthetic data diffusion models

Format(s):: Medium: X

Location:: New Orleans, LA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this