This content will become publicly available on March 19, 2026

Title: Data augmentation via diffusion model to enhance AI fairness
Introduction: AI fairness seeks to improve the transparency and explainability of AI systems by ensuring that their outcomes genuinely reflect the best interests of users. Data augmentation, which involves generating synthetic data from existing datasets, has gained significant attention as a solution to data scarcity. In particular, diffusion models have become a powerful technique for generating synthetic data, especially in fields like computer vision.
Methods: This paper explores the potential of diffusion models to generate synthetic tabular data to improve AI fairness. The Tabular Denoising Diffusion Probabilistic Model (Tab-DDPM), a diffusion model adaptable to any tabular dataset and capable of handling various feature types, was used with different amounts of generated data for augmentation. Additionally, sample reweighing from AIF360 was employed to further enhance fairness. Five traditional machine learning models were used to validate the proposed approach: Decision Tree (DT), Gaussian Naive Bayes (GNB), K-Nearest Neighbors (KNN), Logistic Regression (LR), and Random Forest (RF).
Results and discussion: Experimental results demonstrate that the synthetic data generated by Tab-DDPM improves fairness in binary classification.
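As a rough illustration of this kind of pipeline (not the authors' code), the sketch below appends already-sampled synthetic rows to the real training data, applies AIF360's Reweighing, and fits one of the five classifiers. The column names ("sex", "label"), the privileged/unprivileged groups, and the toy data generator are all assumptions; the Tab-DDPM sampling step itself is elided.

```python
import numpy as np
import pandas as pd
from aif360.datasets import BinaryLabelDataset
from aif360.algorithms.preprocessing import Reweighing
from aif360.metrics import BinaryLabelDatasetMetric
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def toy(n):
    # Stand-in for real rows / Tab-DDPM output (hypothetical schema).
    return pd.DataFrame({"sex": rng.integers(0, 2, n),
                         "x1": rng.normal(size=n),
                         "label": rng.integers(0, 2, n)})

df_real, df_synth = toy(400), toy(400)

# 1) Augment: append synthetic rows to the real training data.
df_train = pd.concat([df_real, df_synth], ignore_index=True)
dataset = BinaryLabelDataset(df=df_train, label_names=["label"],
                             protected_attribute_names=["sex"])

# 2) Reweigh samples (AIF360) to balance group/label combinations.
groups = dict(unprivileged_groups=[{"sex": 0}],
              privileged_groups=[{"sex": 1}])
dataset_rw = Reweighing(**groups).fit_transform(dataset)

# 3) Train one of the five classifiers; weight-aware ones take sample_weight.
clf = LogisticRegression(max_iter=1000)
clf.fit(dataset_rw.features, dataset_rw.labels.ravel(),
        sample_weight=dataset_rw.instance_weights)

# 4) Fairness of the augmented, reweighed training set (near 0 is fairer).
print(BinaryLabelDatasetMetric(dataset_rw,
                               **groups).statistical_parity_difference())
```

DT, GNB, and RF also accept sample_weight in fit; KNN does not, so for KNN the reweighing can act only indirectly through the augmented data.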
Award ID(s):
2323419
PAR ID:
10629220
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Frontiers
Date Published:
Journal Name:
Frontiers in Artificial Intelligence
Volume:
8
ISSN:
2624-8212
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Biased AI models result in unfair decisions. In response, a number of algorithmic solutions have been engineered to mitigate bias, among which the Synthetic Minority Oversampling Technique (SMOTE) has been studied to some extent. Although the SMOTE technique and its variants have great potential to help improve fairness, there is little theoretical justification for their success, and formal error and fairness bounds are not clearly given. This paper attempts to address both issues. We prove and demonstrate that synthetic data generated by oversampling underrepresented groups can mitigate algorithmic bias in AI models while keeping the predictive errors bounded. We further compare this technique to the existing state-of-the-art fair AI techniques on five datasets using a variety of fairness metrics. We show that this approach can effectively improve fairness even when there is a significant amount of label and selection bias, regardless of the baseline AI algorithm.
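One hedged way to realize "oversampling underrepresented groups" with off-the-shelf SMOTE (an illustration, not this paper's exact procedure) is to resample against a composite group-label target so every (group, label) cell is grown to the size of the largest one. The toy features, the protected attribute, and the composite encoding below are all assumptions.

```python
import numpy as np
from imblearn.over_sampling import SMOTE
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 4))
group = rng.integers(0, 2, 500)                   # protected attribute (toy)
label = (X[:, 0] + rng.normal(size=500) > 0).astype(int)

# Composite class id: one value per (group, label) cell. SMOTE's default
# strategy oversamples every non-majority class, equalizing all four cells.
cell = 2 * group + label
X_res, cell_res = SMOTE(random_state=0).fit_resample(X, cell)

label_res = cell_res % 2                          # recover labels
group_res = cell_res // 2                         # recover groups
clf = LogisticRegression(max_iter=1000).fit(X_res, label_res)
```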
  2. The proliferation of Artificial Intelligence (AI) models such as Generative Adversarial Networks (GANs) has brought impressive success in image synthesis. As generated images have become increasingly naturalistic and photo-realistic, GAN-synthesized images have spread widely over the Internet. This can improve content and media, but it also poses a threat to legitimacy, authenticity, and security. Moreover, an automated system that detects and recognizes GAN-generated images is valuable as an evaluation tool for image synthesis models, regardless of the input modality. To this end, we propose a framework for reliably distinguishing AI-generated images from real ones using Convolutional Neural Networks (CNNs). First, GAN-generated images were collected across different tasks and architectures to aid generalization. Then, transfer learning was applied. Finally, several Class Activation Maps (CAMs) were integrated to determine the discriminative regions that guided the classification model in its decision. Our approach achieved 100% accuracy on our dataset, Real or Synthetic Images (RSI), and superior accuracy on other datasets and configurations. Hence, it can be used as an evaluation tool in image generation. Our best detector was a pre-trained EfficientNetB4 fine-tuned on our dataset with a batch size of 64 and an initial learning rate of 0.001 for 20 epochs. Adam was used as the optimizer, and learning-rate reduction and data augmentation were incorporated.
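A hedged Keras sketch of a detector with the reported configuration (EfficientNetB4 backbone, Adam at 0.001, learning-rate reduction, light augmentation). The classification head and the tiny in-memory dataset are placeholders, not the authors' released code; the paper's runs used batches of 64 for 20 epochs.

```python
import numpy as np
import tensorflow as tf

# Toy stand-in data: (image, label) pairs; label 1 = GAN-generated.
x = np.random.rand(8, 380, 380, 3).astype("float32") * 255.0
y = np.random.randint(0, 2, 8).astype("float32")
train_ds = tf.data.Dataset.from_tensor_slices((x, y)).batch(4)
val_ds = train_ds  # placeholder validation split

base = tf.keras.applications.EfficientNetB4(
    include_top=False, weights="imagenet", input_shape=(380, 380, 3))

model = tf.keras.Sequential([
    tf.keras.layers.RandomFlip("horizontal"),        # data augmentation
    tf.keras.layers.RandomRotation(0.05),
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # real vs synthetic
])

model.compile(optimizer=tf.keras.optimizers.Adam(1e-3),  # lr from the paper
              loss="binary_crossentropy", metrics=["accuracy"])

model.fit(train_ds, validation_data=val_ds, epochs=20,
          callbacks=[tf.keras.callbacks.ReduceLROnPlateau(patience=2)])
```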
  3. Background: While most health-care providers now use electronic health records (EHRs) to document clinical care, many still treat them as digital versions of paper records. As a result, documentation often remains unstructured, with free-text entries in progress notes. This limits the potential for secondary use and analysis, as machine-learning and data-analysis algorithms are more effective with structured data. Objective: This study aims to use advanced artificial intelligence (AI) and natural language processing (NLP) techniques to improve diagnostic information extraction from clinical notes in a periodontal use case. By automating this process, the study seeks to reduce missing data in dental records and minimize the need for extensive manual annotation, a long-standing barrier to widespread NLP deployment in dental data extraction. Materials and Methods: This research utilizes large language models (LLMs), specifically Generative Pretrained Transformer 4, to generate synthetic medical notes for fine-tuning a RoBERTa model. This model was trained to better interpret and process dental language, with particular attention to periodontal diagnoses. Model performance was evaluated by manually reviewing 360 clinical notes randomly selected from each participating site's dataset. Results: The results demonstrated high accuracy of periodontal diagnosis data extraction, with sites 1 and 2 achieving a weighted average score of 0.97-0.98. This performance held across all dimensions of periodontal diagnosis: stage, grade, and extent. Discussion: Synthetic data effectively reduced manual annotation needs while preserving model quality. Generalizability across institutions suggests viability for broader adoption, though future work is needed to improve contextual understanding. Conclusion: The study highlights the potential transformative impact of AI and NLP on health-care research. Most clinical documentation (40%-80%) is free text. Scaling our method could enhance clinical data reuse.
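A minimal sketch of the fine-tuning step with Hugging Face transformers. The synthetic note text, the three-way label set, and every hyperparameter below are placeholders rather than the study's data or configuration.

```python
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

labels = ["stage I", "stage II", "stage III"]  # illustrative label set
notes = ["Generalized periodontitis, stage II, grade B."] * 32  # synthetic
ds = Dataset.from_dict({"text": notes, "label": [i % 3 for i in range(32)]})

tok = AutoTokenizer.from_pretrained("roberta-base")
ds = ds.map(lambda b: tok(b["text"], truncation=True,
                          padding="max_length", max_length=256), batched=True)

model = AutoModelForSequenceClassification.from_pretrained(
    "roberta-base", num_labels=len(labels))

args = TrainingArguments(output_dir="periodontal-roberta",
                         per_device_train_batch_size=8, num_train_epochs=3)
Trainer(model=model, args=args, train_dataset=ds).train()
```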
  4. An ensemble postprocessing method is developed to improve the probabilistic forecasts of extreme precipitation events across the conterminous United States (CONUS). The method combines a 3D vision transformer (ViT) for bias correction with a latent diffusion model (LDM), a generative artificial intelligence (AI) method, to postprocess 6-hourly precipitation ensemble forecasts and produce an enlarged generative ensemble that contains spatiotemporally consistent precipitation trajectories. These trajectories are expected to improve the characterization of extreme precipitation events and offer skillful multiday accumulated and 6-hourly precipitation guidance. The method is tested using the Global Ensemble Forecast System (GEFS) precipitation forecasts out to day 6 and is verified against the Climatology-Calibrated Precipitation Analysis (CCPA) data. Verification results indicate that the method generated skillful ensemble members with improved continuous ranked probability skill scores (CRPSSs) and Brier skill scores (BSSs) over the raw operational GEFS and a multivariate statistical postprocessing baseline. It showed skillful and reliable probabilities for events at extreme precipitation thresholds. Explainability studies were further conducted, which revealed the decision-making process of the method and confirmed its effectiveness for ensemble member generation. This work introduces a novel, generative AI-based approach to address the limitation of small numerical ensembles and the need for larger ensembles to identify extreme precipitation events. Significance Statement: We use a new artificial intelligence (AI) technique to improve extreme precipitation forecasts from a numerical weather prediction ensemble, generating more scenarios that better characterize extreme precipitation events. This AI-generated ensemble improved the accuracy of precipitation forecasts and probabilistic warnings for extreme precipitation events. The study explores AI methods to generate precipitation forecasts and explains the decision-making mechanisms of such AI techniques to prove their effectiveness.
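For reference, the headline verification metric can be computed directly from ensemble members: the sample CRPS for one case is E|X - y| - 0.5 E|X - X'|, and CRPSS = 1 - CRPS/CRPS_ref. The toy gamma-distributed ensembles below are stand-ins for raw GEFS members and the enlarged generative ensemble, not the study's data.

```python
import numpy as np

def crps_ensemble(members, obs):
    """Sample CRPS for one case: E|X - y| - 0.5 * E|X - X'|."""
    m = np.asarray(members, dtype=float)
    return (np.mean(np.abs(m - obs))
            - 0.5 * np.mean(np.abs(m[:, None] - m[None, :])))

rng = np.random.default_rng(0)
obs = 3.0                              # verifying 6-h precipitation amount
raw = rng.gamma(2.0, 1.0, size=31)     # small raw ensemble (stand-in)
gen = rng.gamma(2.0, 1.5, size=310)    # enlarged generative ensemble

crpss = 1.0 - crps_ensemble(gen, obs) / crps_ensemble(raw, obs)
print(f"CRPSS vs raw ensemble: {crpss:.3f}")  # > 0 means improved skill
```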
  5. Deep learning models rely heavily on extensive training data, but obtaining sufficient real-world data remains a major challenge in clinical fields. To address this, we explore methods for generating realistic synthetic multivariate fall data to supplement limited real-world samples collected from three fall-related datasets: SmartFallMM, UniMib, and K-Fall. We apply three conventional time-series augmentation techniques, a Diffusion-based generative AI method, and a novel approach that extracts fall segments from public video footage of older adults. A key innovation of our work is the exploration of two distinct approaches: video-based pose estimation to extract fall segments from public footage, and Diffusion models to generate synthetic fall signals. Both methods independently enable the creation of highly realistic and diverse synthetic data tailored to specific sensor placements. To our knowledge, these approaches, and especially their application to fall detection, represent rarely explored directions in this research area. To assess the quality of the synthetic data, we use quantitative metrics, including the Fréchet Inception Distance (FID), Discriminative Score, Predictive Score, Jensen–Shannon Divergence (JSD), and the Kolmogorov–Smirnov (KS) test, and visually inspect temporal patterns for structural realism. We observe that Diffusion-based synthesis produces the most realistic and distributionally aligned fall data. To further evaluate the impact of synthetic data, we train a long short-term memory (LSTM) model offline and test it in real time using the SmartFall App. Incorporating Diffusion-based synthetic data improves the offline F1-score by 7–10% and boosts real-time fall detection performance by 24%, confirming its value in enhancing model robustness and applicability in real-world settings.
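A hedged sketch of two of the distributional checks named above, the KS test and JSD over a shared histogram; the toy one-dimensional signals stand in for flattened real and Diffusion-generated sensor samples, and the bin count is an arbitrary choice.

```python
import numpy as np
from scipy.spatial.distance import jensenshannon
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, 5000)    # flattened real sensor samples (toy)
synth = rng.normal(0.1, 1.1, 5000)   # flattened synthetic samples (toy)

# Kolmogorov-Smirnov: are the marginal distributions distinguishable?
stat, p = ks_2samp(real, synth)

# JSD over a shared histogram; scipy's jensenshannon returns the JS
# *distance* (the square root of the divergence), so square it.
bins = np.histogram_bin_edges(np.concatenate([real, synth]), bins=50)
p_hist, _ = np.histogram(real, bins=bins, density=True)
q_hist, _ = np.histogram(synth, bins=bins, density=True)
jsd = jensenshannon(p_hist, q_hist) ** 2

print(f"KS stat = {stat:.3f} (p = {p:.3g}), JSD = {jsd:.4f}")
```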