NSF PAR Search | NSF Public Access Repository

Improving model fairness in image-based computer-aided diagnosis

https://doi.org/10.1038/s41467-023-41974-4

Lin, Mingquan; Li, Tianhao; Yang, Yifan; Holste, Gregory; Ding, Ying; Van Tassel, Sarah H.; Kovacs, Kyle; Shih, George; Wang, Zhangyang; Lu, Zhiyong; et al (December 2023, Nature Communications)

Deep learning has become a popular tool for computer-aided diagnosis using medical images, sometimes matching or exceeding the performance of clinicians. However, these models can also reflect and amplify human bias, potentially resulting in inaccurate or missed diagnoses. Despite this concern, the problem of improving model fairness in medical image classification by deep learning has yet to be fully studied. To address this issue, we propose an algorithm that leverages the marginal pairwise equal opportunity to reduce bias in medical image classification. Our evaluations across four tasks using four independent large-scale cohorts demonstrate that our proposed algorithm not only improves fairness in individual and intersectional subgroups but also maintains overall performance. Specifically, relative to the baseline model, our proposed model reduced the pairwise fairness difference by over 35%, while the relative change in AUC value was typically within 1%. By reducing the bias generated by deep learning models, our proposed approach can potentially alleviate concerns about the fairness and reliability of image-based computer-aided diagnosis.
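As a rough illustration of the kind of quantity the abstract reports, the sketch below computes pairwise gaps in true positive rate between subgroups (one common reading of "equal opportunity") and a relative change against a baseline. The function names, the use of absolute TPR gaps, and the toy data are assumptions made for illustration; they are not taken from the paper, whose exact metric and algorithm may differ.

```python
from itertools import combinations


def true_positive_rate(y_true, y_pred):
    """Fraction of actual positives (label 1) that were predicted positive."""
    preds_on_positives = [p for t, p in zip(y_true, y_pred) if t == 1]
    if not preds_on_positives:
        return float("nan")  # undefined when a group has no positives
    return sum(preds_on_positives) / len(preds_on_positives)


def pairwise_tpr_gaps(y_true, y_pred, groups):
    """Absolute TPR difference for every pair of subgroups (hypothetical helper)."""
    by_group = {}
    for t, p, g in zip(y_true, y_pred, groups):
        labels, preds = by_group.setdefault(g, ([], []))
        labels.append(t)
        preds.append(p)
    tpr = {g: true_positive_rate(ts, ps) for g, (ts, ps) in by_group.items()}
    return {(a, b): abs(tpr[a] - tpr[b]) for a, b in combinations(sorted(tpr), 2)}


def relative_change(new_value, baseline_value):
    """Signed relative change of new_value with respect to baseline_value."""
    return (new_value - baseline_value) / baseline_value


# Toy data (made up): binary labels, binary predictions, subgroup membership.
y_true = [1, 1, 0, 1, 1, 0]
y_pred = [1, 0, 0, 1, 1, 1]
groups = ["a", "a", "a", "b", "b", "b"]

gaps = pairwise_tpr_gaps(y_true, y_pred, groups)
```

Under this reading, a negative `relative_change(gap_new, gap_baseline)` would indicate a smaller fairness gap than the baseline, and a value near zero for AUC would indicate that overall performance is maintained.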
Full Text Available
PIPA: Preference Alignment as Prior-Informed Statistical Estimation

Li, Junbo; Wang, Zhangyang; Liu, Qiang (July 2025, International Conference on Machine Learning (ICML))

Full Text Available
HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training

Kim, Geon-Woo; Li, Junbo; Gandham, Shashidhar; Baldonado, Omar; Gangidi, Adithya; Balaji, Pavan; Wang, Zhangyang; Akella, Aditya (July 2025, International Conference on Machine Learning (ICML))

Full Text Available
Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

https://doi.org/10.1016/j.media.2024.103224

Holste, Gregory; Zhou, Yiliang; Wang, Song; Jaiswal, Ajay; Lin, Mingquan; Zhuge, Sherry; Yang, Yuzhe; Kim, Dongkyun; Nguyen-Mau, Trong-Hieu; Tran, Minh-Triet; et al (October 2024, Medical Image Analysis)

Full Text Available
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Hong, Junyuan; Duan, Jinhao; Zhang, Chenhui; Li, Zhangheng; Xie, Chulin; Lieberman, Kelsey; Diffenderfer, James; Bartoldson, Brian; Jaiswal, Ajay; Xu, Kaidi; et al (July 2024, International Conference on Machine Learning (ICML))

Full Text Available
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer

Hong, Junyuan; Wang, Jiachen; Zhang, Chenhui; Li, Zhangheng; Li, Bo; Wang, Zhangyang (May 2024, International Conference on Learning Representations (ICLR) 2024)

Full Text Available
Lowering the Pre-training Tax for Gradient-based Subset Training: A Lightweight Distributed Pre-Training Toolkit

Ro, Yeonju; Wang, Zhangyang; Chidambaram, Vijay; Akella, Aditya (July 2023, International Conference on Machine Learning (ICML))

Full Text Available
Safe and Robust Watermark Injection with a Single OoD Image

Yu, Shuyang; Hong, Junyuan; Zhang, Haobo; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (May 2023, International Conference on Learning Representations (ICLR) 2023)

Full Text Available
Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection

Yu, Shuyang; Hong, Junyuan; Wang, Haotao; Wang, Zhangyang; Zhou, Jiayu (April 2023, International Conference on Learning Representations (ICLR))

Deep neural networks have witnessed huge successes in many challenging prediction tasks, yet they often suffer from out-of-distribution (OoD) samples, misclassifying them with high confidence. Recent advances show promising OoD detection performance for centralized training; however, OoD detection in federated learning (FL) is largely overlooked, even though many security-sensitive applications such as autonomous driving and voice recognition authorization are commonly trained using FL for data privacy concerns. The main challenge that prevents previous state-of-the-art OoD detection methods from being incorporated into FL is that they require a large amount of real OoD samples. However, in real-world scenarios, such large-scale OoD training data can be costly or even infeasible to obtain, especially for resource-limited local devices. On the other hand, a notorious challenge in FL is data heterogeneity, where each client collects non-independent and identically distributed (non-iid) data. We propose to take advantage of such heterogeneity and turn the curse into a blessing that facilitates OoD detection in FL. The key is that for each client, non-iid data from other clients (unseen external classes) can serve as an alternative to real OoD samples. Specifically, we propose a novel Federated Out-of-Distribution Synthesizer (FOSTER), which learns a class-conditional generator to synthesize virtual external-class OoD samples and maintains the data confidentiality and communication efficiency required by FL. Experimental results show that our method outperforms the state-of-the-art by 2.49%, 2.88%, 1.42% AUROC, and 0.01%, 0.89%, 1.74% ID accuracy, on CIFAR-10, CIFAR-100, and STL10, respectively.
Full Text Available
How Robust is Your Fairness? Evaluating and Sustaining Fairness under Unseen Distribution Shifts

Wang, Haotao; Hong, Junyuan; Zhou, Jiayu; Wang, Zhangyang (March 2023, Transactions on Machine Learning Research)

Concerns about deep learning fairness have grown in recent years. Existing fairness-aware machine learning methods mainly focus on the fairness of in-distribution data. However, in real-world applications, it is common to have a distribution shift between the training and test data. In this paper, we first show that the fairness achieved by existing methods can be easily broken by slight distribution shifts. To solve this problem, we propose a novel fairness learning method termed CUrvature MAtching (CUMA), which can achieve robust fairness generalizable to unseen domains with unknown distributional shifts. Specifically, CUMA encourages the model to have similar generalization ability on the majority and minority groups by matching the loss curvature distributions of the two groups. We evaluate our method on three popular fairness datasets. Compared with existing methods, CUMA achieves superior fairness under unseen distribution shifts, without sacrificing either the overall accuracy or the in-distribution fairness.
Full Text Available
