Joint Imbalance Adaptation for Radiology Report Generation

Li, Wang; Han, Guangzeng; Wu, Yuexin; Huang, I-Chan; Huang, Xiaolei

doi:10.1007/s41666-025-00205-9

Citation Details

This content will become publicly available on June 20, 2026

Joint Imbalance Adaptation for Radiology Report Generation

Radiology report generation, translating radiological images into precise and clinically relevant description, may face the data imbalance challenge — medical tokens appear less frequently than regular tokens, and normal entries are significantly more than abnormal ones. However, very few studies consider the imbalance issues, not even with conjugate imbalance factors. In this study, we propose a Joint Imbalance Adaptation (JIMA) model to promote task robustness by leveraging token and label imbalance. We employ a hard-to-easy learning strategy that mitigates overfitting to frequent labels and tokens, thereby encouraging the model to focus more on infrequent labels and clinical tokens. JIMA presents notable improvements (16.75–50.50% on average) across evaluation metrics on IU X-ray and MIMIC-CXR datasets. Our ablation analysis and human evaluations show the improvements mainly come from enhancing performance on infrequent tokens and abnormal radiological entries, which can also lead to more clinically accurate reports. While data imbalance (e.g., infrequent tokens and abnormal labels) can lead to the underperformance of radiology report generation, our imbalance learning strategy opens promising directions on how to encounter data imbalance by reducing overfitting on frequent patterns and underfitting on infrequent patterns. more »

Award ID(s):: 2245920

PAR ID:: 10621308

Author(s) / Creator(s):: Li, Wang; Han, Guangzeng; Wu, Yuexin; Huang, I-Chan; Huang, Xiaolei

Publisher / Repository:: Springer Nature

Date Published:: 2025-06-20

Journal Name:: Journal of Healthcare Informatics Research

ISSN:: 2509-4971

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on June 20, 2026
Journal Article:
https://doi.org/10.1007/s41666-025-00205-9

More Like this