NSF PAR Search | NSF Public Access Repository


MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Xia, P; Zhu, K; Li, H; Wang, T; Shi, W; Wang, S; Zhang, L; Zou, J; Yao, H (April 2025, ICLR)

Artificial Intelligence (AI) has demonstrated significant potential in healthcare, particularly in disease diagnosis and treatment planning. Recent progress in Medical Large Vision-Language Models (Med-LVLMs) has opened up new possibilities for interactive diagnostic tools. However, these models often suffer from factual hallucination, which can lead to incorrect diagnoses. Fine-tuning and retrieval-augmented generation (RAG) have emerged as methods to address these issues. However, the limited amount of high-quality data and distribution shifts between training data and deployment data limit the application of fine-tuning methods. Although RAG is lightweight and effective, existing RAG-based approaches do not generalize sufficiently across medical domains and can cause misalignment issues, both between modalities and between the model and the ground truth. In this paper, we propose a versatile multimodal RAG system, MMed-RAG, designed to enhance the factuality of Med-LVLMs. Our approach introduces a domain-aware retrieval mechanism, an adaptive method for selecting retrieved contexts, and a provable RAG-based preference fine-tuning strategy. These innovations make the RAG process sufficiently general and reliable, significantly improving alignment when introducing retrieved contexts. Experimental results across five medical datasets (involving radiology, ophthalmology, and pathology) on medical VQA and report generation demonstrate that MMed-RAG can achieve an average improvement of 43.8% in the factual accuracy of Med-LVLMs.
Free, publicly-accessible full text available April 24, 2026
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Xia, P; Zhu, K; Li, H; Zhu, H; Li, Y; Li, G; Zhang, L; Yao, H (November 2024, EMNLP)

The recent emergence of Medical Large Vision Language Models (Med-LVLMs) has enhanced medical diagnosis. However, current Med-LVLMs frequently encounter factual issues, often generating responses that do not align with established medical facts. Retrieval-Augmented Generation (RAG), which utilizes external knowledge, can improve the factual accuracy of these models but introduces two major challenges. First, limited retrieved contexts might not cover all necessary information, while excessive retrieval can introduce irrelevant and inaccurate references, interfering with the model’s generation. Second, in cases where the model originally responds correctly, applying RAG can lead to an over-reliance on retrieved contexts, resulting in incorrect answers. To address these issues, we propose RULE, which consists of two components. First, we introduce a provably effective strategy for controlling factuality risk through the calibrated selection of the number of retrieved contexts. Second, based on samples where over-reliance on retrieved contexts led to errors, we curate a preference dataset to fine-tune the model, balancing its dependence on inherent knowledge and retrieved contexts for generation. We demonstrate the effectiveness of RULE on three medical VQA datasets, achieving an average improvement of 20.8% in factual accuracy.
Full Text Available
Determination of energy-dependent neutron backgrounds using shadow bars

https://doi.org/10.1016/j.nima.2023.168341

Paneru, S.N.; Brown, K.W.; Teh, F.C.E.; Zhu, K.; Tsang, M.B.; Dell’Aquila, D.; Chajecki, Z.; Lynch, W.G.; Sweany, S.; Tsang, C.Y.; et al (August 2023, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment)

Full Text Available
Reaction losses of charged particles in CsI(Tl) crystals

https://doi.org/10.1016/j.nima.2021.165798

Sweany, S.; Lynch, W.G.; Brown, K.; Anthony, A.; Chajecki, Z.; Dell’Aquila, D.; Morfouace, P.; Teh, F.C.E.; Tsang, C.Y.; Tsang, M.B.; et al (December 2021, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment)

Full Text Available
Effective Analog/Mixed-Signal Circuit Placement Considering System Signal Flow

Zhu, K.; Chen, H.; Liu, M.; Tang, X.; Sun, N.; Pan, D. Z. (November 2020, IEEE/ACM International Conference on Computer-Aided Design (ICCAD))
Full Text Available
Proton decay spectroscopy of ${}^{28}\mathrm{S}$ and ${}^{30}\mathrm{Cl}$

https://doi.org/10.1103/PhysRevC.105.044321

Gillespie, S. A.; Brown, K. W.; Charity, R. J.; Sobotka, L. G.; Anthony, A. K.; Barney, J.; Bonaccorso, A.; Brown, B. A.; Crosby, J.; Dell'Aquila, D.; et al (April 2022, Physical Review C)

Full Text Available
Value-Assigned Pulse Shape Discrimination for Neutron Detectors

https://doi.org/10.1109/TNS.2021.3091126

Teh, F. C.; Lee, J.-W.; Zhu, K.; Brown, K. W.; Chajecki, Z.; Lynch, W. G.; Tsang, M. B.; Anthony, A.; Barney, J.; Dell'Aquila, D.; et al (August 2021, IEEE Transactions on Nuclear Science)
Full Text Available
Using spin alignment of inelastically excited nuclei in fast beams to assign spins: The spectroscopy of ${}^{13}\mathrm{O}$ as a test case

https://doi.org/10.1103/PhysRevC.104.024325

Charity, R. J.; Webb, T. B.; Elson, J. M.; Hoff, D. E.; Pruitt, C. D.; Sobotka, L. G.; Navrátil, P.; Hupin, G.; Kravvaris, K.; Quaglioni, S.; et al (August 2021, Physical Review C)

Full Text Available
Observation of the Exotic Isotope ${}^{13}\mathrm{F}$ Located Four Neutrons beyond the Proton Drip Line

https://doi.org/10.1103/PhysRevLett.126.132501

Charity, R. J.; Webb, T. B.; Elson, J. M.; Hoff, D. E. M.; Pruitt, C. D.; Sobotka, L. G.; Brown, K. W.; Cerizza, G.; Estee, J.; Lynch, W. G.; et al (March 2021, Physical Review Letters)
Full Text Available
Calibration of large neutron detection arrays using cosmic rays

https://doi.org/10.1016/j.nima.2020.163826

Zhu, K.; Tsang, M.B.; Dell’Aquila, D.; Brown, K.W.; Chajecki, Z.; Lynch, W.G.; Sweany, S.; Teh, F.C.E.; Tsang, C.Y.; Anderson, C.; et al (July 2020, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment)

Full Text Available
