Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews

Yun, Hye; Marshall, Iain; Trikalinos, Thomas; Wallace, Byron

doi:10.18653/v1/2023.emnlp-main.626

Citation Details

Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews

Medical systematic reviews play a vital role in healthcare decision making and policy. However, their production is time-consuming, limiting the availability of high-quality and up-to-date evidence summaries. Recent advancements in LLMs offer the potential to automatically generate literature reviews on demand, addressing this issue. However, LLMs sometimes generate inaccurate (and potentially misleading) texts by hallucination or omission. In healthcare, this can make LLMs unusable at best and dangerous at worst. We conducted 16 interviews with international systematic review experts to characterize the perceived utility and risks of LLMs in the specific context of medical evidence reviews. Experts indicated that LLMs can assist in the writing process by drafting summaries, generating templates, distilling information, and crosschecking information. They also raised concerns regarding confidently composed but inaccurate LLM outputs and other potential downstream harms, including decreased accountability and proliferation of low-quality reviews. Informed by this qualitative analysis, we identify criteria for rigorous evaluation of biomedical LLMs aligned with domain expert views. more »

Award ID(s):: 2211954

PAR ID:: 10506520

Author(s) / Creator(s):: Yun, Hye; Marshall, Iain; Trikalinos, Thomas; Wallace, Byron

Publisher / Repository:: Association for Computational Linguistics

Date Published:: 2023-01-01

Journal Name:: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)

Page Range / eLocation ID:: 10122 to 10139

Format(s):: Medium: X

Location:: Singapore

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2023.emnlp-main.626

More Like this