Can ChatGPT Understand Causal Language in Science Claims?

Kim, Yuheun; Guo, Lu; Yu, Bei; Li, Yingya

doi:10.18653/v1/2023.wassa-1.33

Citation Details

This study evaluated ChatGPT’s ability to understand causal language in science papers and news by testing its accuracy in a task of labeling the strength of a claim as causal, conditional causal, correlational, or no relationship. The results show that ChatGPT still trails existing fine-tuned BERT models by a large margin. ChatGPT also had difficulty understanding conditional causal claims mitigated by hedges. However, this weakness may be used to improve the clarity of human annotation guidelines. Chain-of-thought prompting was faithful and helpful for improving prompt performance, but finding the optimal prompt is difficult given inconsistent results and the lack of an effective method for establishing cause and effect between prompts and outcomes, which suggests caution when generalizing prompt engineering results across tasks or models.

Award ID(s): 1952353

PAR ID: 10552287

Author(s) / Creator(s): Kim, Yuheun; Guo, Lu; Yu, Bei; Li, Yingya

Publisher / Repository: Association for Computational Linguistics

Date Published: 2023-07-01

Page Range / eLocation ID: 379 to 389

Format(s): Medium: X

Location: Toronto, Canada

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.18653/v1/2023.wassa-1.33
