Title: How Much Do Prompting Methods Help LLMs on Quantitative Reasoning with Irrelevant Information?
Real-world quantitative reasoning problems are complex, often including extra information irrelevant to the question (“IR noise” for short). State-of-the-art (SOTA) prompting methods have improved Large Language Models’ (LLMs’) quantitative reasoning on grade-school Math Word Problems (MWPs). To assess how well these SOTA methods handle IR noise, we constructed four new datasets, each consisting of 300 problems drawn from one of the four public datasets MAWPS, ASDiv, SVAMP, and GSM8K, with IR noise added. We call this collection “MPN” (Math Word Problems with IR Noise) and use it to evaluate SOTA prompting methods. We also propose Noise Reduction Prompting (NRP) and its variant (NRP+) to reduce the impact of IR noise. Findings: our IR noise significantly degrades the performance of Chain-of-Thought (CoT) prompting on three different backend models: ChatGPT (gpt-3.5-turbo-0613), PaLM2, and Llama3-8B-instruct. Among them, ChatGPT achieves the best accuracy on MPN both with and without IR noise. With IR noise, the performance of CoT, Least-To-Most Prompting, Progressive-Hint Prompting, and Program-aided Language Models with ChatGPT is significantly degraded, each with an average accuracy drop of more than 12%. NRP is the least affected by the noise, with an average accuracy drop of only around 1.9%, and NRP+ and NRP perform comparably in the presence of IR noise.
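To make the idea concrete, below is a minimal Python sketch of one way a noise-reduction prompting pipeline could be realized: first ask the model to strip likely-irrelevant sentences, then apply CoT to the cleaned problem. The prompt wording and the `call_llm` callable are illustrative assumptions, not the paper's exact NRP prompts or interface.

```python
# A minimal "filter irrelevant information, then solve" sketch in the spirit of
# noise-reduction prompting. Prompt text and `call_llm` are assumptions.

FILTER_PROMPT = (
    "The following math word problem may contain sentences that are irrelevant "
    "to the question. Rewrite the problem, keeping only the information needed "
    "to answer it.\n\nProblem: {problem}\n\nCleaned problem:"
)

SOLVE_PROMPT = (
    "Solve the following math word problem. Let's think step by step, then give "
    "the final numeric answer on the last line.\n\nProblem: {problem}"
)

def noise_reduction_prompting(problem: str, call_llm) -> str:
    """Two LLM calls: (1) strip likely-irrelevant sentences, (2) solve with CoT."""
    cleaned = call_llm(FILTER_PROMPT.format(problem=problem))
    return call_llm(SOLVE_PROMPT.format(problem=cleaned))
```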
Award ID(s):
2152117
PAR ID:
10611681
Author(s) / Creator(s):
Publisher / Repository:
ACM
Date Published:
ISBN:
9798400704369
Page Range / eLocation ID:
2128 to 2137
Subject(s) / Keyword(s):
Prompt Engineering, Large Language Model, Trustworthy Information Extraction, Quantitative Reasoning, Math Word Problem
Format(s):
Medium: X
Location:
Boise ID USA
Sponsoring Org:
National Science Foundation
More Like this
  1. While Chain-of-Thought (CoT) prompting boosts Language Models’ (LMs) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (i.e., faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query → symbolic reasoning chain) and Problem Solving (reasoning chain → answer), using an LM and a deterministic solver, respectively. This guarantees that the reasoning chain provides a faithful explanation of the final answer. Aside from interpretability, Faithful CoT also improves empirical performance: it outperforms standard CoT on 9 of 10 benchmarks from 4 diverse domains, with a relative accuracy gain of 6.3% on Math Word Problems (MWP), 3.4% on Planning, 5.5% on Multi-hop Question Answering (QA), and 21.4% on Relational Inference. Furthermore, with GPT-4 and Codex, it sets the new state-of-the-art few-shot performance on 7 datasets (with 95.0+ accuracy on 6 of them), showing a strong synergy between faithfulness and accuracy.
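As a concrete illustration of the two stages, the sketch below translates a question into executable Python (serving as the symbolic reasoning chain) and then runs it deterministically. The prompt text and the `call_llm` callable are assumptions for illustration; the framework pairs the LM with different deterministic solvers depending on the domain.

```python
# A minimal translate-then-solve sketch of the Faithful CoT pattern, using
# executable Python as the symbolic reasoning chain. Prompt and `call_llm`
# are illustrative assumptions, not the paper's exact setup.

TRANSLATE_PROMPT = (
    "Translate the question into a Python function solve() that returns the "
    "answer. Use comments to tie each step back to the question.\n\n"
    "Question: {question}\n\nPython:"
)

def faithful_cot(question: str, call_llm):
    code = call_llm(TRANSLATE_PROMPT.format(question=question))  # stage 1: Translation
    namespace = {}
    exec(code, namespace)        # stage 2: deterministic Problem Solving
    return namespace["solve"]()  # the executed chain is the explanation of the answer
```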
  2. Language models are achieving impressive performance on various tasks by aggressively adopting inference-time prompting techniques, such as zero-shot and few-shot prompting. In this work, we introduce EchoPrompt, a simple yet effective approach that prompts the model to rephrase its queries before answering them. EchoPrompt is tailored for four scenarios, including standard and chain-of-thought prompting, in both zero-shot and few-shot settings. Experimental results show that EchoPrompt yields substantial improvements across all these settings for four families of causal language models. These improvements are observed across various numerical reasoning (e.g., GSM8K, SVAMP), reading comprehension (e.g., DROP), and logical reasoning (e.g., coin flipping) tasks. On average, EchoPrompt improves the Zero-shot-CoT performance of code-davinci-002 by 5% in numerical tasks and 13% in reading comprehension tasks. Our empirical results indicate that EchoPrompt is an effective technique that enhances in-context learning performance.
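A minimal zero-shot sketch of the rephrase-then-answer idea is shown below; the instruction wording and the `call_llm` callable are assumptions for illustration rather than the paper's exact prompt.

```python
# A minimal sketch of EchoPrompt's "rephrase, then answer" idea in a zero-shot
# CoT setting. The trailing instruction is an assumed paraphrase, not
# necessarily the exact suffix used in the paper.

def echo_prompt(question: str, call_llm) -> str:
    prompt = (
        f"Q: {question}\n"
        "A: Let's repeat the question and also think step by step."
    )
    return call_llm(prompt)
```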
  3. The burgeoning sophistication of Artificial Intelligence (AI) has catalyzed the rapid proliferation of Large Language Models (LLMs) within software development. These models are increasingly employed to automate the generation of functionally correct code, address complex computational problems, and facilitate the debugging of existing software systems. However, LLM-generated code often faces challenges due to inherent inefficiencies, including redundant logical structures, factually inconsistent content (hallucinations), and programming errors. To address this issue, our research rigorously evaluated the computational efficiency of Python code generated by three prominent LLMs: GPT-4o-Mini, GPT-3.5-Turbo, and GPT-4-Turbo. The evaluation metrics encompass execution time, memory utilization, and peak memory consumption, while maintaining the functional correctness of the generated code. Leveraging the EffiBench benchmark datasets within the Google Vertex AI Workbench environment, across a spectrum of machine configurations, the study implemented a consistent seed parameter to ensure experimental reproducibility. Furthermore, we investigated the impact of two distinct optimization strategies: Chain-of-Thought (CoT) prompting and model fine-tuning. Our findings reveal a significant enhancement in efficiency metrics for GPT-4o-Mini and GPT-3.5-Turbo when employing CoT prompting; however, this trend was not observed for GPT-4-Turbo. Based on its promising performance with CoT prompting, we selected the GPT-4o-Mini model for subsequent fine-tuning, aiming to further enhance both its computational efficiency and accuracy. However, contrary to our expectations, fine-tuning the GPT-4o-Mini model led to a discernible degradation in both its accuracy and computational efficiency. In conclusion, this study provides empirical evidence suggesting that the deployment of high-CPU machine configurations, in synergy with the utilization of the GPT-4o-Mini model and CoT prompting techniques, yields demonstrably more efficient and accurate LLM-generated Python code, particularly within computationally intensive application scenarios. 
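For a concrete sense of the metrics involved, the sketch below measures wall-clock time and peak memory of a generated Python snippet using only the standard library; it is a simplified stand-in under stated assumptions, not the EffiBench harness or the study's exact metric definitions.

```python
# Profile an LLM-generated Python snippet: wall-clock time via time.perf_counter
# and peak allocated memory via tracemalloc. Simplified illustration only.
import time
import tracemalloc

def profile_snippet(code: str) -> dict:
    """Run `code` in a fresh namespace and report elapsed time and peak memory."""
    namespace = {}
    tracemalloc.start()
    start = time.perf_counter()
    exec(code, namespace)                       # run the generated snippet
    elapsed = time.perf_counter() - start
    _, peak = tracemalloc.get_traced_memory()   # bytes allocated at the peak
    tracemalloc.stop()
    return {"seconds": elapsed, "peak_bytes": peak}

# Example: profile a trivial generated snippet.
print(profile_snippet("total = sum(i * i for i in range(10_000))"))
```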
  4. We present a conversational AI tutor (CAIT) for the purpose of aiding students on middle school math problems. CAIT was created using the CLASS framework: it is an LLM obtained by fine-tuning Vicuna on a conversational dataset generated by prompting ChatGPT with problems and explanations from ASSISTments. CAIT is trained to generate scaffolding questions, provide hints, and correct mistakes on math problems. We find that CAIT identifies 60% of correct answers as correct, generates effective sub-problems 33% of the time, and has a positive sentiment 72% of the time, with the remaining 28% of interactions being neutral. This paper discusses the hurdles to further integration of CAIT into ASSISTments, namely the need for improved accuracy and more effective sub-problems, and establishes CAIT as a proof of concept that the CLASS framework can be applied to create an effective mathematics tutorbot.
  5. Large Language Models (LLMs) can achieve strong performance on many tasks by producing step-by-step reasoning before giving a final output, often referred to as chain-of-thought reasoning (CoT). It is tempting to interpret these CoT explanations as the LLM’s process for solving a task. This level of transparency into LLMs’ predictions would yield significant safety benefits. However, we find that CoT explanations can systematically misrepresent the true reason for a model’s prediction. We demonstrate that CoT explanations can be heavily influenced by adding biasing features to model inputs—e.g., by reordering the multiple-choice options in a few-shot prompt to make the answer always “(A)”—which models systematically fail to mention in their explanations. When we bias models toward incorrect answers, they frequently generate CoT explanations rationalizing those answers. This causes accuracy to drop by as much as 36% on a suite of 13 tasks from BIG-Bench Hard, when testing with GPT-3.5 from OpenAI and Claude 1.0 from Anthropic. On a social-bias task, model explanations justify giving answers in line with stereotypes without mentioning the influence of these social biases. Our findings indicate that CoT explanations can be plausible yet misleading, which risks increasing our trust in LLMs without guaranteeing their safety. Building more transparent and explainable systems will require either improving CoT faithfulness through targeted efforts or abandoning CoT in favor of alternative methods. 
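The kind of biasing perturbation described in item 5 can be illustrated with a short sketch that reorders each few-shot exemplar's answer options so the correct one always appears as "(A)"; the data layout here is an assumption for illustration, not the paper's exact prompt-construction code.

```python
# Reorder a multiple-choice exemplar so the correct option is always labeled (A),
# the biasing feature discussed above. Data structures are illustrative.

def bias_to_option_a(options: list[str], correct_index: int) -> list[str]:
    """Move the correct option to position 0 (label '(A)'), keep the rest in order."""
    return [options[correct_index]] + [
        opt for i, opt in enumerate(options) if i != correct_index
    ]

def format_exemplar(question: str, options: list[str], correct_index: int) -> str:
    opts = bias_to_option_a(options, correct_index)
    lines = [f"({chr(ord('A') + i)}) {opt}" for i, opt in enumerate(opts)]
    return f"{question}\n" + "\n".join(lines) + "\nAnswer: (A)"

print(format_exemplar("Which number is prime?", ["9", "7", "15"], correct_index=1))
```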