Semantically Enriched Text Generation for {QA} through Dense Paraphrasing

Obiso, Timothy; Ye, Bingyang; Rim, Kyeongmin; Pustejovsky, James

Citation Details

Large Language Models (LLMs) are very effective at extractive language tasks such as Question Answering (QA). While LLMs can improve their performance on these tasks through increases in model size (via massive pretraining) and/or iterative on-the-job training (one-shot, few-shot, chain-of-thought), we explore what other less resource-intensive and more efficient types of data augmentation can be applied to obtain similar boosts in performance. We define multiple forms of Dense Paraphrasing (DP) and obtain DP-enriched versions of different contexts. We demonstrate that performing QA using these semantically enriched contexts leads to increased performance on models of various sizes and across task domains, without needing to increase model size. more »

Award ID(s):: 2326985

PAR ID:: 10599003

Author(s) / Creator(s):: Obiso, Timothy; Ye, Bingyang; Rim, Kyeongmin; Pustejovsky, James

Editor(s):: Abbas, Mourad; Freihat, Abed Alhakim

Publisher / Repository:: Association for Computational Linguistics

Date Published:: 2024-10-19

Edition / Version:: 7

Volume:: 1

Issue:: 1

ISBN:: 979-8-89176-165-0

Page Range / eLocation ID:: 279-286

Subject(s) / Keyword(s):: Dense Paraphrasing Question answering LLMs chain-of-thought

Format(s):: Medium: X

Location:: Trento, Italy

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this