NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization

Wang, Boshi; Yue, Xiang; Su, Yu; Sun, Huan (December 2024, NeurIPS)

Full Text Available
AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs

Liao, Zeyi; Sun, Huan (October 2024, CONFERENCE ON LANGUAGE MODELING)

Full Text Available
Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction

https://doi.org/10.18653/v1/2022.findings-acl.28

Mo, Lingbo; Lewis, Ashley; Sun, Huan; White, Michael (May 2022, Findings of the Association for Computational Linguistics: ACL 2022)

Existing studies on semantic parsing focus primarily on mapping a natural-language utterance to a corresponding logical form in one turn. However, because natural language can contain a great deal of ambiguity and variability, this is a difficult challenge. In this work, we investigate an interactive semantic parsing framework that explains the predicted logical form step by step in natural language and enables the user to make corrections through natural-language feedback for individual steps. We focus on question answering over knowledge bases (KBQA) as an instantiation of our framework, aiming to increase the transparency of the parsing process and help the user appropriately trust the final answer. To do so, we construct INSPIRED, a crowdsourced dialogue dataset derived from the ComplexWebQuestions dataset. Our experiments show that the interactive framework with human feedback has the potential to greatly improve overall parse accuracy. Furthermore, we develop a pipeline for dialogue simulation to evaluate our framework w.r.t. a variety of state-of-the-art KBQA models without involving further crowdsourcing effort. The results demonstrate that our interactive semantic parsing framework promises to be effective across such models.
more » « less
Full Text Available
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering

https://doi.org/10.1109/BIBM52615.2021.9669300

Yue, Xiang; Zhang, Xinliang; Yao, Ziyu; Lin, Simon; Sun, Huan (December 2021, 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))

Clinical question answering (QA) aims to automatically answer questions from medical professionals based on clinical texts. Studies show that neural QA models trained on one corpus may not generalize well to new clinical texts from a different institute or a different patient group, where large-scale QA pairs are not readily available for model retraining. To address this challenge, we propose a simple yet effective framework, CliniQG4QA, which leverages question generation (QG) to synthesize QA pairs on new clinical contexts and boosts QA models without requiring manual annotations. In order to generate diverse types of questions that are essential for training QA models, we further introduce a seq2seq-based question phrase prediction (QPP) module that can be used together with most existing QG models to diversify the generation. Our comprehensive experiment results show that the QA corpus generated by our framework can improve QA models on the new contexts (up to 8% absolute gain in terms of Exact Match), and that the QPP module plays a crucial role in achieving the gain.
more » « less
Full Text Available
ReasonBERT: Pre-trained to Reason with Distant Supervision

https://doi.org/10.18653/v1/2021.emnlp-main.494

Deng, Xiang; Su, Yu; Lees, Alyssa; Wu, You; Yu, Cong; Sun, Huan (November 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

We present ReasonBert, a pre-training method that augments language models with the ability to reason over long-range relations and multiple, possibly hybrid contexts. Unlike existing pre-training methods that only harvest learning signals from local contexts of naturally occurring texts, we propose a generalized notion of distant supervision to automatically connect multiple pieces of text and tables to create pre-training examples that require long-range reasoning. Different types of reasoning are simulated, including intersecting multiple pieces of evidence, bridging from one piece of evidence to another, and detecting unanswerable cases. We conduct a comprehensive evaluation on a variety of extractive question answering datasets ranging from single-hop to multi-hop and from text-only to table-only to hybrid that require various reasoning capabilities and show that ReasonBert achieves remarkable improvement over an array of strong baselines. Few-shot experiments further demonstrate that our pre-training method substantially improves sample efficiency.
more » « less
Full Text Available
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval

https://doi.org/10.18653/v1/2021.emnlp-main.305

Zhang, Xinliang; Sun, Heming; Yue, Xiang; Lin, Simon; Sun, Huan (November 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

We present a large, challenging dataset, COUGH, for COVID-19 FAQ retrieval. Similar to a standard FAQ dataset, COUGH consists of three parts: FAQ Bank, Query Bank and Relevance Set. The FAQ Bank contains ~16K FAQ items scraped from 55 credible websites (e.g., CDC and WHO). For evaluation, we introduce Query Bank and Relevance Set, where the former contains 1,236 human-paraphrased queries while the latter contains ~32 human-annotated FAQ items for each query. We analyze COUGH by testing different FAQ retrieval models built on top of BM25 and BERT, among which the best model achieves 48.8 under P@5, indicating a great challenge presented by COUGH and encouraging future research for further improvement. Our COUGH dataset is available at https://github.com/sunlab-osu/covid-faq.
more » « less
Full Text Available
Learning Structural Edits via Incremental Tree Transformations

Yao, Ziyu; Xu, Frank; Yin, Pengcheng; Sun, Huan; Neubig, Graham (January 2021, The Ninth International Conference on Learning Representations 2021 (ICLR'21))
null (Ed.)
Full Text Available
Learning a Cost-Effective Annotation Policy for Question Answering

Kratzwald, Bernhard; Feuerriegel, Stefan; Sun, Huan (January 2020, 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP'20, long))
null (Ed.)
Full Text Available
An Imitation Game for Learning Semantic Parsers from User Interaction

Yao, Ziyu; Tang, Yiqi; Yih, Wen-tau; Sun, Huan; Su, Yu (January 2020, 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP'20, long))
null (Ed.)
Full Text Available
Adversarial Training for Code Retrieval with Question-Description Relevance Regularization

Zhao, Jie; Sun, Huan (January 2020, Findings of 2020 Conference on Empirical Methods in Natural Language Processing)
null (Ed.)
Full Text Available

Search for: All records