NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Wang, Boshi; Yue, Xiang; Su, Yu; Sun, Huan (December 2024, NeurIPS)

Full Text Available
Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization

Wang, Boshi; Yue, Xiang; Su, Yu; Sun, Huan (December 2024, NeurIPS)

Full Text Available
Automatic Evaluation of Attribution by Large Language Models

Yue, Xiang; Wang, Boshi; Zhang, Kai; Chen, Ziru; Su, Yu; Sun, Huan (December 2023, Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP Findings))

Full Text Available
Synthetic Question Value Estimation for Domain Adaptation of Question Answering

https://doi.org/10.18653/v1/2022.acl-long.95

Yue, Xiang; Yao, Ziyu; Sun, Huan (May 2022, ACL 2022)

Full Text Available
Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe

https://doi.org/10.18653/v1/2023.acl-long.74

Yue, Xiang; Inan, Huseyin; Li, Xuechen; Kumar, Girish; McAnallen, Julia; Shajari, Hoda; Sun, Huan; Levitan, David; Sim, Robert (January 2023, Association for Computational Linguistics)
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering

https://doi.org/10.1109/BIBM52615.2021.9669300

Yue, Xiang; Zhang, Xinliang; Yao, Ziyu; Lin, Simon; Sun, Huan (December 2021, 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))

Clinical question answering (QA) aims to automatically answer questions from medical professionals based on clinical texts. Studies show that neural QA models trained on one corpus may not generalize well to new clinical texts from a different institute or a different patient group, where large-scale QA pairs are not readily available for model retraining. To address this challenge, we propose a simple yet effective framework, CliniQG4QA, which leverages question generation (QG) to synthesize QA pairs on new clinical contexts and boosts QA models without requiring manual annotations. In order to generate diverse types of questions that are essential for training QA models, we further introduce a seq2seq-based question phrase prediction (QPP) module that can be used together with most existing QG models to diversify the generation. Our comprehensive experiment results show that the QA corpus generated by our framework can improve QA models on the new contexts (up to 8% absolute gain in terms of Exact Match), and that the QPP module plays a crucial role in achieving the gain.
more » « less
Full Text Available
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval

https://doi.org/10.18653/v1/2021.emnlp-main.305

Zhang, Xinliang; Sun, Heming; Yue, Xiang; Lin, Simon; Sun, Huan (November 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

We present a large, challenging dataset, COUGH, for COVID-19 FAQ retrieval. Similar to a standard FAQ dataset, COUGH consists of three parts: FAQ Bank, Query Bank and Relevance Set. The FAQ Bank contains ~16K FAQ items scraped from 55 credible websites (e.g., CDC and WHO). For evaluation, we introduce Query Bank and Relevance Set, where the former contains 1,236 human-paraphrased queries while the latter contains ~32 human-annotated FAQ items for each query. We analyze COUGH by testing different FAQ retrieval models built on top of BM25 and BERT, among which the best model achieves 48.8 under P@5, indicating a great challenge presented by COUGH and encouraging future research for further improvement. Our COUGH dataset is available at https://github.com/sunlab-osu/covid-faq.
more » « less
Full Text Available
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset

Yue, Xiang; Jimenez, Bernal; Sun, Huan (July 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL'20))

Full Text Available
Clinical Phrase Mining with Language Models

https://doi.org/10.1109/BIBM49941.2020.9313496

Mani, Kaushik; Yue, Xiang; Gutierrez, Bernal Jimenez; Huang, Yungui; Lin, Simon; Sun, Huan (December 2020, 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))
null (Ed.)
Full Text Available
SurfCon: Synonym Discovery on Privacy-Aware Clinical Data

https://doi.org/10.1145/3292500.3330894

Wang, Zhen; Yue, Xiang; Moosavinasab, Soheil; Huang, Yungui; Lin, Simon; Sun, Huan (January 2019, Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining)

Full Text Available

« Prev Next »

Search for: All records