NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?

https://doi.org/10.18653/v1/2024.naacl-long.424

Li, Bangzheng; Zhou, Ben; Wang, Fei; Fu, Xingyu; Roth, Dan; Chen, Muhao (January 2024, Association for Computational Linguistics)

Full Text Available
Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking

https://doi.org/10.18653/v1/2024.findings-naacl.224

Xu, Nan; Wang, Fei; Zhou, Ben; Li, Bangzheng; Xiao, Chaowei; Chen, Muhao (January 2024, Association for Computational Linguistics)

Full Text Available
Affective and Dynamic Beam Search for Story Generation

Huang, Tenghao; Qasemi, Ehsan; Li, Bangzheng; Wang, He; Brahman, Faeze; Chen, Muhao; Chaturvedi, Snigdha (December 2023, Association for Computational Linguistics)

Full Text Available
Affective and Dynamic Beam Search for Story Generation

https://doi.org/10.18653/v1/2023.findings-emnlp.789

Huang, Tenghao; Qasemi, Ehsan; Li, Bangzheng; Wang, He; Brahman, Faeze; Chen, Muhao; Chaturvedi, Snigdha (January 2023, Findings of the Association for Computational Linguistics: EMNLP 2023)

Storytelling’s captivating potential makes it a fascinating research area, with implications for entertainment, education, therapy, and cognitive studies. In this paper, we propose Affective Story Generator (AffGen) for generating interesting narratives. AffGen introduces ‘intriguing twists’ in narratives by employing two novel techniques—Dynamic Beam Sizing and Affective Reranking. Dynamic Beam Sizing encourages less predictable, more captivating word choices using a contextual multi-arm bandit model. Affective Reranking prioritizes sentence candidates based on affect intensity. Our empirical evaluations, both automatic and human, demonstrate AffGen’s superior performance over existing baselines in generating affectively charged and interesting narratives. Our ablation study and analysis provide insights into the strengths and weaknesses of AffGen.
more » « less
Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference

https://doi.org/10.1162/tacl_a_00479

Li, Bangzheng; Yin, Wenpeng; Chen, Muhao (January 2022, Transactions of the Association for Computational Linguistics)

Abstract The task of ultra-fine entity typing (UFET) seeks to predict diverse and free-form words or phrases that describe the appropriate types of entities mentioned in sentences. A key challenge for this task lies in the large number of types and the scarcity of annotated data per type. Existing systems formulate the task as a multi-way classification problem and train directly or distantly supervised classifiers. This causes two issues: (i) the classifiers do not capture the type semantics because types are often converted into indices; (ii) systems developed in this way are limited to predicting within a pre-defined type set, and often fall short of generalizing to types that are rarely seen or unseen in training. This work presents LITE🍻, a new approach that formulates entity typing as a natural language inference (NLI) problem, making use of (i) the indirect supervision from NLI to infer type information meaningfully represented as textual hypotheses and alleviate the data scarcity issue, as well as (ii) a learning-to-rank objective to avoid the pre-defining of a type set. Experiments show that, with limited training data, LITE obtains state-of-the-art performance on the UFET task. In addition, LITE demonstrates its strong generalizability by not only yielding best results on other fine-grained entity typing benchmarks, more importantly, a pre-trained LITE system works well on new data containing unseen types.1
more » « less
Full Text Available
Unified Semantic Typing with Meaningful Label Inference

https://doi.org/10.18653/v1/2022.naacl-main.190

Huang, James Y.; Li, Bangzheng; Xu, Jiashu; Chen, Muhao (January 2022, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)

Semantic typing aims at classifying tokens or spans of interest in a textual context into semantic categories such as relations, entity types, and event types. The inferred labels of semantic categories meaningfully interpret how machines understand components of text. In this paper, we present UniST, a unified framework for semantic typing that captures label semantics by projecting both inputs and labels into a joint semantic embedding space. To formulate different lexical and relational semantic typing tasks as a unified task, we incorporate task descriptions to be jointly encoded with the input, allowing UniST to be adapted to different tasks without introducing task-specific model components. UniST optimizes a margin ranking loss such that the semantic relatedness of the input and labels is reflected from their embedding similarity. Our experiments demonstrate that UniST achieves strong performance across three semantic typing tasks: entity typing, relation classification and event typing. Meanwhile, UniST effectively transfers semantic knowledge of labels and substantially improves generalizability on inferring rarely seen and unseen types. In addition, multiple semantic typing tasks can be jointly trained within the unified framework, leading to a single compact multi-tasking model that performs comparably to dedicated single-task models, while offering even better transferability.
more » « less
Full Text Available
Does Your Model Classify Entities Reasonably? Diagnosing and Mitigating Spurious Correlations in Entity Typing

Xu, Nan; Wang, Fei; Li, Bangzheng; Dong, Mingtao; Chen, Muhao (January 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Entity typing aims at predicting one or more words that describe the type(s) of a specific mention in a sentence. Due to shortcuts from surface patterns to annotated entity labels and biased training, existing entity typing models are subject to the problem of spurious correlations. To comprehensively investigate the faithfulness and reliability of entity typing methods, we first systematically define distinct kinds of model biases that are reflected mainly from spurious correlations. Particularly, we identify six types of existing model biases, including mention-context bias, lexical overlapping bias, named entity bias, pronoun bias, dependency bias, and overgeneralization bias. To mitigate model biases, we then introduce a counterfactual data augmentation method. By augmenting the original training set with their debiasedcounterparts, models are forced to fully comprehend sentences and discover the fundamental cues for entity typing, rather than relying on spurious correlations for shortcuts. Experimental results on the UFET dataset show our counterfactual data augmentation approach helps improve generalization of different entity typing models with consistently better performance on both the original and debiased test sets.
more » « less
Full Text Available
Fine-Grained Named Entity Recognition with Distant Supervision in COVID-19 Literature

https://doi.org/10.1109/BIBM49941.2020.9313126

Wang, Xuan; Song, Xiangchen; Li, Bangzheng; Zhou, Kang; Li, Qi; Han, Jiawei (December 2020, BIBM'20, IEEE Int. Conf. on Bioinformatics and Biomedicine, Dec 2020)
null (Ed.)
Biomedical named entity recognition (BioNER) is a fundamental step for mining COVID-19 literature. Existing BioNER datasets cover a few common coarse-grained entity types (e.g., genes, chemicals, and diseases), which cannot be used to recognize highly domain-specific entity types (e.g., animal models of diseases) or emerging ones (e.g., coronaviruses) for COVID-19 studies. We present CORD-NER, a fine-grained named entity recognized dataset of COVID-19 literature (up until May 19, 2020). CORD-NER contains over 12 million sentences annotated via distant supervision. Also included in CORD-NER are 2,000 manually-curated sentences as a test set for performance evaluation. CORD-NER covers 75 fine-grained entity types. In addition to the common biomedical entity types, it covers new entity types specifically related to COVID-19 studies, such as coronaviruses, viral proteins, evolution, and immune responses. The dictionaries of these fine-grained entity types are collected from existing knowledge bases and human-input seed sets. We further present DISTNER, a distantly supervised NER model that relies on a massive unlabeled corpus and a collection of dictionaries to annotate the COVID-19 corpus. DISTNER provides a benchmark performance on the CORD-NER test set for future research.
more » « less
Full Text Available

Search for: All records