NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

Guo, Xingang; Yu, Fangxu; Zhang, Huan; Qin, Lianhui; Hu, Bin (July 2024, International Conference on Machine Learning (ICML))

Full Text Available
Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts

https://doi.org/10.18653/v1/2022.findings-acl.149

Yu, Wenhao; Zhu, Chenguang; Qin, Lianhui; Zhang, Zhihan; Zhao, Tong; Jiang, Meng (January 2022, Findings of the Association for Computational Linguistics: ACL 2022)

Generative commonsense reasoning (GCR) in natural language is to reason about the commonsense while generating coherent text. Recent years have seen a surge of interest in improving the generation quality of commonsense reasoning tasks. Nevertheless, these approaches have seldom investigated diversity in the GCR tasks, which aims to generate alternative explanations for a real-world situation or predict all possible outcomes. Diversifying GCR is challenging as it expects to generate multiple outputs that are not only semantically different but also grounded in commonsense knowledge. In this paper, we propose MoKGE, a novel method that diversifies the generative reasoning by a mixture of expert (MoE) strategy on commonsense knowledge graphs (KG). A set of knowledge experts seek diverse reasoning on KG to encourage various generation outputs. Empirical experiments demonstrated that MoKGE can significantly improve the diversity while achieving on par performance on accuracy on two GCR benchmarks, based on both automatic and human evaluations.
more » « less
Full Text Available
TuringAdvice: A Generative and Dynamic Evaluation of Language Use

https://doi.org/10.18653/v1/2021.naacl-main.386

Zellers, Rowan; Holtzman, Ari; Clark, Elizabeth; Qin, Lianhui; Farhadi, Ali; Choi, Yejin (June 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume)

We propose TuringAdvice, a new challenge task and dataset for language understanding models. Given a written situation that a real person is currently facing, a model must generate helpful advice in natural language. Our evaluation framework tests a fundamental aspect of human language understanding: our ability to use language to resolve open-ended situations by communicating with each other. Empirical results show that today’s models struggle at TuringAdvice, even multibillion parameter models finetuned on 600k in-domain training examples. The best model, T5, writes advice that is at least as helpful as human-written advice in only 14% of cases; a much larger non-finetunable GPT3 model does even worse at 4%. This low performance reveals language understanding errors that are hard to spot outside of a generative setting, showing much room for progress.
more » « less
Full Text Available
Social Bias Frames: Reasoning about Social and Power Implications of Language

https://doi.org/10.18653/v1/2020.acl-main.486

Sap, Maarten; Gabriel, Saadia; Qin, Lianhui; Jurafsky, Dan; Smith, Noah A; Choi, Yejin (July 2020, Association for Computational Linguistics)

Full Text Available
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Srivastava, Aarohi; Rastogi, Abhinav; Rao, Abhishek; Shoeb, Abu Awal; Abid, Abubakar; Fisch, Adam; Brown, Adam R.; Santoro, Adam; Gupta, Aditya; Garriga-Alonso, Adri; et al (January 2023, Transactions on machine learning research)

Full Text Available

Search for: All records