NSF PAR Search | NSF Public Access Repository

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Yao, Shunyu; Yu, Dian; Zhao, Jeffrey; Shafran, Izhak; Griffiths, Thomas L; Cao, Yuan; Narasimhan, Karthik (December 2023, Advances in neural information processing systems)

Full Text Available
ReAct: Synergizing Reasoning and Acting in Language Models

Yao, Shunyu; Zhao, Jeffrey; Yu, Dian; Du, Nan; Shafran, Izhak; Narasimhan, Karthik; Cao, Yuan (January 2023, International Conference on Learning Representations (ICLR))

While large language models (LLMs) have demonstrated impressive capabilities across tasks in language understanding and interactive decision making, their abilities for reasoning (e.g. chain-of-thought prompting) and acting (e.g. action plan generation) have primarily been studied as separate topics. In this paper, we explore the use of LLMs to generate both reasoning traces and task-specific actions in an interleaved manner, allowing for greater synergy between the two: reasoning traces help the model induce, track, and update action plans as well as handle exceptions, while actions allow it to interface with external sources, such as knowledge bases or environments, to gather additional information. We apply our approach, named ReAct, to a diverse set of language and decision making tasks and demonstrate its effectiveness over state-of-the-art baselines, as well as improved human interpretability and trustworthiness over methods without reasoning or acting components. Concretely, on question answering (HotpotQA) and fact verification (Fever), ReAct overcomes issues of hallucination and error propagation prevalent in chain-of-thought reasoning by interacting with a simple Wikipedia API, and generates human-like task-solving trajectories that are more interpretable than baselines without reasoning traces. On two interactive decision making benchmarks (ALFWorld and WebShop), ReAct outperforms imitation and reinforcement learning methods by an absolute success rate of 34% and 10% respectively, while being prompted with only one or two in-context examples.
Full Text Available
NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization

https://doi.org/10.18653/v1/2022.findings-emnlp.14

Zhao, Chao; Brahman, Faeze; Song, Kaiqiang; Yao, Wenlin; Yu, Dian; Chaturvedi, Snigdha (January 2022, Proceedings of the Findings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
Automatically Exposing Problems with Neural Dialog Models

https://doi.org/10.18653/v1/2021.emnlp-main.37

Yu, Dian; Sagae, Kenji (January 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

Neural dialog models are known to suffer from problems such as generating unsafe and inconsistent responses. Even though these problems are crucial and prevalent, they are mostly manually identified by model designers through interactions. Recently, some research instructs crowdworkers to goad the bots into triggering such problems. However, humans leverage superficial clues such as hate speech, while leaving systematic problems undercover. In this paper, we propose two methods including reinforcement learning to automatically trigger a dialog model into generating problematic responses. We show the effect of our methods in exposing safety and contradiction issues with state-of-the-art dialog models.
Full Text Available
Beyond NVD: Cybersecurity meets the Semantic Web

https://doi.org/10.1145/3498891.3501259

Aranovich, Raúl; Wu, Muting; Yu, Dian; Katsy, Katya; Ahmadnia, Benyamin; Bishop, Matthew; Filkov, Vladimir; Sagae, Kenji (October 2021, NSPW '21: New Security Paradigms Workshop)

Full Text Available
Attribute Alignment: Controlling Text Generation from Pre-trained Language Models

https://doi.org/10.18653/v1/2021.findings-emnlp.194

Yu, Dian; Yu, Zhou; Sagae, Kenji (January 2021, Findings of the Association for Computational Linguistics: EMNLP 2021)

Large language models benefit from training with a large amount of unlabeled text, which gives them increasingly fluent and diverse generation capabilities. However, using these models for text generation that takes into account target attributes, such as sentiment polarity or specific topics, remains a challenge. We propose a simple and flexible method for controlling text generation by aligning disentangled attribute representations. In contrast to recent efforts on training a discriminator to perturb the token level distribution for an attribute, we use the same data to learn an alignment function to guide the pre-trained, non-controlled language model to generate texts with the target attribute without changing the original language model parameters. We evaluate our method on sentiment- and topic-controlled generation, and show large performance gains over previous methods while retaining fluency and diversity.
Full Text Available
Language Embeddings for Typology and Cross-lingual Transfer Learning

https://doi.org/10.18653/v1/2021.acl-long.560

Yu, Dian; He, Taiqi; Sagae, Kenji (January 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers))
Cross-lingual language tasks typically require a substantial amount of annotated data or parallel translation data. We explore whether language representations that capture relationships among languages can be learned and subsequently leveraged in cross-lingual tasks without the use of parallel data. We generate dense embeddings for 29 languages using a denoising autoencoder, and evaluate the embeddings using the World Atlas of Language Structures (WALS) and two extrinsic tasks in a zero-shot setting: cross-lingual dependency parsing and cross-lingual natural language inference.
Full Text Available
Challenges to pooling models of crowding: Implications for visual mechanisms

https://doi.org/10.1167/19.7.15

Rosenholtz, Ruth; Yu, Dian; Keshvari, Shaiyan (July 2019, Journal of Vision)

Full Text Available
