Search for: All records

Creators/Authors contains: "Neubig, Graham"

« Prev Next »

Total Resources

60

Resource Type
Conference Paper

54

Conference Proceeding

0

Dataset

0

Journal Article

6

Workshop Report

0

Availability
Full Text / Resource Available

57

Citation Only

3

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PAL: Program-aided Language Models

Gao, Luyu ; Madaan, Aman ; Zhou, Shuyan ; Alon, Uri ; Liu, Pengfei ; Yang, Yiming ; Callan, Jamie ; Neubig, Graham ( July 2023 , Proceedings of the 40th International Conference on Machine Learning)

Large language models (LLMs) have demonstrated an impressive ability to perform arithmetic and symbolic reasoning tasks, when provided with a few examples at test time ("few-shot prompting"). Much of this success can be attributed to prompting methods such as "chain-of-thought", which employ LLMs for both understanding the problem description by decomposing it into steps, as well as solving each step of the problem. While LLMs seem to be adept at this sort of step-by-step decomposition, LLMs often make logical and arithmetic mistakes in the solution part, even when the problem is decomposed correctly. In this paper, we present Program-Aided Language models (PAL): a novel approach that uses the LLM to read natural language problems and generate programs as the intermediate reasoning steps, but offloads the solution step to a runtime such as a Python interpreter. With PAL, decomposing the natural language problem into runnable steps remains the only learning task for the LLM, while solving is delegated to the interpreter. We demonstrate this synergy between a neural LLM and a symbolic interpreter across 13 mathematical, symbolic, and algorithmic reasoning tasks from BIG-Bench Hard and others. In all these natural language reasoning tasks, generating code using an LLM and reasoning using a Python interpreter leads to more accurate results than much larger models. For example, PAL using Codex achieves state-of-the-art few-shot accuracy on GSM8K, surpassing PaLM which uses chain-of-thought by absolute 15% top-1.
more » « less
Free, publicly-accessible full text available July 23, 2024
Computational Language Acquisition with Theory of Mind

Liu, Andy ; Zhu, Hao ; Liu, Emmy ; Bisk, Yonatan ; Neubig, Graham ( May 2023 , International Conference on Learning Representations)

Free, publicly-accessible full text available May 1, 2024
EXCALIBUR: Encouraging and Evaluating Embodied Exploration

https://doi.org/10.1109/CVPR52729.2023.01434

Zhu, Hao ; Kapoor, Raghav ; Min, So Yeon ; Han, Winson ; Li, Jiatai ; Geng, Kaiwen ; Neubig, Graham ; Bisk, Yonatan ; Kembhavi, Aniruddha ; Weihs, Luca ( June 2023 , Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition)

Free, publicly-accessible full text available June 1, 2024
Syntax and Semantics Meet in the “Middle”: Probing the Syntax-Semantics Interface of LMs Through Agentivity

https://doi.org/10.18653/v1/2023.starsem-1.14

Tjuatja, Lindia ; Liu, Emmy ; Levin, Lori ; Neubig, Graham ( January 2023 , Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023))

Full Text Available
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages

Wang, Zhiruo ; Cuenca, Grace ; Zhou, Shuyan ; Xu, Frank F. ; Neubig, Graham ( January 2023 , Findings of the Conference of the European Chapter of the Association for Computational Linguistics)

Full Text Available
SigMoreFun Submission to the SIGMORPHON Shared Task on Interlinear Glossing

https://doi.org/10.18653/v1/2023.sigmorphon-1.22

He, Taiqi ; Tjuatja, Lindia ; Robinson, Nathaniel ; Watanabe, Shinji ; Mortensen, David R. ; Neubig, Graham ; Levin, Lori ( January 2023 , Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology)

Full Text Available
Testing the Ability of Language Models to Interpret Figurative Language

https://doi.org/10.18653/v1/2022.naacl-main.330

Liu, Emmy ; Cui, Chenxuan ; Zheng, Kenneth ; Neubig, Graham ( July 2022 , Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)

Figurative and metaphorical language are commonplace in discourse, and figurative expressions play an important role in communication and cognition. However, figurative language has been a relatively under-studied area in NLP, and it remains an open question to what extent modern language models can interpret nonliteral phrases. To address this question, we introduce Fig-QA, a Winograd-style nonliteral language understanding task consisting of correctly interpreting paired figurative phrases with divergent meanings. We evaluate the performance of several state-of-the-art language models on this task, and find that although language models achieve performance significantly over chance, they still fall short of human performance, particularly in zero- or few-shot settings. This suggests that further work is needed to improve the nonliteral reasoning capabilities of language models.
more » « less
Full Text Available
Systematic Inequalities in Language Technology Performance across the World’s Languages

https://doi.org/10.18653/v1/2022.acl-long.376

Blasi, Damian ; Anastasopoulos, Antonios ; Neubig, Graham ( May 2022 , Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Full Text Available
In-IDE Code Generation from Natural Language: Promise and Challenges

https://doi.org/10.1145/3487569

Xu, Frank F. ; Vasilescu, Bogdan ; Neubig, Graham ( April 2022 , ACM Transactions on Software Engineering and Methodology)

A great part of software development involves conceptualizing or communicating the underlying procedures and logic that needs to be expressed in programs. One major difficulty of programming is turning concept into code , especially when dealing with the APIs of unfamiliar libraries. Recently, there has been a proliferation of machine learning methods for code generation and retrieval from natural language queries , but these have primarily been evaluated purely based on retrieval accuracy or overlap of generated code with developer-written code, and the actual effect of these methods on the developer workflow is surprisingly unattested. In this article, we perform the first comprehensive investigation of the promise and challenges of using such technology inside the PyCharm IDE, asking, “At the current state of technology does it improve developer productivity or accuracy, how does it affect the developer experience, and what are the remaining gaps and challenges?” To facilitate the study, we first develop a plugin for the PyCharm IDE that implements a hybrid of code generation and code retrieval functionality, and we orchestrate virtual environments to enable collection of many user events (e.g., web browsing, keystrokes, fine-grained code edits). We ask developers with various backgrounds to complete 7 varieties of 14 Python programming tasks ranging from basic file manipulation to machine learning or data visualization, with or without the help of the plugin. While qualitative surveys of developer experience are largely positive, quantitative results with regards to increased productivity, code quality, or program correctness are inconclusive. Further analysis identifies several pain points that could improve the effectiveness of future machine learning-based code generation/retrieval developer assistants and demonstrates when developers prefer code generation over code retrieval and vice versa. We release all data and software to pave the road for future empirical studies on this topic, as well as development of better code generation models.
more » « less
Full Text Available
Capturing Structural Locality in Non-parametric Language Models

Xu, Frank F. ; He, Junxian ; Neubig, Graham ; Hellendoorn, Vincent Josua ( April 2022 , International Conference on Learning Representations)

Full Text Available

« Prev Next »