NSF PAR Search | NSF Public Access Repository


MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages

Wang, Zhiruo; Cuenca, Grace; Zhou, Shuyan; Xu, Frank F.; Neubig, Graham (January 2023, Findings of the Conference of the European Chapter of the Association for Computational Linguistics)

Full Text Available
In-IDE Code Generation from Natural Language: Promise and Challenges

https://doi.org/10.1145/3487569

Xu, Frank F.; Vasilescu, Bogdan; Neubig, Graham (April 2022, ACM Transactions on Software Engineering and Methodology)

A great part of software development involves conceptualizing or communicating the underlying procedures and logic that need to be expressed in programs. One major difficulty of programming is turning concept into code, especially when dealing with the APIs of unfamiliar libraries. Recently, there has been a proliferation of machine learning methods for code generation and retrieval from natural language queries, but these have primarily been evaluated purely based on retrieval accuracy or overlap of generated code with developer-written code, and the actual effect of these methods on the developer workflow is surprisingly unattested. In this article, we perform the first comprehensive investigation of the promise and challenges of using such technology inside the PyCharm IDE, asking, “At the current state of technology does it improve developer productivity or accuracy, how does it affect the developer experience, and what are the remaining gaps and challenges?” To facilitate the study, we first develop a plugin for the PyCharm IDE that implements a hybrid of code generation and code retrieval functionality, and we orchestrate virtual environments to enable collection of many user events (e.g., web browsing, keystrokes, fine-grained code edits). We ask developers with various backgrounds to complete 7 varieties of 14 Python programming tasks ranging from basic file manipulation to machine learning or data visualization, with or without the help of the plugin. While qualitative surveys of developer experience are largely positive, quantitative results with regard to increased productivity, code quality, or program correctness are inconclusive. Further analysis identifies several pain points that could improve the effectiveness of future machine learning-based code generation/retrieval developer assistants and demonstrates when developers prefer code generation over code retrieval and vice versa. We release all data and software to pave the road for future empirical studies on this topic, as well as development of better code generation models.
Full Text Available
Capturing Structural Locality in Non-parametric Language Models

Xu, Frank F.; He, Junxian; Neubig, Graham; Hellendoorn, Vincent Josua (April 2022, International Conference on Learning Representations)

Full Text Available
In-IDE Code Generation from Natural Language: Promise and Challenges

Xu, Frank F.; Vasilescu, Bogdan; Neubig, Graham (January 2021, ACM Transactions on Software Engineering and Methodology)
Full Text Available
Learning Structural Edits via Incremental Tree Transformations

Yao, Ziyu; Xu, Frank; Yin, Pengcheng; Sun, Huan; Neubig, Graham (January 2021, The Ninth International Conference on Learning Representations 2021 (ICLR'21))
Full Text Available
Learning Structural Edits via Incremental Tree Transformations

Yao, Ziyu; Xu, Frank F.; Yin, Pengcheng; Sun, Huan; Neubig, Graham (January 2021, International Conference on Learning Representations)
Full Text Available
How Can We Know What Language Models Know?

https://doi.org/10.1162/tacl_a_00324

Jiang, Zhengbao; Xu, Frank F.; Araki, Jun; Neubig, Graham (July 2020, Transactions of the Association for Computational Linguistics)

Recent work has presented intriguing results examining the knowledge contained in language models (LMs) by having the LM fill in the blanks of prompts such as “Obama is a __ by profession”. These prompts are usually manually created, and quite possibly sub-optimal; another prompt such as “Obama worked as a __” may result in more accurately predicting the correct profession. Because of this, given an inappropriate prompt, we might fail to retrieve facts that the LM does know, and thus any given prompt only provides a lower bound estimate of the knowledge contained in an LM. In this paper, we attempt to more accurately estimate the knowledge contained in LMs by automatically discovering better prompts to use in this querying process. Specifically, we propose mining-based and paraphrasing-based methods to automatically generate high-quality and diverse prompts, as well as ensemble methods to combine answers from different prompts. Extensive experiments on the LAMA benchmark for extracting relational knowledge from LMs demonstrate that our methods can improve accuracy from 31.1% to 39.6%, providing a tighter lower bound on what LMs know. We have released the code and the resulting LM Prompt And Query Archive (LPAQA) at https://github.com/jzbjyb/LPAQA.
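The idea of combining answers from several prompts can be illustrated with a minimal sketch. It assumes one simple combination rule, a weighted average of log-probabilities per candidate answer, which may differ from the ensemble methods the paper actually uses. The function name `ensemble_predict` and all probability values are hypothetical and chosen only for illustration; no real language model is queried.

```python
import math

def ensemble_predict(prompt_probs, weights=None):
    """Combine per-prompt answer distributions by a weighted
    average of log-probabilities (an assumed rule, for illustration)."""
    n = len(prompt_probs)
    if weights is None:
        weights = [1.0 / n] * n  # uniform weights by default
    candidates = set().union(*(p.keys() for p in prompt_probs))
    scores = {}
    for cand in candidates:
        # Candidates missing from a prompt get a tiny floor probability
        # so that log() stays defined.
        scores[cand] = sum(
            w * math.log(p.get(cand, 1e-9))
            for w, p in zip(weights, prompt_probs)
        )
    best = max(scores, key=scores.get)
    return best, scores

# Hypothetical distributions over the blank for two prompts.
prompt_probs = [
    {"politician": 0.30, "lawyer": 0.40, "actor": 0.30},  # "Obama is a __ by profession"
    {"politician": 0.60, "lawyer": 0.30, "actor": 0.10},  # "Obama worked as a __"
]
best, scores = ensemble_predict(prompt_probs)
print(best)
```

With these toy numbers the first prompt alone would pick "lawyer", while the combined score favors "politician", which is the kind of effect an ensemble over prompts is meant to capture.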
Full Text Available
Incorporating External Knowledge through Pre-training for Natural Language to Code Generation

https://doi.org/10.18653/v1/2020.acl-main.538

Xu, Frank F.; Jiang, Zhengbao; Yin, Pengcheng; Vasilescu, Bogdan; Neubig, Graham (July 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

Full Text Available
Parsimonious Morpheme Segmentation with an Application to Enriching Word Embeddings

https://doi.org/10.1109/BigData47090.2019.9005957

El-Kishky, Ahmed; Xu, Frank; Zhang, Aston; Han, Jiawei (December 2019, 2019 IEEE International Conference on Big Data (Big Data))

Traditionally, many text-mining tasks treat individual word-tokens as the finest meaningful semantic granularity. However, in many languages and specialized corpora, words are composed by concatenating semantically meaningful subword structures. Word-level analysis cannot leverage the semantic information present in such subword structures. With regard to word embedding techniques, this leads to not only poor embeddings for infrequent words in long-tailed text corpora but also weak capabilities for handling out-of-vocabulary words. In this paper we propose MorphMine for unsupervised morpheme segmentation. MorphMine applies a parsimony criterion to hierarchically segment words into the fewest number of morphemes at each level of the hierarchy. This leads to longer shared morphemes at each level of segmentation. Experiments show that MorphMine segments words in a variety of languages into human-verified morphemes. Additionally, we experimentally demonstrate that utilizing MorphMine morphemes to enrich word embeddings consistently improves embedding quality on a variety of embedding evaluations and a downstream language modeling task.
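The parsimony criterion (fewest morphemes per word) can be sketched for a single level of segmentation with a small dynamic program. This is not the MorphMine algorithm itself: it assumes a candidate morpheme vocabulary is already given, whereas MorphMine is unsupervised and hierarchical. The function name `fewest_morphemes` and the vocabulary below are hypothetical.

```python
def fewest_morphemes(word, vocab):
    """Segment `word` into the fewest pieces drawn from `vocab`.
    Returns a list of pieces, or None if no segmentation exists."""
    n = len(word)
    inf = float("inf")
    best = [inf] * (n + 1)  # best[i]: fewest pieces covering word[:i]
    back = [-1] * (n + 1)   # back[i]: start index of the last piece
    best[0] = 0
    for i in range(1, n + 1):
        for j in range(i):
            if best[j] + 1 < best[i] and word[j:i] in vocab:
                best[i] = best[j] + 1
                back[i] = j
    if best[n] == inf:
        return None
    # Walk the back-pointers to recover the pieces in order.
    pieces = []
    i = n
    while i > 0:
        j = back[i]
        pieces.append(word[j:i])
        i = j
    return pieces[::-1]

# Hypothetical candidate morphemes.
vocab = {"un", "break", "able", "breakable"}
segmentation = fewest_morphemes("unbreakable", vocab)
print(segmentation)
```

Preferring the two-piece split over "un" + "break" + "able" shows how minimizing the piece count yields longer morphemes at a given level; a hierarchical method could then split "breakable" further at the next level.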
Full Text Available
Machine Learning Enhanced Real-Time Intrusion Detection Using Timing Information

Xu, Hang; Mueller, Frank (December 2018, International Workshop on Trustworthy & Real-time Edge Computing for Cyber-Physical Systems)

Past work has investigated intrusion detection mechanisms for real-time control devices. This work contributes a novel framework that separates security monitoring and detection from real-time control, where the former is performed on cloud edge devices while the latter is run on embedded devices attached to the system that is controlled. We contribute a security monitoring system that validates worst-case timing bounds of the target controller and also validates its control outputs by comparing them against model-based predictions, which are derived from machine learning.
Full Text Available
