NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Iterative Paraphrastic Augmentation with Discriminative Span Alignment

https://doi.org/10.1162/tacl_a_00380

Culkin, Ryan; Hu, J. Edward; Stengel-Eskin, Elias; Qin, Guanghui; Durme, Benjamin Van (January 2021, Transactions of the Association for Computational Linguistics)
null (Ed.)
Abstract We introduce a novel paraphrastic augmentation strategy based on sentence-level lexically constrained paraphrasing and discriminative span alignment. Our approach allows for the large-scale expansion of existing datasets or the rapid creation of new datasets using a small, manually produced seed corpus. We demonstrate our approach with experiments on the Berkeley FrameNet Project, a large-scale language understanding effort spanning more than two decades of human labor. With four days of training data collection for a span alignment model and one day of parallel compute, we automatically generate and release to the community 495,300 unique (Frame,Trigger) pairs in diverse sentential contexts, a roughly 50-fold expansion atop FrameNet v1.7. The resulting dataset is intrinsically and extrinsically evaluated in detail, showing positive results on a downstream task.
more » « less
Full Text Available
Joint Universal Syntactic and Semantic Parsing

https://doi.org/10.1162/tacl_a_00396

Stengel-Eskin, Elias; Murray, Kenton; Zhang, Sheng; Steven White, Aaron; Van Durme, Benjamin (January 2021, Transactions of the Association for Computational Linguistics)
null (Ed.)
Full Text Available
Frequency, acceptability, and selection: A case study of clause-embedding

https://doi.org/10.5334/gjgl.1001

White, Aaron Steven; Rawlins, Kyle (January 2020, Glossa: a journal of general linguistics)
null (Ed.)
Full Text Available
Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

https://doi.org/10.18653/v1/D18-1007

Poliak, Adam; Haldar, Aparajita; Rudinger, Rachel; Hu, J. Edward; Pavlick, Ellie; White, Aaron Steven; Van Durme, Benjamin (January 2018, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing)

We present a large-scale collection of diverse natural language inference (NLI) datasets that help provide insight into how well a sentence representation captures distinct types of reasoning. The collection results from recasting 13 existing datasets from 7 semantic phenomena into a common NLI structure, resulting in over half a million labeled context-hypothesis pairs in total. We refer to our collection as the DNC: Diverse Natural Language Inference Collection. The DNC is available online at https://www.decomp.net, and will grow over time as additional resources are recast and added from novel sources.
more » « less
Full Text Available
Lexicosyntactic Inference in Neural Models

https://doi.org/10.18653/v1/D18-1501

White, Aaron Steven; Rudinger, Rachel; Rawlins, Kyle; Van Durme, Benjamin (January 2018, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing)

We investigate neural models’ ability to capture lexicosyntactic inferences: inferences triggered by the interaction of lexical and syntactic information. We take the task of event factuality prediction as a case study and build a factuality judgment dataset for all English clause-embedding verbs in various syntactic contexts. We use this dataset, which we make publicly available, to probe the behavior of current state-of-the-art neural systems, showing that these systems make certain systematic errors that are clearly visible through the lens of factuality prediction.
more » « less
Full Text Available

Search for: All records