Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation

Poliak, Adam; Haldar, Aparajita; Rudinger, Rachel; Hu, J. Edward; Pavlick, Ellie; White, Aaron Steven; Van Durme, Benjamin

doi:10.18653/v1/D18-1007

Citation Details

We present a large-scale collection of diverse natural language inference (NLI) datasets that help provide insight into how well a sentence representation captures distinct types of reasoning. The collection results from recasting 13 existing datasets from 7 semantic phenomena into a common NLI structure, resulting in over half a million labeled context-hypothesis pairs in total. We refer to our collection as the DNC: Diverse Natural Language Inference Collection. The DNC is available online at https://www.decomp.net, and will grow over time as additional resources are recast and added from novel sources.

Award ID(s): 1749025

PAR ID: 10111906

Author(s) / Creator(s): Poliak, Adam; Haldar, Aparajita; Rudinger, Rachel; Hu, J. Edward; Pavlick, Ellie; White, Aaron Steven; Van Durme, Benjamin

Date Published: 2018-01-01

Journal Name: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing

Page Range / eLocation ID: 67 to 81

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.18653/v1/D18-1007
