What is Learned in Visually Grounded Neural Syntax Acquisition

Kojima, Noriyuki; Averbuch-Elor, Hadar; Rush, Alexander; Artzi, Yoav

doi:10.18653/v1/2020.acl-main.234

Visual features are a promising signal for learning bootstrap textual models. However, blackbox learning models make it difficult to isolate the specific contribution of visual components. In this analysis, we consider the case study of the Visually Grounded Neural Syntax Learner (Shi et al., 2019), a recent approach for learning syntax from a visual training signal. By constructing simplified versions of the model, we isolate the core factors that yield the model’s strong performance. Contrary to what the model might be capable of learning, we find significantly less expressive versions produce similar predictions and perform just as well, or even better. We also find that a simple lexical signal of noun concreteness plays the main role in the model’s predictions as opposed to more complex syntactic reasoning.

Award ID(s): 1656998

PAR ID: 10197947

Author(s) / Creator(s): Kojima, Noriyuki; Averbuch-Elor, Hadar; Rush, Alexander; Artzi, Yoav

Date Published: 2020-01-01

Journal Name: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

Page Range / eLocation ID: 2615 to 2635

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Conference Paper:
https://doi.org/10.18653/v1/2020.acl-main.234