On Importance Sampling-Based Evaluation of Latent Language Models

Logan IV, Robert L; Gardner, Matt; Singh, Sameer

doi:10.18653/v1/2020.acl-main.196

Citation Details

On Importance Sampling-Based Evaluation of Latent Language Models

Language models that use additional latent structures (e.g., syntax trees, coreference chains, knowledge graph links) provide several advantages over traditional language models. However, likelihood-based evaluation of these models is often intractable as it requires marginalizing over the latent space. Existing works avoid this issue by using importance sampling. Although this approach has asymptotic guarantees, analysis is rarely conducted on the effect of decisions such as sample size and choice of proposal distribution on the reported estimates. In this paper, we carry out this analysis for three models: RNNG, EntityNLM, and KGLM. In addition, we elucidate subtle differences in how importance sampling is applied in these works that can have substantial effects on the final estimates, as well as provide theoretical results which reinforce the validity of this technique. more »

Award ID(s):: 1817183

PAR ID:: 10180483

Author(s) / Creator(s):: Logan IV, Robert L; Gardner, Matt; Singh, Sameer

Date Published:: 2020-01-01

Journal Name:: Annual Meeting of the Association for Computational Linguistics (ACL)

Page Range / eLocation ID:: 2171 to 2176

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2020.acl-main.196

More Like this