MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

Pillutla, Krishna; Swayamdipta, Swabha; Zellers, Rowan; Thickstun, John; Welleck, Sean; hoi, Yejin; Harchaoui, Zaid

Citation Details

As major progress is made in open-ended text generation, measuring how close machine-generated text is to human language remains a critical open problem. We introduce MAUVE, a comparison measure for open-ended text generation, which directly compares the learnt distribution from a text generation model to the distribution of human-written text using divergence frontiers. MAUVE scales up to modern text generation models by computing information divergences in a quantized embedding space. Through an extensive empirical study on three open-ended generation tasks, we find that MAUVE identifies known properties of generated text, scales naturally with model size, and correlates with human judgments, with fewer restrictions than existing distributional evaluation metrics. more »

Award ID(s):: 2134012 2023166

PAR ID:: 10349865

Author(s) / Creator(s):: Pillutla, Krishna; Swayamdipta, Swabha; Zellers, Rowan; Thickstun, John; Welleck, Sean; hoi, Yejin; Harchaoui, Zaid

Date Published:: 2022-01-01

Journal Name:: Advances in neural information processing systems

ISSN:: 1049-5258

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this