Snoopy: An Online Interface for Exploring the Effect of Pretraining Term Frequencies on Few-Shot LM Performance

Razeghi, Yasaman; Mekala, Raja Sekhar; Logan Iv, Robert L; Gardner, Matt; Singh, Sameer

doi:10.18653/v1/2022.emnlp-demos.39

Citation Details

Snoopy: An Online Interface for Exploring the Effect of Pretraining Term Frequencies on Few-Shot LM Performance

Current evaluation schemes for large language models often fail to consider the impact of the overlap between pretraining corpus and test data on model performance statistics. Snoopy is an online interface that allows researchers to study this impact in few-shot learning settings. Our demo provides term frequency statistics for the Pile, which is an 800 GB corpus, accompanied by the precomputed performance of EleutherAI/GPT models on more than 20 NLP benchmarks, including numerical, commonsense reasoning, natural language understanding, and question-answering tasks. Snoopy allows a user to interactively align specific terms in test instances with their frequency in the Pile, enabling exploratory analysis of how term frequency is related to the accuracy of the models, which are hard to discover through automated means. A user can look at correlations over various model sizes and numbers of in-context examples and visualize the result across multiple (potentially aggregated) datasets. Using Snoopy, we show that a researcher can quickly replicate prior analyses for numerical tasks while simultaneously allowing for much more expansive exploration that was previously challenging. Snoopy is available at https://nlp.ics.uci.edu/snoopy. more »

Award ID(s):: 2046873

PAR ID:: 10462493

Author(s) / Creator(s):: Razeghi, Yasaman; Mekala, Raja Sekhar; Logan Iv, Robert L; Gardner, Matt; Singh, Sameer

Date Published:: 2022-01-01

Journal Name:: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations

Page Range / eLocation ID:: 389 to 395

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2022.emnlp-demos.39

More Like this