Can LLMs Generate Random Numbers? Evaluating LLM Sampling in Controlled Domains

Aspen K Hopkins, Alex Renda

Citation Details

Practitioners frequently take multiple samples from large language models (LLMs) to explore the distribution of completions induced by a given prompt. While individual samples can give high-quality results for given tasks, collectively there are no guarantees of the distribution over these samples induced by the generating LLM. In this paper, we empirically evaluate LLMs’ capabilities as distribution samplers. We identify core concepts and metrics underlying LLM-based sampling, including different sampling methodologies and prompting strategies. Using a set of controlled domains we evaluate the error and variance of the distributions induced by the LLM. We find that LLMs struggle to induce reasonable distributions over generated elements, suggesting that practitioners should more carefully consider the semantics and methodologies of sampling from LLMs. more »

Award ID(s):: 1918839

PAR ID:: 10498692

Author(s) / Creator(s):: Aspen K Hopkins, Alex Renda

Publisher / Repository:: Sampling and Optimization in Discrete Space (SODS) ICML 2023 Workshop

Date Published:: 2023-10-02

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Workshop Report:
The DOI is not currently available.

More Like this