What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization

Zhang, Yufeng; Zhang, Fengzhuo; Yang, Zhuoran; Wang, Zhaoran

Citation Details

This content will become publicly available on May 6, 2026

What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization

In-Context Learning (ICL) ability has been found efficient across a wide range of applications, where the Large Language Models (LLM) learn to complete the tasks from the examples in the prompt without tuning the parameters. In this work, we conduct a comprehensive study to understand ICL from a statistical perspective. First, we show that the perfectly pretrained LLMs perform Bayesian Model Averaging (BMA) for ICL under a dynamic model of examples in the prompt. The average error analysis for ICL is then built for the perfectly pretrained LLMs with the analysis of BMA. Second, we demonstrate how the attention structure boosts the BMA implementation. With sufficient examples in the prompt, attention is proven to perform BMA under the Gaussian linear ICL model, which also motivates the explicit construction of the hidden concepts from the attention heads' values. Finally, we analyze the pretraining behavior of LLMs. The pretraining error is decomposed as the generalization error and the approximation error. The generalization error is upper bounded via the PAC-Bayes framework. Then the ICL average error of the pretrained LLMs is shown to be the sum of O(T^{-1}) and the pretraining error. In addition, we analyze the ICL performance of the pretrained LLMs with misspecified examples. more »

Award ID(s):: 2413243

PAR ID:: 10588218

Author(s) / Creator(s):: Zhang, Yufeng; Zhang, Fengzhuo; Yang, Zhuoran; Wang, Zhaoran

Publisher / Repository:: International Conference on Artificial Intelligence and Statistics

Date Published:: 2025-05-06

Journal Name:: Journal of medicinal and chemical sciences

ISSN:: 2640-3498

Format(s):: Medium: X

Location:: The 28th International Conference on Artificial Intelligence and Statistics

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on May 6, 2026
Conference Proceeding:
The DOI is not currently available.

More Like this