Mixture of Demonstrations for In-Context Learning

Wang, Song; Chen, Zihan; Shi, Chengshuai; Shen, Cong; Li, Jundong

Citation Details

In-Context Learning (ICL) empowers Large Language Models (LLMs) to tackle various tasks by providing input-output examples as additional inputs, referred to as demonstrations. Nevertheless, the performance of ICL could be easily impacted by the quality of selected demonstrations. Existing efforts generally learn a retriever model to score each demonstration for selecting suitable demonstrations, however, the effect is suboptimal due to the large search space and the noise from unhelpful demonstrations. In this study, we introduce MoD, which partitions the demonstration pool into groups, each governed by an expert to reduce search space. We further design an expert-wise training strategy to alleviate the impact of unhelpful demonstrations when optimizing the retriever model. During inference, experts collaboratively retrieve demonstrations for the input query to enhance the ICL performance. We validate MoD via experiments across a range of NLP datasets and tasks, demonstrating its state-of-the-art performance and shedding new light on the future design of retrieval methods for ICL. more »

Award ID(s):: 2411248 2223769 2228534 2154962 2144209 2006844

PAR ID:: 10612750

Author(s) / Creator(s):: Wang, Song; Chen, Zihan; Shi, Chengshuai; Shen, Cong; Li, Jundong

Publisher / Repository:: Annual Conference on Neural Information Processing Systems

Date Published:: 2024-12-10

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this