Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization

Salemi, Alireza; Zamani, Hamed

doi:10.1145/3731120.3744584

Citation Details

This content will become publicly available on July 18, 2026

Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization

This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents, each with a distinct task, backbone large language model (LLM), and RAG strategy. We introduce an iterative approach where the search engine generates retrieval results for the RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase. This feedback is then used to iteratively optimize the search engine using an expectation-maximization algorithm, with the goal of maximizing each agent's utility function. Additionally, we adapt this to an online setting, allowing the search engine to refine its behavior based on real-time individual agents feedback to better serve the results for each of them. Experiments on datasets from the Knowledge-Intensive Language Tasks (KILT) benchmark demonstrates that our approach significantly on average outperforms baselines across 18 RAG models. We demonstrate that our method effectively ''personalizes'' the retrieval for each RAG agent based on the collected feedback. Finally, we provide a comprehensive ablation study to explore various aspects of our method. more »

Award ID(s):: 2402873

PAR ID:: 10618715

Author(s) / Creator(s):: Salemi, Alireza; Zamani, Hamed

Publisher / Repository:: ACM

Date Published:: 2025-07-18

ISBN:: 9798400718618

Page Range / eLocation ID:: 183-193

Format(s):: Medium: X

Location:: Padua, Italy

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on July 18, 2026
Conference Paper:
https://doi.org/10.1145/3731120.3744584

More Like this