<?xml version="1.0" encoding="UTF-8"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcq="http://purl.org/dc/terms/"><records count="1" morepages="false" start="1" end="1"><record rownumber="1"><dc:product_type>Conference Proceeding</dc:product_type><dc:title>Conformal Information Pursuit for Interactively Guiding Large Language Models</dc:title><dc:creator>Chan, Ryan; Ge, Yuyan; Dobriban, Edgar; Hassani, Hamed; Vidal, René</dc:creator><dc:corporate_author/><dc:editor/><dc:description>A significant use case of instruction-finetuned Large Language Models (LLMs) is
to solve question-answering tasks interactively. In this setting, an LLM agent is
tasked with making a prediction by sequentially querying relevant information from
the user, as opposed to a single-turn conversation. This paper explores sequential
querying strategies that aim to minimize the expected number of queries. One such
strategy is Information Pursuit (IP), a greedy algorithm that at each iteration selects
the query that maximizes information gain or equivalently minimizes uncertainty.
However, obtaining accurate estimates of mutual information or conditional entropy
for LLMs is very difficult in practice due to over- or under-confident LLM probabilities, which leads to suboptimal query selection and predictive performance. To
better estimate the uncertainty at each iteration, we propose Conformal Information
Pursuit (C-IP), an alternative approach to sequential information gain based on
conformal prediction sets. More specifically, C-IP leverages a relationship between
prediction sets and conditional entropy at each iteration to estimate uncertainty
based on the average size of conformal prediction sets. In contrast to conditional
entropy, we find that conformal prediction sets are a distribution-free and robust
method of measuring uncertainty. Experiments with 20 Questions show that C-IP
obtains better predictive performance and shorter query-answer chains compared
to previous approaches to IP and uncertainty-based chain-of-thought methods.
Furthermore, extending to an interactive medical setting between a doctor and a
patient on the MediQ dataset, C-IP achieves competitive performance with direct
single-turn prediction while offering greater interpretability.</dc:description><dc:publisher>NeurIPS</dc:publisher><dc:date>2025-12-13</dc:date><dc:nsf_par_id>10671235</dc:nsf_par_id><dc:journal_name/><dc:journal_volume/><dc:journal_issue/><dc:page_range_or_elocation/><dc:issn/><dc:isbn/><dc:doi>https://doi.org/</dc:doi><dcq:identifierAwardId>1943064</dcq:identifierAwardId><dc:subject/><dc:version_number/><dc:location/><dc:rights/><dc:institution/><dc:sponsoring_org>National Science Foundation</dc:sponsoring_org></record></records></rdf:RDF>