From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

He, Jianliang; Chen, Siyu; Zhang, Fengzhuo; Yang, Zhuoran

Citation Details

In this work, we theoretically investigate why large language model (LLM)-empowered agents can solve decision-making problems in the physical world. We consider a hierarchical reinforcement learning (RL) model where the LLM Planner handles high-level task planning and the Actor performs low-level execution. Within this model, the LLM Planner operates in a partially observable Markov decision process (POMDP), iteratively generating language-based subgoals through prompting. Assuming appropriate pretraining data, we prove that the pretrained LLM Planner effectively conducts Bayesian aggregated imitation learning (BAIL) via in-context learning. We also demonstrate the need for exploration beyond the subgoals produced by BAIL, showing that naively executing these subgoals results in linear regret. To address this, we propose an ε-greedy exploration strategy for BAIL, which we prove achieves sublinear regret when pretraining error is low. Finally, we extend our theoretical framework to cases where the LLM Planner acts as a world model to infer the environment’s transition model and to multi-agent settings, facilitating coordination among multiple Actors. more »

Award ID(s):: 2413243

PAR ID:: 10588220

Author(s) / Creator(s):: He, Jianliang; Chen, Siyu; Zhang, Fengzhuo; Yang, Zhuoran

Publisher / Repository:: International Conference on Machine Learning 2024

Date Published:: 2024-07-21

Format(s):: Medium: X

Location:: International Conference on Machine Learning 2024

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this