On a Connection Between Imitation Learning and RLHF

Xiao, Teng; Yuan, Yige; Li, Mingxiao; Chen, Zhengyu; Honavar, Vasant G

Citation Details

This content will become publicly available on April 28, 2026

On a Connection Between Imitation Learning and RLHF

This work studies the alignment of large language models with preference data from an imitation learning perspective. We establish a close theoretical connection between reinforcement learning from human feedback (RLHF) and imitation learning (IL), revealing that RLHF implicitly performs imitation learning on the preference data distribution. Building on this connection, we propose DIL, a principled framework that directly optimizes the imitation learning objective. DIL provides a unified imitation learning perspective on alignment, encompassing existing alignment algorithms as special cases while naturally introducing new variants. By bridging IL and RLHF, DIL offers new insights into alignment with RLHF. Extensive experiments demonstrate that DIL outperforms existing methods on various challenging benchmarks. The code for DIL is available at https://github.com/tengxiao1/DIL. more »

Award ID(s):: 2226025 2225824

PAR ID:: 10638657

Author(s) / Creator(s):: Xiao, Teng; Yuan, Yige; Li, Mingxiao; Chen, Zhengyu; Honavar, Vasant G

Publisher / Repository:: International Conference on Representation Learning 2025 (ICLR 2025)

Date Published:: 2025-04-28

Subject(s) / Keyword(s):: Large Language Models Machine Learning Preference Alignment

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on April 28, 2026
Conference Paper:
The DOI is not currently available.

More Like this