Selective Perception: Learning Concise State Descriptions for Language Model Actors

Nottingham, Kolby; Razeghi, Yasaman; Kim, Kyungmin; Lanier, J_B; Baldi, Pierre; Fox, Roy; Singh, Sameer

Citation Details

The latest large language models (LMs) support increasingly longer contexts. While this trend permits using substantial amounts of text with SOTA LMs, requiring these large LMs to process potentially redundant or irrelevant data needlessly increases inference time and cost. To remedy this problem, we propose BLINDER, a method that leverages a small finetuned LM to sample the minimal set of input features that maximizes the performance of a downstream LM. BLINDER trains an LM with a value head to estimate the likelihood of optimal outputs from a downstream LM given an input. We evaluate BLINDER on embodied decision making tasks with notoriously verbose state descriptions: NetHack and robot planning. BLINDER reduces the length of LM actor input by 87% and 99% while improving task success rates by 158% and 54% on NetHack and robot planning respectively which represents substantial inference cost savings while actually increasing performance. more »

Award ID(s):: 2046873

PAR ID:: 10526348

Author(s) / Creator(s):: Nottingham, Kolby; Razeghi, Yasaman; Kim, Kyungmin; Lanier, J_B; Baldi, Pierre; Fox, Roy; Singh, Sameer

Publisher / Repository:: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)

Date Published:: 2024-06-01

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this