Situated Mapping of Sequential Instructions to Actions with Single-step Reward Observation

Suhr, Alane; Artzi, Yoav


We propose a learning approach for mapping context-dependent sequential instructions to actions. We address the problem of discourse and state dependencies with an attention-based model that considers both the history of the interaction and the state of the world. To train from start and goal states without access to demonstrations, we propose SESTRA, a learning algorithm that takes advantage of single-step reward observations and immediate expected reward maximization. We evaluate on the SCONE domains, and show absolute accuracy improvements of 9.8%-25.3% across the domains over approaches that use high-level logical representations.
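As a rough illustration of "immediate expected reward maximization" under single-step reward observation, the sketch below assumes that, in a given state, a reward can be observed for every available action, and that the policy is a softmax over per-action scores. The function names, the toy rewards, and the learning rate are hypothetical and are not taken from the paper; this is a minimal sketch of the general idea, not the SESTRA algorithm itself.

```python
import math

def softmax(scores):
    """Turn raw action scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def expected_immediate_reward(scores, rewards):
    """Expected one-step reward under the softmax policy.

    `rewards[i]` is the (assumed observable) reward of taking action i
    in the current state.
    """
    probs = softmax(scores)
    return sum(p * r for p, r in zip(probs, rewards))

def gradient_wrt_scores(scores, rewards):
    """Gradient of the expected immediate reward w.r.t. each score.

    For a softmax policy: dJ/ds_k = p_k * (r_k - J).
    """
    probs = softmax(scores)
    expected = sum(p * r for p, r in zip(probs, rewards))
    return [p * (r - expected) for p, r in zip(probs, rewards)]

def ascent_step(scores, rewards, lr=0.5):
    """One gradient-ascent step on the expected immediate reward."""
    grad = gradient_wrt_scores(scores, rewards)
    return [s + lr * g for s, g in zip(scores, grad)]

# Toy state with three actions; only action 0 moves toward the goal.
scores = [0.0, 0.0, 0.0]
rewards = [1.0, -1.0, -1.0]  # hypothetical single-step rewards
before = expected_immediate_reward(scores, rewards)
after = expected_immediate_reward(ascent_step(scores, rewards), rewards)
```

Because every action's reward in the state contributes to the update, the objective needs no multi-step return estimate, which is the contrast with standard policy-gradient training that the abstract's phrasing suggests.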

Award ID(s): 1656998

PAR ID: 10095009

Author(s) / Creator(s): Suhr, Alane; Artzi, Yoav

Date Published: 2018-01-01

Journal Name: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Volume: 1

Page Range / eLocation ID: 2072-2082

Format(s): Medium: X

Sponsoring Org: National Science Foundation

