Reinforcement Learning for Mapping Instructions to Actions

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function that defines the quality of the executed actions. During training, the learner repeatedly constructs action sequences for a...

Full description

Bibliographic Details
Main Authors:	Branavan, Satchuthanan R., Chen, Harr, Zettlemoyer, Luke S., Barzilay, Regina
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format:	Article
Language:	en_US
Published:	Association for Computational Linguistics 2010
Subjects:	algorithms design experimentation languages measurement performance
Online Access:	http://hdl.handle.net/1721.1/59313 https://orcid.org/0000-0002-2921-8201

Description
Summary:	In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function that defines the quality of the executed actions. During training, the learner repeatedly constructs action sequences for a set of documents, executes those actions, and observes the resulting reward. We use a policy gradient algorithm to estimate the parameters of a log-linear model for action selection. We apply our method to interpret instructions in two domains --- Windows troubleshooting guides and game tutorials. Our results demonstrate that this method can rival supervised learning techniques while requiring few or no annotated training examples.

Reinforcement Learning for Mapping Instructions to Actions

Similar Items