Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

AbstractWe study continual learning for natural language instruction generation, by observing human users’ instruction execution. We focus on a collaborative scenario, where the system both acts and delegates tasks to human users using natural language. We compare user execution of g...

Full description

Bibliographic Details
Main Authors:	Noriyuki Kojima, Alane Suhr, Yoav Artzi
Format:	Article
Language:	English
Published:	The MIT Press 2021-01-01
Series:	Transactions of the Association for Computational Linguistics
Online Access:	https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00428/108610/Continual-Learning-for-Grounded-Instruction

_version_	1818215423887278080
author	Noriyuki Kojima Alane Suhr Yoav Artzi
author_facet	Noriyuki Kojima Alane Suhr Yoav Artzi
author_sort	Noriyuki Kojima
collection	DOAJ
description	AbstractWe study continual learning for natural language instruction generation, by observing human users’ instruction execution. We focus on a collaborative scenario, where the system both acts and delegates tasks to human users using natural language. We compare user execution of generated instructions to the original system intent as an indication to the system’s success communicating its intent. We show how to use this signal to improve the system’s ability to generate instructions via contextual bandit learning. In interaction with real users, our system demonstrates dramatic improvements in its ability to generate language over time.
first_indexed	2024-12-12T06:35:51Z
format	Article
id	doaj.art-df09a249402a44debc6aed2c01716cea
institution	Directory Open Access Journal
issn	2307-387X
language	English
last_indexed	2024-12-12T06:35:51Z
publishDate	2021-01-01
publisher	The MIT Press
record_format	Article
series	Transactions of the Association for Computational Linguistics
spelling	doaj.art-df09a249402a44debc6aed2c01716cea2022-12-22T00:34:29ZengThe MIT PressTransactions of the Association for Computational Linguistics2307-387X2021-01-0191303131910.1162/tacl_a_00428Continual Learning for Grounded Instruction Generation by Observing Human Following BehaviorNoriyuki Kojima0Alane Suhr1Yoav Artzi2Department of Computer Science and Cornell Tech, Cornell University, USA. nk654@cornell.eduDepartment of Computer Science and Cornell Tech, Cornell University, USA. suhr@cs.cornell.eduDepartment of Computer Science and Cornell Tech, Cornell University, USA. yoav@cs.cornell.edu AbstractWe study continual learning for natural language instruction generation, by observing human users’ instruction execution. We focus on a collaborative scenario, where the system both acts and delegates tasks to human users using natural language. We compare user execution of generated instructions to the original system intent as an indication to the system’s success communicating its intent. We show how to use this signal to improve the system’s ability to generate instructions via contextual bandit learning. In interaction with real users, our system demonstrates dramatic improvements in its ability to generate language over time.https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00428/108610/Continual-Learning-for-Grounded-Instruction
spellingShingle	Noriyuki Kojima Alane Suhr Yoav Artzi Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior Transactions of the Association for Computational Linguistics
title	Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
title_full	Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
title_fullStr	Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
title_full_unstemmed	Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
title_short	Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
title_sort	continual learning for grounded instruction generation by observing human following behavior
url	https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00428/108610/Continual-Learning-for-Grounded-Instruction
work_keys_str_mv	AT noriyukikojima continuallearningforgroundedinstructiongenerationbyobservinghumanfollowingbehavior AT alanesuhr continuallearningforgroundedinstructiongenerationbyobservinghumanfollowingbehavior AT yoavartzi continuallearningforgroundedinstructiongenerationbyobservinghumanfollowingbehavior

Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

Similar Items