Propagation networks for model-based control under partial observation

There has been an increasing interest in learning dynamics simulators for model-based control. Compared with off-the-shelf physics engines, a learnable simulator can quickly adapt to unseen objects, scenes, and tasks. However, existing models like interaction networks only work for fully observable...

Full description

Bibliographic Details
Main Authors:	Li, Yunzhu, Wu, Jiajun, Zhu, Junyan, Tenenbaum, Joshua B, Torralba, Antonio, Tedrake, Russell L
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format:	Article
Language:	English
Published:	IEEE 2020
Online Access:	https://hdl.handle.net/1721.1/126583

_version_	1826190523914780672
author	Li, Yunzhu Wu, Jiajun Zhu, Junyan Tenenbaum, Joshua B Torralba, Antonio Tedrake, Russell L
author2	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Li, Yunzhu Wu, Jiajun Zhu, Junyan Tenenbaum, Joshua B Torralba, Antonio Tedrake, Russell L
author_sort	Li, Yunzhu
collection	MIT
description	There has been an increasing interest in learning dynamics simulators for model-based control. Compared with off-the-shelf physics engines, a learnable simulator can quickly adapt to unseen objects, scenes, and tasks. However, existing models like interaction networks only work for fully observable systems; they also only consider pairwise interactions within a single time step, both restricting their use in practical systems. We introduce Propagation Networks (PropNet), a differentiable, learnable dynamics model that handles partially observable scenarios and enables instantaneous propagation of signals beyond pairwise interactions. With these innovations, our propagation networks not only outperform current learnable physics engines in forward simulation, but also achieves superior performance on various control tasks. Compared with existing deep reinforcement learning algorithms, model-based control with propagation networks is more accurate, efficient, and generalizable to novel, partially observable scenes and tasks.
first_indexed	2024-09-23T08:41:35Z
format	Article
id	mit-1721.1/126583
institution	Massachusetts Institute of Technology
language	English
last_indexed	2024-09-23T08:41:35Z
publishDate	2020
publisher	IEEE
record_format	dspace
spelling	mit-1721.1/1265832022-09-23T13:53:36Z Propagation networks for model-based control under partial observation Li, Yunzhu Wu, Jiajun Zhu, Junyan Tenenbaum, Joshua B Torralba, Antonio Tedrake, Russell L Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science There has been an increasing interest in learning dynamics simulators for model-based control. Compared with off-the-shelf physics engines, a learnable simulator can quickly adapt to unseen objects, scenes, and tasks. However, existing models like interaction networks only work for fully observable systems; they also only consider pairwise interactions within a single time step, both restricting their use in practical systems. We introduce Propagation Networks (PropNet), a differentiable, learnable dynamics model that handles partially observable scenarios and enables instantaneous propagation of signals beyond pairwise interactions. With these innovations, our propagation networks not only outperform current learnable physics engines in forward simulation, but also achieves superior performance on various control tasks. Compared with existing deep reinforcement learning algorithms, model-based control with propagation networks is more accurate, efficient, and generalizable to novel, partially observable scenes and tasks. Charles Stark Draper Laboratory. Sponsor Award (SC001-0000001002) United States. National Aeronautics and Space Administration Sponsor Award (NNX16AC49A) National Science Foundation (U.S.) (Grant 1524817) United States. Defense Advanced Research Projects Agency. Explainable Artificial Intelligence (Grant FA8750-18-C000) United States. Office of Naval Research. Multidisciplinary University Research Initiative (Grant N00014-16-1-2007) 2020-08-14T13:23:08Z 2020-08-14T13:23:08Z 2019-05 2019-10-09T12:04:27Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/126583 Li, Yunzhu et al. “Propagation networks for model-based control under partial observation.” Paper presented at the 2019 International Conference on Robotics and Automation (ICRA), Montréal, Québec, May 20-24, 2019, IEEE © 2019 The Author(s) en 10.1109/ICRA.2019.8793509 2019 International Conference on Robotics and Automation (ICRA) Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf IEEE arXiv
spellingShingle	Li, Yunzhu Wu, Jiajun Zhu, Junyan Tenenbaum, Joshua B Torralba, Antonio Tedrake, Russell L Propagation networks for model-based control under partial observation
title	Propagation networks for model-based control under partial observation
title_full	Propagation networks for model-based control under partial observation
title_fullStr	Propagation networks for model-based control under partial observation
title_full_unstemmed	Propagation networks for model-based control under partial observation
title_short	Propagation networks for model-based control under partial observation
title_sort	propagation networks for model based control under partial observation
url	https://hdl.handle.net/1721.1/126583
work_keys_str_mv	AT liyunzhu propagationnetworksformodelbasedcontrolunderpartialobservation AT wujiajun propagationnetworksformodelbasedcontrolunderpartialobservation AT zhujunyan propagationnetworksformodelbasedcontrolunderpartialobservation AT tenenbaumjoshuab propagationnetworksformodelbasedcontrolunderpartialobservation AT torralbaantonio propagationnetworksformodelbasedcontrolunderpartialobservation AT tedrakerusselll propagationnetworksformodelbasedcontrolunderpartialobservation

Propagation networks for model-based control under partial observation

Similar Items