Neural probabilistic motor primitives for humanoid control

We focus on the problem of learning a single motor module that can flexibly express a range of behaviors for the control of high-dimensional physically simulated humanoids. To do this, we propose a motor architecture that has the general structure of an inverse model with a latent-variable bottlenec...

Full description

Bibliographic Details
Main Authors:	Merel, J, Hasenclever, L, Galashov, A, Ahuja, A, Pham, V, Wayne, G, Teh, Y, Heess, N
Format:	Conference item
Published:	2019

_version_	1826277170075402240
author	Merel, J Hasenclever, L Galashov, A Ahuja, A Pham, V Wayne, G Teh, Y Heess, N
author_facet	Merel, J Hasenclever, L Galashov, A Ahuja, A Pham, V Wayne, G Teh, Y Heess, N
author_sort	Merel, J
collection	OXFORD
description	We focus on the problem of learning a single motor module that can flexibly express a range of behaviors for the control of high-dimensional physically simulated humanoids. To do this, we propose a motor architecture that has the general structure of an inverse model with a latent-variable bottleneck. We show that it is possible to train this model entirely offline to compress thousands of expert policies and learn a motor primitive embedding space. The trained neural probabilistic motor primitive system can perform one-shot imitation of whole-body humanoid behaviors, robustly mimicking unseen trajectories. Additionally, we demonstrate that it is also straightforward to train controllers to reuse the learned motor primitive space to solve tasks, and the resulting movements are relatively naturalistic. To support the training of our model, we compare two approaches for offline policy cloning, including an experience efficient method which we call linear feedback policy cloning. We encourage readers to view a supplementary video (https://youtu.be/CaDEf-QcKwA) summarizing our results.
first_indexed	2024-03-06T23:24:52Z
format	Conference item
id	oxford-uuid:6a02be08-a2b6-46b5-a5cf-0ee99a23c67a
institution	University of Oxford
last_indexed	2024-03-06T23:24:52Z
publishDate	2019
record_format	dspace
spelling	oxford-uuid:6a02be08-a2b6-46b5-a5cf-0ee99a23c67a2022-03-26T18:54:46ZNeural probabilistic motor primitives for humanoid controlConference itemhttp://purl.org/coar/resource_type/c_5794uuid:6a02be08-a2b6-46b5-a5cf-0ee99a23c67aSymplectic Elements at Oxford2019Merel, JHasenclever, LGalashov, AAhuja, APham, VWayne, GTeh, YHeess, NWe focus on the problem of learning a single motor module that can flexibly express a range of behaviors for the control of high-dimensional physically simulated humanoids. To do this, we propose a motor architecture that has the general structure of an inverse model with a latent-variable bottleneck. We show that it is possible to train this model entirely offline to compress thousands of expert policies and learn a motor primitive embedding space. The trained neural probabilistic motor primitive system can perform one-shot imitation of whole-body humanoid behaviors, robustly mimicking unseen trajectories. Additionally, we demonstrate that it is also straightforward to train controllers to reuse the learned motor primitive space to solve tasks, and the resulting movements are relatively naturalistic. To support the training of our model, we compare two approaches for offline policy cloning, including an experience efficient method which we call linear feedback policy cloning. We encourage readers to view a supplementary video (https://youtu.be/CaDEf-QcKwA) summarizing our results.
spellingShingle	Merel, J Hasenclever, L Galashov, A Ahuja, A Pham, V Wayne, G Teh, Y Heess, N Neural probabilistic motor primitives for humanoid control
title	Neural probabilistic motor primitives for humanoid control
title_full	Neural probabilistic motor primitives for humanoid control
title_fullStr	Neural probabilistic motor primitives for humanoid control
title_full_unstemmed	Neural probabilistic motor primitives for humanoid control
title_short	Neural probabilistic motor primitives for humanoid control
title_sort	neural probabilistic motor primitives for humanoid control
work_keys_str_mv	AT merelj neuralprobabilisticmotorprimitivesforhumanoidcontrol AT hasencleverl neuralprobabilisticmotorprimitivesforhumanoidcontrol AT galashova neuralprobabilisticmotorprimitivesforhumanoidcontrol AT ahujaa neuralprobabilisticmotorprimitivesforhumanoidcontrol AT phamv neuralprobabilisticmotorprimitivesforhumanoidcontrol AT wayneg neuralprobabilisticmotorprimitivesforhumanoidcontrol AT tehy neuralprobabilisticmotorprimitivesforhumanoidcontrol AT heessn neuralprobabilisticmotorprimitivesforhumanoidcontrol

Neural probabilistic motor primitives for humanoid control

Similar Items