Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation

In order to enable more widespread application of robots, we are required to reduce the human effort for the introduction of existing robotic platforms to new environments and tasks. In this thesis, we identify three complementary strategies to address this challenge, via the use of imitati...


Bibliographic Details
Main Author:	Wulfmeier, M
Other Authors:	Posner, I
Format:	Thesis
Language:	English
Published:	2018
Subjects:	Machine learning; Robotics

_version_	1797060160406945792
author	Wulfmeier, M
author2	Posner, I
author_facet	Posner, I Wulfmeier, M
author_sort	Wulfmeier, M
collection	OXFORD
description	<p>To enable more widespread application of robots, we must reduce the human effort required to introduce existing robotic platforms to new environments and tasks. In this thesis, we identify three complementary strategies to address this challenge: imitation learning, domain adaptation, and transfer learning based on simulations. The overall work strives to reduce the effort of generating training data by employing inexpensively obtainable labels and by transferring information between domains with differing underlying properties.</p> <p>Imitation learning offers a straightforward way for untrained personnel to teach robots to perform tasks by providing demonstrations, which represent a comparatively inexpensive source of supervision. We develop a scalable approach to identify the preferences underlying demonstration data via the framework of inverse reinforcement learning. The method enables integration of the extracted preferences as cost maps into existing motion planning systems. We further incorporate prior domain knowledge and demonstrate that the approach outperforms baselines, including manually crafted cost functions.</p> <p>In addition to employing low-cost labels from demonstration, we investigate the adaptation of models to domains without available supervisory information. Specifically, we address the challenge of appearance changes in outdoor robotics, such as illumination and weather shifts, using an adversarial domain adaptation approach. A principal advantage of the method over prior work is the ease of adapting arbitrary, state-of-the-art neural network architectures. Finally, we demonstrate performance benefits of the method for semantic segmentation of drivable terrain.</p> <p>Our last contribution focuses on simulation-to-real-world transfer learning, where the domains differ not only in visual appearance but also in the underlying system dynamics. Our work aims to train in both systems in parallel, with mutual guidance via auxiliary alignment rewards, to accelerate training for real-world systems. The approach is shown to outperform various baselines as well as a unilateral alignment variant.</p>
first_indexed	2024-03-06T20:13:29Z
format	Thesis
id	oxford-uuid:2b5eeb55-639a-40ae-83b7-bd01fc8fd6cc
institution	University of Oxford
language	English
last_indexed	2024-03-06T20:13:29Z
publishDate	2018
record_format	dspace
spelling	oxford-uuid:2b5eeb55-639a-40ae-83b7-bd01fc8fd6cc; 2022-03-26T12:30:27Z; Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation; Thesis; http://purl.org/coar/resource_type/c_db06; uuid:2b5eeb55-639a-40ae-83b7-bd01fc8fd6cc; Machine learning; Robotics; English; ORA Deposit; 2018; Wulfmeier, M; Posner, I; Zisserman, A; Newman, P; Riedmiller, M
spellingShingle	Machine learning Robotics Wulfmeier, M Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title	Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_full	Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_fullStr	Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_full_unstemmed	Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_short	Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_sort	efficient supervision for robot learning via imitation simulation and adaptation
topic	Machine learning Robotics
work_keys_str_mv	AT wulfmeierm efficientsupervisionforrobotlearningviaimitationsimulationandadaptation

