Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation

<p>In order to enable more widespread application of robots, we are required to reduce the human effort for the introduction of existing robotic platforms to new environments and tasks. In this thesis, we identify three complementary strategies to address this challenge, via the use of imitati...

Full description

Bibliographic Details
Main Author: Wulfmeier, M
Other Authors: Posner, I
Format: Thesis
Language:English
Published: 2018
Subjects:
_version_ 1797060160406945792
author Wulfmeier, M
author2 Posner, I
author_facet Posner, I
Wulfmeier, M
author_sort Wulfmeier, M
collection OXFORD
description <p>In order to enable more widespread application of robots, we are required to reduce the human effort for the introduction of existing robotic platforms to new environments and tasks. In this thesis, we identify three complementary strategies to address this challenge, via the use of imitation learning, domain adaptation, and transfer learning based on simulations. The overall work strives to reduce the effort of generating training data by employing inexpensively obtainable labels and by transferring information between different domains with deviating underlying properties.</p> <p>Imitation learning enables a straightforward way for untrained personnel to teach robots to perform tasks by providing demonstrations, which represent a comparably inexpensive source of supervision. We develop a scalable approach to identify the preferences underlying demonstration data via the framework of inverse reinforcement learning. The method enables integration of the extracted preferences as cost maps into existing motion planning systems. We further incorporate prior domain knowledge and demonstrate that the approach outperforms the baselines including manually crafted cost functions.</p> <p>In addition to employing low-cost labels from demonstration, we investigate the adaptation of models to domains without available supervisory information. Specifically, the challenge of appearance changes in outdoor robotics such as illumination and weather shifts is addressed using an adversarial domain adaptation approach. A principal advantage of the method over prior work is the straightforwardness of adapting arbitrary, state-of-the-art neural network architectures. Finally, we demonstrate performance benefits of the method for semantic segmentation of drivable terrain.</p> <p>Our last contribution focuses on simulation to real world transfer learning, where the characteristic differences are not only regarding the visual appearance but the underlying system dynamics. Our work aims at parallel training in both systems and mutual guidance via auxiliary alignment rewards to accelerate training for real world systems. The approach is shown to outperform various baselines as well as a unilateral alignment variant.</p>
first_indexed 2024-03-06T20:13:29Z
format Thesis
id oxford-uuid:2b5eeb55-639a-40ae-83b7-bd01fc8fd6cc
institution University of Oxford
language English
last_indexed 2024-03-06T20:13:29Z
publishDate 2018
record_format dspace
spelling oxford-uuid:2b5eeb55-639a-40ae-83b7-bd01fc8fd6cc2022-03-26T12:30:27ZEfficient Supervision for Robot Learning via Imitation, Simulation, and AdaptationThesishttp://purl.org/coar/resource_type/c_db06uuid:2b5eeb55-639a-40ae-83b7-bd01fc8fd6ccMachine learningRoboticsEnglishORA Deposit2018Wulfmeier, MPosner, IZisserman, ANewman, PRiedmiller, M<p>In order to enable more widespread application of robots, we are required to reduce the human effort for the introduction of existing robotic platforms to new environments and tasks. In this thesis, we identify three complementary strategies to address this challenge, via the use of imitation learning, domain adaptation, and transfer learning based on simulations. The overall work strives to reduce the effort of generating training data by employing inexpensively obtainable labels and by transferring information between different domains with deviating underlying properties.</p> <p>Imitation learning enables a straightforward way for untrained personnel to teach robots to perform tasks by providing demonstrations, which represent a comparably inexpensive source of supervision. We develop a scalable approach to identify the preferences underlying demonstration data via the framework of inverse reinforcement learning. The method enables integration of the extracted preferences as cost maps into existing motion planning systems. We further incorporate prior domain knowledge and demonstrate that the approach outperforms the baselines including manually crafted cost functions.</p> <p>In addition to employing low-cost labels from demonstration, we investigate the adaptation of models to domains without available supervisory information. Specifically, the challenge of appearance changes in outdoor robotics such as illumination and weather shifts is addressed using an adversarial domain adaptation approach. A principal advantage of the method over prior work is the straightforwardness of adapting arbitrary, state-of-the-art neural network architectures. Finally, we demonstrate performance benefits of the method for semantic segmentation of drivable terrain.</p> <p>Our last contribution focuses on simulation to real world transfer learning, where the characteristic differences are not only regarding the visual appearance but the underlying system dynamics. Our work aims at parallel training in both systems and mutual guidance via auxiliary alignment rewards to accelerate training for real world systems. The approach is shown to outperform various baselines as well as a unilateral alignment variant.</p>
spellingShingle Machine learning
Robotics
Wulfmeier, M
Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_full Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_fullStr Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_full_unstemmed Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_short Efficient Supervision for Robot Learning via Imitation, Simulation, and Adaptation
title_sort efficient supervision for robot learning via imitation simulation and adaptation
topic Machine learning
Robotics
work_keys_str_mv AT wulfmeierm efficientsupervisionforrobotlearningviaimitationsimulationandadaptation