Learn what matters: cross-domain imitation learning with task-relevant embeddings

We study how an autonomous agent learns to perform a task from demonstrations in a different domain, such as a different environment or different agent. Such cross-domain imitation learning is required to, for example, train an artificial agent from demonstrations of a human expert. We propose a sca...

Full description

Bibliographic Details
Main Authors: Franzmeyer, T, Torr, PHS, Henriques, JF
Format: Conference item
Language:English
Published: Curran Associates 2023
_version_ 1797112955619246080
author Franzmeyer, T
Torr, PHS
Henriques, JF
author_facet Franzmeyer, T
Torr, PHS
Henriques, JF
author_sort Franzmeyer, T
collection OXFORD
description We study how an autonomous agent learns to perform a task from demonstrations in a different domain, such as a different environment or different agent. Such cross-domain imitation learning is required to, for example, train an artificial agent from demonstrations of a human expert. We propose a scalable framework that enables cross-domain imitation learning without access to additional demonstrations or further domain knowledge. We jointly train the learner agent's policy and learn a mapping between the learner and expert domains with adversarial training. We effect this by using a mutual information criterion to find an embedding of the expert's state space that contains task-relevant information and is invariant to domain specifics. This step significantly simplifies estimating the mapping between the learner and expert domains and hence facilitates end-to-end learning. We demonstrate successful transfer of policies between considerably different domains, without extra supervision such as additional demonstrations, and in situations where other methods fail.
first_indexed 2024-03-07T08:29:46Z
format Conference item
id oxford-uuid:d1dbb71a-5d3f-40fc-aa64-5ef120569894
institution University of Oxford
language English
last_indexed 2024-04-09T03:55:23Z
publishDate 2023
publisher Curran Associates
record_format dspace
spelling oxford-uuid:d1dbb71a-5d3f-40fc-aa64-5ef1205698942024-03-12T10:45:50ZLearn what matters: cross-domain imitation learning with task-relevant embeddingsConference itemhttp://purl.org/coar/resource_type/c_5794uuid:d1dbb71a-5d3f-40fc-aa64-5ef120569894EnglishSymplectic ElementsCurran Associates2023Franzmeyer, TTorr, PHSHenriques, JFWe study how an autonomous agent learns to perform a task from demonstrations in a different domain, such as a different environment or different agent. Such cross-domain imitation learning is required to, for example, train an artificial agent from demonstrations of a human expert. We propose a scalable framework that enables cross-domain imitation learning without access to additional demonstrations or further domain knowledge. We jointly train the learner agent's policy and learn a mapping between the learner and expert domains with adversarial training. We effect this by using a mutual information criterion to find an embedding of the expert's state space that contains task-relevant information and is invariant to domain specifics. This step significantly simplifies estimating the mapping between the learner and expert domains and hence facilitates end-to-end learning. We demonstrate successful transfer of policies between considerably different domains, without extra supervision such as additional demonstrations, and in situations where other methods fail.
spellingShingle Franzmeyer, T
Torr, PHS
Henriques, JF
Learn what matters: cross-domain imitation learning with task-relevant embeddings
title Learn what matters: cross-domain imitation learning with task-relevant embeddings
title_full Learn what matters: cross-domain imitation learning with task-relevant embeddings
title_fullStr Learn what matters: cross-domain imitation learning with task-relevant embeddings
title_full_unstemmed Learn what matters: cross-domain imitation learning with task-relevant embeddings
title_short Learn what matters: cross-domain imitation learning with task-relevant embeddings
title_sort learn what matters cross domain imitation learning with task relevant embeddings
work_keys_str_mv AT franzmeyert learnwhatmatterscrossdomainimitationlearningwithtaskrelevantembeddings
AT torrphs learnwhatmatterscrossdomainimitationlearningwithtaskrelevantembeddings
AT henriquesjf learnwhatmatterscrossdomainimitationlearningwithtaskrelevantembeddings