Powderworld: A Platform for Understanding Generalization via Rich Task Distributions

One of the grand challenges of reinforcement learning is the ability to generalize to new tasks. However, general agents require a set of rich, diverse tasks to train on. Designing a ‘foundation environment’ for such tasks is tricky – the ideal environment would support a range of emergent phenomena...

Full description

Bibliographic Details
Main Author: Frans, Kevin
Other Authors: Isola, Phillip
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/151635
_version_ 1826211240101281792
author Frans, Kevin
author2 Isola, Phillip
author_facet Isola, Phillip
Frans, Kevin
author_sort Frans, Kevin
collection MIT
description One of the grand challenges of reinforcement learning is the ability to generalize to new tasks. However, general agents require a set of rich, diverse tasks to train on. Designing a ‘foundation environment’ for such tasks is tricky – the ideal environment would support a range of emergent phenomena, an expressive task space, and fast runtime. To take a step towards addressing this research bottleneck, this work presents Powderworld, a lightweight yet expressive simulation environment running directly on the GPU. Within Powderworld, two motivating challenges are presented, one for world-modelling and one for reinforcement learning. Each contains hand-designed test tasks to examine generalization. Experiments indicate that increasing the environment’s complexity improves generalization for world models and certain reinforcement learning agents, yet may inhibit learning in high-variance environments. Powderworld aims to support the study of generalization by providing a source of diverse tasks arising from the same core rules.
first_indexed 2024-09-23T15:02:47Z
format Thesis
id mit-1721.1/151635
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T15:02:47Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1516352023-08-01T03:16:50Z Powderworld: A Platform for Understanding Generalization via Rich Task Distributions Frans, Kevin Isola, Phillip Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science One of the grand challenges of reinforcement learning is the ability to generalize to new tasks. However, general agents require a set of rich, diverse tasks to train on. Designing a ‘foundation environment’ for such tasks is tricky – the ideal environment would support a range of emergent phenomena, an expressive task space, and fast runtime. To take a step towards addressing this research bottleneck, this work presents Powderworld, a lightweight yet expressive simulation environment running directly on the GPU. Within Powderworld, two motivating challenges are presented, one for world-modelling and one for reinforcement learning. Each contains hand-designed test tasks to examine generalization. Experiments indicate that increasing the environment’s complexity improves generalization for world models and certain reinforcement learning agents, yet may inhibit learning in high-variance environments. Powderworld aims to support the study of generalization by providing a source of diverse tasks arising from the same core rules. M.Eng. 2023-07-31T19:54:42Z 2023-07-31T19:54:42Z 2023-06 2023-06-06T16:34:54.162Z Thesis https://hdl.handle.net/1721.1/151635 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Frans, Kevin
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
title Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
title_full Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
title_fullStr Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
title_full_unstemmed Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
title_short Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
title_sort powderworld a platform for understanding generalization via rich task distributions
url https://hdl.handle.net/1721.1/151635
work_keys_str_mv AT franskevin powderworldaplatformforunderstandinggeneralizationviarichtaskdistributions