Neural MMO: Massively Multiagent Simulation and Learning

Neural MMO is a massively multi-agent environment for reinforcement learning research. It is designed to push the boundaries of environment complexity while maintaining computationally efficiency for academic research. Agents in Neural MMO can forage for a variety of resources, engage in strategic c...

Full description

Bibliographic Details
Main Author: Suarez, Joseph
Other Authors: Isola, Phillip
Format: Thesis
Published: Massachusetts Institute of Technology 2024
Online Access:https://hdl.handle.net/1721.1/156586
_version_ 1811074201740640256
author Suarez, Joseph
author2 Isola, Phillip
author_facet Isola, Phillip
Suarez, Joseph
author_sort Suarez, Joseph
collection MIT
description Neural MMO is a massively multi-agent environment for reinforcement learning research. It is designed to push the boundaries of environment complexity while maintaining computationally efficiency for academic research. Agents in Neural MMO can forage for a variety of resources, engage in strategic combat with each other, defeat scripted enemies for loot, level-up various interdependent professions, acquire tools, weapons, equipment, etc., and exchange items on a global market. Neural MMO was among the first many-agent simulators for reinforcement learning research, and it is still unique among environments today. To my knowledge, no other project provides large agent populations, high per-agent complexity, and efficient simulation at once. These properties make Neural MMO a suitable environment for a variety of research topics in multi-agent learning that would be difficult to explore without such a simulator. The environment can process 128 agents at up to 25x real-time on a single CPU core, totaling 3,000 agent-steps per second. This speed is owed to simulation techniques borrowed from the games industry. In the course of developing Neural MMO, I made several adaptations for the specifics of reinforcement learning, such as the two-layer structure of Neural MMO's observations and actions and the efficient internal data representation. The contributions of this work include these adapted methods as general-purpose tools for designing RL environments. Through my own experiments and from the results of a series of competitions that I hosted on Neural MMO, we have seen agents capable of long-term coherent strategies, multi-tasking across various objectives, and conditioning for specific goals. The largest discovery of this project has been the extent to which standard reinforcement learning methods with limited compute are able to solve complex tasks. Neural MMO is free and open-source software under the MIT license with comprehensive documentation at neuralmmo.github.io and a 1000+ member community Discord.
first_indexed 2024-09-23T09:45:34Z
format Thesis
id mit-1721.1/156586
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T09:45:34Z
publishDate 2024
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1565862024-09-04T03:52:22Z Neural MMO: Massively Multiagent Simulation and Learning Suarez, Joseph Isola, Phillip Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Neural MMO is a massively multi-agent environment for reinforcement learning research. It is designed to push the boundaries of environment complexity while maintaining computationally efficiency for academic research. Agents in Neural MMO can forage for a variety of resources, engage in strategic combat with each other, defeat scripted enemies for loot, level-up various interdependent professions, acquire tools, weapons, equipment, etc., and exchange items on a global market. Neural MMO was among the first many-agent simulators for reinforcement learning research, and it is still unique among environments today. To my knowledge, no other project provides large agent populations, high per-agent complexity, and efficient simulation at once. These properties make Neural MMO a suitable environment for a variety of research topics in multi-agent learning that would be difficult to explore without such a simulator. The environment can process 128 agents at up to 25x real-time on a single CPU core, totaling 3,000 agent-steps per second. This speed is owed to simulation techniques borrowed from the games industry. In the course of developing Neural MMO, I made several adaptations for the specifics of reinforcement learning, such as the two-layer structure of Neural MMO's observations and actions and the efficient internal data representation. The contributions of this work include these adapted methods as general-purpose tools for designing RL environments. Through my own experiments and from the results of a series of competitions that I hosted on Neural MMO, we have seen agents capable of long-term coherent strategies, multi-tasking across various objectives, and conditioning for specific goals. The largest discovery of this project has been the extent to which standard reinforcement learning methods with limited compute are able to solve complex tasks. Neural MMO is free and open-source software under the MIT license with comprehensive documentation at neuralmmo.github.io and a 1000+ member community Discord. Ph.D. 2024-09-03T21:09:33Z 2024-09-03T21:09:33Z 2024-05 2024-07-10T13:02:12.861Z Thesis https://hdl.handle.net/1721.1/156586 0009-0004-5083-9303 Attribution 4.0 International (CC BY 4.0) Copyright retained by author(s) https://creativecommons.org/licenses/by/4.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Suarez, Joseph
Neural MMO: Massively Multiagent Simulation and Learning
title Neural MMO: Massively Multiagent Simulation and Learning
title_full Neural MMO: Massively Multiagent Simulation and Learning
title_fullStr Neural MMO: Massively Multiagent Simulation and Learning
title_full_unstemmed Neural MMO: Massively Multiagent Simulation and Learning
title_short Neural MMO: Massively Multiagent Simulation and Learning
title_sort neural mmo massively multiagent simulation and learning
url https://hdl.handle.net/1721.1/156586
work_keys_str_mv AT suarezjoseph neuralmmomassivelymultiagentsimulationandlearning