Toward an artificial intelligence physicist for unsupervised learning

© 2019 American Physical Society. We investigate opportunities and challenges for improving unsupervised machine learning using four common strategies with a long history in physics: divide and conquer, Occam's razor, unification, and lifelong learning. Instead of using one model to learn every...

Full description

Bibliographic Details
Main Authors:	Wu, Tailin, Tegmark, Max
Other Authors:	Massachusetts Institute of Technology. Department of Physics
Format:	Article
Language:	English
Published:	American Physical Society (APS) 2021
Online Access:	https://hdl.handle.net/1721.1/136247

_version_	1811075051787649024
author	Wu, Tailin Tegmark, Max
author2	Massachusetts Institute of Technology. Department of Physics
author_facet	Massachusetts Institute of Technology. Department of Physics Wu, Tailin Tegmark, Max
author_sort	Wu, Tailin
collection	MIT
description	© 2019 American Physical Society. We investigate opportunities and challenges for improving unsupervised machine learning using four common strategies with a long history in physics: divide and conquer, Occam's razor, unification, and lifelong learning. Instead of using one model to learn everything, we propose a paradigm centered around the learning and manipulation of theories, which parsimoniously predict both aspects of the future (from past observations) and the domain in which these predictions are accurate. Specifically, we propose a generalized mean loss to encourage each theory to specialize in its comparatively advantageous domain, and a differentiable description length objective to downweight bad data and "snap" learned theories into simple symbolic formulas. Theories are stored in a "theory hub," which continuously unifies learned theories and can propose theories when encountering new environments. We test our implementation, the toy "artificial intelligence physicist" learning agent, on a suite of increasingly complex physics environments. From unsupervised observation of trajectories through worlds involving random combinations of gravity, electromagnetism, harmonic motion, and elastic bounces, our agent typically learns faster and produces mean-squared prediction errors about a billion times smaller than a standard feedforward neural net of comparable complexity, typically recovering integer and rational theory parameters exactly. Our agent successfully identifies domains with different laws of motion also for a nonlinear chaotic double pendulum in a piecewise constant force field.
first_indexed	2024-09-23T09:59:52Z
format	Article
id	mit-1721.1/136247
institution	Massachusetts Institute of Technology
language	English
last_indexed	2024-09-23T09:59:52Z
publishDate	2021
publisher	American Physical Society (APS)
record_format	dspace
spelling	mit-1721.1/1362472023-09-26T20:13:40Z Toward an artificial intelligence physicist for unsupervised learning Wu, Tailin Tegmark, Max Massachusetts Institute of Technology. Department of Physics Center for Brains, Minds, and Machines © 2019 American Physical Society. We investigate opportunities and challenges for improving unsupervised machine learning using four common strategies with a long history in physics: divide and conquer, Occam's razor, unification, and lifelong learning. Instead of using one model to learn everything, we propose a paradigm centered around the learning and manipulation of theories, which parsimoniously predict both aspects of the future (from past observations) and the domain in which these predictions are accurate. Specifically, we propose a generalized mean loss to encourage each theory to specialize in its comparatively advantageous domain, and a differentiable description length objective to downweight bad data and "snap" learned theories into simple symbolic formulas. Theories are stored in a "theory hub," which continuously unifies learned theories and can propose theories when encountering new environments. We test our implementation, the toy "artificial intelligence physicist" learning agent, on a suite of increasingly complex physics environments. From unsupervised observation of trajectories through worlds involving random combinations of gravity, electromagnetism, harmonic motion, and elastic bounces, our agent typically learns faster and produces mean-squared prediction errors about a billion times smaller than a standard feedforward neural net of comparable complexity, typically recovering integer and rational theory parameters exactly. Our agent successfully identifies domains with different laws of motion also for a nonlinear chaotic double pendulum in a piecewise constant force field. 2021-10-27T20:34:29Z 2021-10-27T20:34:29Z 2019 2021-07-08T17:48:38Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/136247 en 10.1103/PHYSREVE.100.033311 Physical Review E Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. application/pdf American Physical Society (APS) APS
spellingShingle	Wu, Tailin Tegmark, Max Toward an artificial intelligence physicist for unsupervised learning
title	Toward an artificial intelligence physicist for unsupervised learning
title_full	Toward an artificial intelligence physicist for unsupervised learning
title_fullStr	Toward an artificial intelligence physicist for unsupervised learning
title_full_unstemmed	Toward an artificial intelligence physicist for unsupervised learning
title_short	Toward an artificial intelligence physicist for unsupervised learning
title_sort	toward an artificial intelligence physicist for unsupervised learning
url	https://hdl.handle.net/1721.1/136247
work_keys_str_mv	AT wutailin towardanartificialintelligencephysicistforunsupervisedlearning AT tegmarkmax towardanartificialintelligencephysicistforunsupervisedlearning

Toward an artificial intelligence physicist for unsupervised learning

Similar Items