Can reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivation

Reinforcement learning is a powerful model of animal learning in brief, controlled experimental conditions, but does not readily explain the development of behavior over an animal’s whole lifetime. In this paper, we describe a framework to address this shortcoming by introducing the single-life rein...

Full description

Bibliographic Details
Main Authors: Sandbrink, K, Christian, B, Nasvytis, L, Schroeder de Witt, C, Butlin, P
Format: Conference item
Language:English
Published: Cognitive Science Society 2024
_version_ 1811139580537077760
author Sandbrink, K
Christian, B
Nasvytis, L
Schroeder de Witt, C
Butlin, P
author_facet Sandbrink, K
Christian, B
Nasvytis, L
Schroeder de Witt, C
Butlin, P
author_sort Sandbrink, K
collection OXFORD
description Reinforcement learning is a powerful model of animal learning in brief, controlled experimental conditions, but does not readily explain the development of behavior over an animal’s whole lifetime. In this paper, we describe a framework to address this shortcoming by introducing the single-life reinforcement learning setting to cognitive science. We construct an agent with two learning systems: an extrinsic learner that learns within a single lifetime, and an intrinsic learner that learns across lifetimes, equipping the agent with intrinsic motivation. We show that this model outperforms heuristic benchmarks and recapitulates a transition from exploratory to habit-driven behavior, while allowing the agent to learn an interpretable value function. We formulate a precise definition of intrinsic motivation and discuss the philosophical implications of using reinforcement learning as a model of behavior in the real world
first_indexed 2024-09-25T04:08:21Z
format Conference item
id oxford-uuid:0ca8699b-a631-4b69-8ee0-78b9ba1d47dc
institution University of Oxford
language English
last_indexed 2024-09-25T04:08:21Z
publishDate 2024
publisher Cognitive Science Society
record_format dspace
spelling oxford-uuid:0ca8699b-a631-4b69-8ee0-78b9ba1d47dc2024-06-13T09:41:04ZCan reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivationConference itemhttp://purl.org/coar/resource_type/c_5794uuid:0ca8699b-a631-4b69-8ee0-78b9ba1d47dcEnglishSymplectic ElementsCognitive Science Society2024Sandbrink, KChristian, BNasvytis, LSchroeder de Witt, CButlin, PReinforcement learning is a powerful model of animal learning in brief, controlled experimental conditions, but does not readily explain the development of behavior over an animal’s whole lifetime. In this paper, we describe a framework to address this shortcoming by introducing the single-life reinforcement learning setting to cognitive science. We construct an agent with two learning systems: an extrinsic learner that learns within a single lifetime, and an intrinsic learner that learns across lifetimes, equipping the agent with intrinsic motivation. We show that this model outperforms heuristic benchmarks and recapitulates a transition from exploratory to habit-driven behavior, while allowing the agent to learn an interpretable value function. We formulate a precise definition of intrinsic motivation and discuss the philosophical implications of using reinforcement learning as a model of behavior in the real world
spellingShingle Sandbrink, K
Christian, B
Nasvytis, L
Schroeder de Witt, C
Butlin, P
Can reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivation
title Can reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivation
title_full Can reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivation
title_fullStr Can reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivation
title_full_unstemmed Can reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivation
title_short Can reinforcement learning model learning across development? Online lifelong learning through adaptive intrinsic motivation
title_sort can reinforcement learning model learning across development online lifelong learning through adaptive intrinsic motivation
work_keys_str_mv AT sandbrinkk canreinforcementlearningmodellearningacrossdevelopmentonlinelifelonglearningthroughadaptiveintrinsicmotivation
AT christianb canreinforcementlearningmodellearningacrossdevelopmentonlinelifelonglearningthroughadaptiveintrinsicmotivation
AT nasvytisl canreinforcementlearningmodellearningacrossdevelopmentonlinelifelonglearningthroughadaptiveintrinsicmotivation
AT schroederdewittc canreinforcementlearningmodellearningacrossdevelopmentonlinelifelonglearningthroughadaptiveintrinsicmotivation
AT butlinp canreinforcementlearningmodellearningacrossdevelopmentonlinelifelonglearningthroughadaptiveintrinsicmotivation