Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients

This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an ac...

Full description

Bibliographic Details
Main Authors: Yen Yu, Acer Y. C. Chang, Ryota Kanai
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-01-01
Series:Frontiers in Neurorobotics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fnbot.2018.00088/full
_version_ 1818386502139248640
author Yen Yu
Acer Y. C. Chang
Ryota Kanai
author_facet Yen Yu
Acer Y. C. Chang
Ryota Kanai
author_sort Yen Yu
collection DOAJ
description This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an action serves to disclose outcomes that have intrinsic meaningfulness to an agent itself. This motivated two central algorithmic ingredients: devaluation and devaluation progress, both underpin agent's cognition concerning intrinsically generated rewards. The two serve as an instantiation of homeostatic and heterostatic intrinsic motivation. A key insight from our algorithm is that the two seemingly opposite motivations can be reconciled—without which exploration and information-gathering cannot be effectively carried out. We supported this claim with empirical evidence, showing that boredom-enabled agents consistently outperformed other curious or explorative agent variants in model building benchmarks based on self-assisted experience accumulation.
first_indexed 2024-12-14T03:55:04Z
format Article
id doaj.art-5ab31f5070b64be986ee3787c6ecac87
institution Directory Open Access Journal
issn 1662-5218
language English
last_indexed 2024-12-14T03:55:04Z
publishDate 2019-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Neurorobotics
spelling doaj.art-5ab31f5070b64be986ee3787c6ecac872022-12-21T23:18:07ZengFrontiers Media S.A.Frontiers in Neurorobotics1662-52182019-01-011210.3389/fnbot.2018.00088391132Boredom-Driven Curious Learning by Homeo-Heterostatic Value GradientsYen YuAcer Y. C. ChangRyota KanaiThis paper presents the Homeo-Heterostatic Value Gradients (HHVG) algorithm as a formal account on the constructive interplay between boredom and curiosity which gives rise to effective exploration and superior forward model learning. We offer an instrumental view of action selection, in which an action serves to disclose outcomes that have intrinsic meaningfulness to an agent itself. This motivated two central algorithmic ingredients: devaluation and devaluation progress, both underpin agent's cognition concerning intrinsically generated rewards. The two serve as an instantiation of homeostatic and heterostatic intrinsic motivation. A key insight from our algorithm is that the two seemingly opposite motivations can be reconciled—without which exploration and information-gathering cannot be effectively carried out. We supported this claim with empirical evidence, showing that boredom-enabled agents consistently outperformed other curious or explorative agent variants in model building benchmarks based on self-assisted experience accumulation.https://www.frontiersin.org/article/10.3389/fnbot.2018.00088/fullcuriosityboredomgoal-directednessintrinsic motivationoutcome devaluationsatiety
spellingShingle Yen Yu
Acer Y. C. Chang
Ryota Kanai
Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
Frontiers in Neurorobotics
curiosity
boredom
goal-directedness
intrinsic motivation
outcome devaluation
satiety
title Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_full Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_fullStr Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_full_unstemmed Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_short Boredom-Driven Curious Learning by Homeo-Heterostatic Value Gradients
title_sort boredom driven curious learning by homeo heterostatic value gradients
topic curiosity
boredom
goal-directedness
intrinsic motivation
outcome devaluation
satiety
url https://www.frontiersin.org/article/10.3389/fnbot.2018.00088/full
work_keys_str_mv AT yenyu boredomdrivencuriouslearningbyhomeoheterostaticvaluegradients
AT acerycchang boredomdrivencuriouslearningbyhomeoheterostaticvaluegradients
AT ryotakanai boredomdrivencuriouslearningbyhomeoheterostaticvaluegradients