Implicit Value Updating Explains Transitive Inference Performance: The Betasort Model.

Transitive inference (the ability to infer that B > D given that B > C and C > D) is a widespread characteristic of serial learning, observed in dozens of species. Despite these robust behavioral effects, reinforcement learning models reliant on reward prediction error or associative streng...

Bibliographic Details
Main Authors:	Greg Jensen, Fabian Muñoz, Yelda Alkan, Vincent P Ferrera, Herbert S Terrace
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2015-01-01
Series:	PLoS Computational Biology
Online Access:	http://europepmc.org/articles/PMC4583549?pdf=render

collection	DOAJ
description	Transitive inference (the ability to infer that B > D given that B > C and C > D) is a widespread characteristic of serial learning, observed in dozens of species. Despite these robust behavioral effects, reinforcement learning models reliant on reward prediction error or associative strength routinely fail to perform these inferences. We propose an algorithm called betasort, inspired by cognitive processes, which performs transitive inference at low computational cost. This is accomplished by (1) representing stimulus positions along a unit span using beta distributions, (2) treating positive and negative feedback asymmetrically, and (3) updating the position of every stimulus during every trial, whether that stimulus was visible or not. Performance was compared for rhesus macaques, humans, and the betasort algorithm, as well as Q-learning, an established reward prediction error (RPE) model. Of these, only Q-learning failed to respond above chance during critical test trials. Betasort's success (when compared to RPE models) and its computational efficiency (when compared to full Markov decision process implementations) suggest that the study of reinforcement learning in organisms will be best served by a feature-driven approach to comparing formal models.
first_indexed	2024-12-20T12:10:48Z
id	doaj.art-8ac934abb3ae4b62bc89af239dcb9b0d
institution	Directory of Open Access Journals
issn	1553-734X 1553-7358
last_indexed	2024-12-20T12:10:48Z
spelling	doaj.art-8ac934abb3ae4b62bc89af239dcb9b0d; 2022-12-21T19:41:14Z; eng; Public Library of Science (PLoS); PLoS Computational Biology; 1553-734X; 1553-7358; 2015-01-01; 11; 9; e1004523; 10.1371/journal.pcbi.1004523; Implicit Value Updating Explains Transitive Inference Performance: The Betasort Model.; Greg Jensen; Fabian Muñoz; Yelda Alkan; Vincent P Ferrera; Herbert S Terrace; http://europepmc.org/articles/PMC4583549?pdf=render
title	Implicit Value Updating Explains Transitive Inference Performance: The Betasort Model.
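
The three features listed in the description (beta-distributed positions on a unit span, asymmetric handling of positive and negative feedback, and updating every stimulus on every trial) can be illustrated with a short Python sketch. This is a simplified reading of the abstract, not the authors' published implementation: the class name `BetasortSketch`, the `train` helper, the specific update amounts, and the absence of any memory-decay or noise parameters are all assumptions made for illustration.

```python
import random


class BetasortSketch:
    """Illustrative only: a simplified reading of the abstract, not the published model."""

    def __init__(self, stimuli):
        # Each stimulus position is modeled as Beta(upper + 1, lower + 1) on [0, 1].
        self.upper = {s: 0.0 for s in stimuli}
        self.lower = {s: 0.0 for s in stimuli}

    def mean(self, s):
        # Expected position of stimulus s under its beta distribution.
        return (self.upper[s] + 1.0) / (self.upper[s] + self.lower[s] + 2.0)

    def choose(self, a, b, rng=random):
        # Draw one sample per visible stimulus and pick the larger draw.
        xa = rng.betavariate(self.upper[a] + 1.0, self.lower[a] + 1.0)
        xb = rng.betavariate(self.upper[b] + 1.0, self.lower[b] + 1.0)
        return a if xa >= xb else b

    def _consolidate(self, s, v):
        # Add one unit of evidence split by the current mean v:
        # the mean stays put while the distribution narrows.
        self.upper[s] += v
        self.lower[s] += 1.0 - v

    def update(self, chosen, other, rewarded):
        # Snapshot the means before changing anything.
        means = {s: self.mean(s) for s in self.upper}
        if rewarded:
            # Positive feedback (assumed rule): reinforce all current positions.
            for s, v in means.items():
                self._consolidate(s, v)
            return
        # Negative feedback (assumed rule): push the chosen stimulus down
        # and the unchosen stimulus up.
        self.lower[chosen] += 1.0
        self.upper[other] += 1.0
        lo, hi = sorted((means[chosen], means[other]))
        # Stimuli that were not visible are updated too.
        for s, v in means.items():
            if s in (chosen, other):
                continue
            if v > hi:
                self.upper[s] += 1.0
            elif v < lo:
                self.lower[s] += 1.0
            else:
                self._consolidate(s, v)


def train(model, order, n_trials, rng):
    # order lists stimuli from highest to lowest rank; only adjacent pairs are shown.
    for _ in range(n_trials):
        i = rng.randrange(len(order) - 1)
        correct, wrong = order[i], order[i + 1]
        chosen = model.choose(correct, wrong, rng)
        other = wrong if chosen == correct else correct
        model.update(chosen, other, rewarded=(chosen == correct))
```

A possible use is to call `train(model, "ABCDE", 200, random.Random(0))` on a fresh `BetasortSketch("ABCDE")` and then compare `model.mean("B")` with `model.mean("D")`, a pair never presented together during training. Whether the sketch orders that pair correctly depends on the assumed update rules and on the random draws.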

Similar Items