Mixtures of strategies underlie rodent behavior during reversal learning.

In reversal learning tasks, the behavior of humans and animals is often assumed to be uniform within single experimental sessions to facilitate data analysis and model fitting. However, behavior of agents can display substantial variability in single experimental sessions, as they execute different...

Full description

Bibliographic Details
Main Authors:	Nhat Minh Le, Murat Yildirim, Yizhi Wang, Hiroki Sugihara, Mehrdad Jazayeri, Mriganka Sur
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2023-09-01
Series:	PLoS Computational Biology
Online Access:	https://doi.org/10.1371/journal.pcbi.1011430

_version_	1797674081044135936
author	Nhat Minh Le Murat Yildirim Yizhi Wang Hiroki Sugihara Mehrdad Jazayeri Mriganka Sur
author_facet	Nhat Minh Le Murat Yildirim Yizhi Wang Hiroki Sugihara Mehrdad Jazayeri Mriganka Sur
author_sort	Nhat Minh Le
collection	DOAJ
description	In reversal learning tasks, the behavior of humans and animals is often assumed to be uniform within single experimental sessions to facilitate data analysis and model fitting. However, behavior of agents can display substantial variability in single experimental sessions, as they execute different blocks of trials with different transition dynamics. Here, we observed that in a deterministic reversal learning task, mice display noisy and sub-optimal choice transitions even at the expert stages of learning. We investigated two sources of the sub-optimality in the behavior. First, we found that mice exhibit a high lapse rate during task execution, as they reverted to unrewarded directions after choice transitions. Second, we unexpectedly found that a majority of mice did not execute a uniform strategy, but rather mixed between several behavioral modes with different transition dynamics. We quantified the use of such mixtures with a state-space model, block Hidden Markov Model (block HMM), to dissociate the mixtures of dynamic choice transitions in individual blocks of trials. Additionally, we found that blockHMM transition modes in rodent behavior can be accounted for by two different types of behavioral algorithms, model-free or inference-based learning, that might be used to solve the task. Combining these approaches, we found that mice used a mixture of both exploratory, model-free strategies and deterministic, inference-based behavior in the task, explaining their overall noisy choice sequences. Together, our combined computational approach highlights intrinsic sources of noise in rodent reversal learning behavior and provides a richer description of behavior than conventional techniques, while uncovering the hidden states that underlie the block-by-block transitions.
first_indexed	2024-03-11T21:54:57Z
format	Article
id	doaj.art-a9adafe9053d40a891d4518235310437
institution	Directory Open Access Journal
issn	1553-734X 1553-7358
language	English
last_indexed	2024-03-11T21:54:57Z
publishDate	2023-09-01
publisher	Public Library of Science (PLoS)
record_format	Article
series	PLoS Computational Biology
spelling	doaj.art-a9adafe9053d40a891d45182353104372023-09-26T05:30:46ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582023-09-01199e101143010.1371/journal.pcbi.1011430Mixtures of strategies underlie rodent behavior during reversal learning.Nhat Minh LeMurat YildirimYizhi WangHiroki SugiharaMehrdad JazayeriMriganka SurIn reversal learning tasks, the behavior of humans and animals is often assumed to be uniform within single experimental sessions to facilitate data analysis and model fitting. However, behavior of agents can display substantial variability in single experimental sessions, as they execute different blocks of trials with different transition dynamics. Here, we observed that in a deterministic reversal learning task, mice display noisy and sub-optimal choice transitions even at the expert stages of learning. We investigated two sources of the sub-optimality in the behavior. First, we found that mice exhibit a high lapse rate during task execution, as they reverted to unrewarded directions after choice transitions. Second, we unexpectedly found that a majority of mice did not execute a uniform strategy, but rather mixed between several behavioral modes with different transition dynamics. We quantified the use of such mixtures with a state-space model, block Hidden Markov Model (block HMM), to dissociate the mixtures of dynamic choice transitions in individual blocks of trials. Additionally, we found that blockHMM transition modes in rodent behavior can be accounted for by two different types of behavioral algorithms, model-free or inference-based learning, that might be used to solve the task. Combining these approaches, we found that mice used a mixture of both exploratory, model-free strategies and deterministic, inference-based behavior in the task, explaining their overall noisy choice sequences. Together, our combined computational approach highlights intrinsic sources of noise in rodent reversal learning behavior and provides a richer description of behavior than conventional techniques, while uncovering the hidden states that underlie the block-by-block transitions.https://doi.org/10.1371/journal.pcbi.1011430
spellingShingle	Nhat Minh Le Murat Yildirim Yizhi Wang Hiroki Sugihara Mehrdad Jazayeri Mriganka Sur Mixtures of strategies underlie rodent behavior during reversal learning. PLoS Computational Biology
title	Mixtures of strategies underlie rodent behavior during reversal learning.
title_full	Mixtures of strategies underlie rodent behavior during reversal learning.
title_fullStr	Mixtures of strategies underlie rodent behavior during reversal learning.
title_full_unstemmed	Mixtures of strategies underlie rodent behavior during reversal learning.
title_short	Mixtures of strategies underlie rodent behavior during reversal learning.
title_sort	mixtures of strategies underlie rodent behavior during reversal learning
url	https://doi.org/10.1371/journal.pcbi.1011430
work_keys_str_mv	AT nhatminhle mixturesofstrategiesunderlierodentbehaviorduringreversallearning AT muratyildirim mixturesofstrategiesunderlierodentbehaviorduringreversallearning AT yizhiwang mixturesofstrategiesunderlierodentbehaviorduringreversallearning AT hirokisugihara mixturesofstrategiesunderlierodentbehaviorduringreversallearning AT mehrdadjazayeri mixturesofstrategiesunderlierodentbehaviorduringreversallearning AT mrigankasur mixturesofstrategiesunderlierodentbehaviorduringreversallearning

Mixtures of strategies underlie rodent behavior during reversal learning.

Similar Items