Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.

Behavioral evidence suggests that instrumental conditioning is governed by two forms of action control: a goal-directed and a habit learning process. Model-based reinforcement learning (RL) has been argued to underlie the goal-directed process; however, the way in which it interacts with habits and...

Full description

Bibliographic Details
Main Authors: Amir Dezfouli, Bernard W Balleine
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC3854489?pdf=render
_version_ 1818413940622753792
author Amir Dezfouli
Bernard W Balleine
author_facet Amir Dezfouli
Bernard W Balleine
author_sort Amir Dezfouli
collection DOAJ
description Behavioral evidence suggests that instrumental conditioning is governed by two forms of action control: a goal-directed and a habit learning process. Model-based reinforcement learning (RL) has been argued to underlie the goal-directed process; however, the way in which it interacts with habits and the structure of the habitual process has remained unclear. According to a flat architecture, the habitual process corresponds to model-free RL, and its interaction with the goal-directed process is coordinated by an external arbitration mechanism. Alternatively, the interaction between these systems has recently been argued to be hierarchical, such that the formation of action sequences underlies habit learning and a goal-directed process selects between goal-directed actions and habitual sequences of actions to reach the goal. Here we used a two-stage decision-making task to test predictions from these accounts. The hierarchical account predicts that, because they are tied to each other as an action sequence, selecting a habitual action in the first stage will be followed by a habitual action in the second stage, whereas the flat account predicts that the statuses of the first and second stage actions are independent of each other. We found, based on subjects' choices and reaction times, that human subjects combined single actions to build action sequences and that the formation of such action sequences was sufficient to explain habitual actions. Furthermore, based on Bayesian model comparison, a family of hierarchical RL models, assuming a hierarchical interaction between habit and goal-directed processes, provided a better fit of the subjects' behavior than a family of flat models. Although these findings do not rule out all possible model-free accounts of instrumental conditioning, they do show such accounts are not necessary to explain habitual actions and provide a new basis for understanding how goal-directed and habitual action control interact.
first_indexed 2024-12-14T11:11:11Z
format Article
id doaj.art-85567bd5c3f14f7fba10b9faeb338a81
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-14T11:11:11Z
publishDate 2013-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-85567bd5c3f14f7fba10b9faeb338a812022-12-21T23:04:16ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582013-01-01912e100336410.1371/journal.pcbi.1003364Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.Amir DezfouliBernard W BalleineBehavioral evidence suggests that instrumental conditioning is governed by two forms of action control: a goal-directed and a habit learning process. Model-based reinforcement learning (RL) has been argued to underlie the goal-directed process; however, the way in which it interacts with habits and the structure of the habitual process has remained unclear. According to a flat architecture, the habitual process corresponds to model-free RL, and its interaction with the goal-directed process is coordinated by an external arbitration mechanism. Alternatively, the interaction between these systems has recently been argued to be hierarchical, such that the formation of action sequences underlies habit learning and a goal-directed process selects between goal-directed actions and habitual sequences of actions to reach the goal. Here we used a two-stage decision-making task to test predictions from these accounts. The hierarchical account predicts that, because they are tied to each other as an action sequence, selecting a habitual action in the first stage will be followed by a habitual action in the second stage, whereas the flat account predicts that the statuses of the first and second stage actions are independent of each other. We found, based on subjects' choices and reaction times, that human subjects combined single actions to build action sequences and that the formation of such action sequences was sufficient to explain habitual actions. Furthermore, based on Bayesian model comparison, a family of hierarchical RL models, assuming a hierarchical interaction between habit and goal-directed processes, provided a better fit of the subjects' behavior than a family of flat models. Although these findings do not rule out all possible model-free accounts of instrumental conditioning, they do show such accounts are not necessary to explain habitual actions and provide a new basis for understanding how goal-directed and habitual action control interact.http://europepmc.org/articles/PMC3854489?pdf=render
spellingShingle Amir Dezfouli
Bernard W Balleine
Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.
PLoS Computational Biology
title Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.
title_full Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.
title_fullStr Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.
title_full_unstemmed Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.
title_short Actions, action sequences and habits: evidence that goal-directed and habitual action control are hierarchically organized.
title_sort actions action sequences and habits evidence that goal directed and habitual action control are hierarchically organized
url http://europepmc.org/articles/PMC3854489?pdf=render
work_keys_str_mv AT amirdezfouli actionsactionsequencesandhabitsevidencethatgoaldirectedandhabitualactioncontrolarehierarchicallyorganized
AT bernardwballeine actionsactionsequencesandhabitsevidencethatgoaldirectedandhabitualactioncontrolarehierarchicallyorganized