A policy iteration algorithm for nonzero-sum stochastic impulse games

This work presents a novel policy iteration algorithm to tackle nonzero-sum stochastic impulse games arising naturally in many applications. Despite the obvious impact of solving such problems, there are no suitable numerical methods available, to the best of our knowledge. Our method relies on the...

Full description

Bibliographic Details
Main Authors: Aïd René, Bernal Francisco, Mnif Mohamed, Zabaljauregui Diego, Zubelli Jorge P.
Format: Article
Language:English
Published: EDP Sciences 2019-01-01
Series:ESAIM: Proceedings and Surveys
Subjects:
Online Access:https://www.esaim-proc.org/articles/proc/pdf/2019/01/proc196502.pdf
_version_ 1828081211189231616
author Aïd René
Bernal Francisco
Mnif Mohamed
Zabaljauregui Diego
Zubelli Jorge P.
author_facet Aïd René
Bernal Francisco
Mnif Mohamed
Zabaljauregui Diego
Zubelli Jorge P.
author_sort Aïd René
collection DOAJ
description This work presents a novel policy iteration algorithm to tackle nonzero-sum stochastic impulse games arising naturally in many applications. Despite the obvious impact of solving such problems, there are no suitable numerical methods available, to the best of our knowledge. Our method relies on the recently introduced characterisation of the value functions and Nash equilibrium via a system of quasi-variational inequalities. While our algorithm is heuristic and we do not provide a convergence analysis, numerical tests show that it performs convincingly in a wide range of situations, including the only analytically solvable example available in the literature at the time of writing.
first_indexed 2024-04-11T03:30:41Z
format Article
id doaj.art-e9fa093cff344fea9e50d563b80fe53d
institution Directory Open Access Journal
issn 2267-3059
language English
last_indexed 2024-04-11T03:30:41Z
publishDate 2019-01-01
publisher EDP Sciences
record_format Article
series ESAIM: Proceedings and Surveys
spelling doaj.art-e9fa093cff344fea9e50d563b80fe53d2023-01-02T06:33:57ZengEDP SciencesESAIM: Proceedings and Surveys2267-30592019-01-0165274510.1051/proc/201965027proc196502A policy iteration algorithm for nonzero-sum stochastic impulse gamesAïd RenéBernal FranciscoMnif MohamedZabaljauregui DiegoZubelli Jorge P.This work presents a novel policy iteration algorithm to tackle nonzero-sum stochastic impulse games arising naturally in many applications. Despite the obvious impact of solving such problems, there are no suitable numerical methods available, to the best of our knowledge. Our method relies on the recently introduced characterisation of the value functions and Nash equilibrium via a system of quasi-variational inequalities. While our algorithm is heuristic and we do not provide a convergence analysis, numerical tests show that it performs convincingly in a wide range of situations, including the only analytically solvable example available in the literature at the time of writing.https://www.esaim-proc.org/articles/proc/pdf/2019/01/proc196502.pdfstochastic impulse gamenonzero-sum gamenash equilibriumpolicy iterationhoward’s algorithmquasi-variational inequality
spellingShingle Aïd René
Bernal Francisco
Mnif Mohamed
Zabaljauregui Diego
Zubelli Jorge P.
A policy iteration algorithm for nonzero-sum stochastic impulse games
ESAIM: Proceedings and Surveys
stochastic impulse game
nonzero-sum game
nash equilibrium
policy iteration
howard’s algorithm
quasi-variational inequality
title A policy iteration algorithm for nonzero-sum stochastic impulse games
title_full A policy iteration algorithm for nonzero-sum stochastic impulse games
title_fullStr A policy iteration algorithm for nonzero-sum stochastic impulse games
title_full_unstemmed A policy iteration algorithm for nonzero-sum stochastic impulse games
title_short A policy iteration algorithm for nonzero-sum stochastic impulse games
title_sort policy iteration algorithm for nonzero sum stochastic impulse games
topic stochastic impulse game
nonzero-sum game
nash equilibrium
policy iteration
howard’s algorithm
quasi-variational inequality
url https://www.esaim-proc.org/articles/proc/pdf/2019/01/proc196502.pdf
work_keys_str_mv AT aidrene apolicyiterationalgorithmfornonzerosumstochasticimpulsegames
AT bernalfrancisco apolicyiterationalgorithmfornonzerosumstochasticimpulsegames
AT mnifmohamed apolicyiterationalgorithmfornonzerosumstochasticimpulsegames
AT zabaljaureguidiego apolicyiterationalgorithmfornonzerosumstochasticimpulsegames
AT zubellijorgep apolicyiterationalgorithmfornonzerosumstochasticimpulsegames
AT aidrene policyiterationalgorithmfornonzerosumstochasticimpulsegames
AT bernalfrancisco policyiterationalgorithmfornonzerosumstochasticimpulsegames
AT mnifmohamed policyiterationalgorithmfornonzerosumstochasticimpulsegames
AT zabaljaureguidiego policyiterationalgorithmfornonzerosumstochasticimpulsegames
AT zubellijorgep policyiterationalgorithmfornonzerosumstochasticimpulsegames