Learning‐based control for discrete‐time constrained nonzero‐sum games

Abstract A generalized policy‐iteration‐based solution to a class of discrete‐time multi‐player non‐zero‐sum games concerning the control constraints was proposed. Based on initial admissible control policies, the iterative value function of each player converges to the optimum approximately, which...

Full description

Bibliographic Details
Main Authors: Chaoxu Mu, Jiangwen Peng, Yufei Tang
Format: Article
Language:English
Published: Wiley 2021-06-01
Series:CAAI Transactions on Intelligence Technology
Subjects:
Online Access:https://doi.org/10.1049/cit2.12015