Policy gradient methods find the Nash equilibrium in N-player general-sum linear-quadratic games

We consider a general-sum N-player linear-quadratic game with stochastic dynamics over a finite horizon and prove the global convergence of the natural policy gradient method to the Nash equilibrium. In order to prove convergence of the method we require a certain amount of noise in the system. We g...

Fuld beskrivelse

Bibliografiske detaljer
Main Authors:	Hambly, B, Xu, R, Yang, H
Format:	Journal article
Sprog:	English
Udgivet:	Journal of Machine Learning Research 2023

Policy gradient methods find the Nash equilibrium in N-player general-sum linear-quadratic games

Lignende værker