HSVI-based online minimax strategies for partially observable stochastic games with neural perception mechanisms

We consider a variant of continuous-state partially observable stochastic games with neural perception mechanisms and an asymmetric information structure. One agent has partial information, with the observation function implemented as a neural network, while the other agent is assumed to have full k...

Bibliographic Details
Main Authors: Yan, R; Santos, G; Norman, G; Parker, D; Kwiatkowska, M
Format: Conference item
Language: English
Published: Journal of Machine Learning Research 2024