HSVI-based online minimax strategies for partially observable stochastic games with neural perception mechanisms

We consider a variant of continuous-state partially-observable stochastic games with neural perception mechanisms and an asymmetric information structure. One agent has partial information, with the observation function implemented as a neural network, while the other agent is assumed to have full k...


Bibliographic details
Main creators: Yan, R, Santos, G, Norman, G, Parker, D, Kwiatkowska, M
Format: Conference item
Language: English
Published / Created: Journal of Machine Learning Research 2024