Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games

Neuro-symbolic approaches to artificial intelligence, which combine neural networks with classical symbolic techniques, are growing in prominence, necessitating formal approaches to reason about their correctness. We propose a novel modelling formalism called neuro-symbolic concurrent stochastic gam...

全面介绍

书目详细资料
Main Authors:	Yan, R, Santos, G, Gethin, N, Parker, D, Kwiatkowska, M
格式:	Journal article
语言:	English
出版:	Elsevier 2024

_version_	1826314327679827968
author	Yan, R Santos, G Gethin, N Parker, D Kwiatkowska, M
author_facet	Yan, R Santos, G Gethin, N Parker, D Kwiatkowska, M
author_sort	Yan, R
collection	OXFORD
description	Neuro-symbolic approaches to artificial intelligence, which combine neural networks with classical symbolic techniques, are growing in prominence, necessitating formal approaches to reason about their correctness. We propose a novel modelling formalism called neuro-symbolic concurrent stochastic games (NS-CSGs), which comprise two probabilistic finite-state agents interacting in a shared continuous-state environment. Each agent observes the environment using a neural perception mechanism, which converts inputs such as images into symbolic percepts, and makes decisions symbolically. We focus on the class of NS-CSGs with Borel state spaces and prove the existence and measurability of the value function for zero-sum discounted cumulative rewards under piecewise-constant restrictions. To compute values and synthesise strategies, we first introduce a Borel measurable piecewise-constant (B-PWC) representation of value functions and propose a B-PWC value iteration. Second, we introduce two novel representations for the value functions and strategies, and propose a minimax-action-free policy iteration based on alternating player choices.
first_indexed	2024-09-25T04:30:47Z
format	Journal article
id	oxford-uuid:ec3098e5-572d-450d-809b-6efdd6f65be7
institution	University of Oxford
language	English
last_indexed	2024-09-25T04:30:47Z
publishDate	2024
publisher	Elsevier
record_format	dspace
spelling	oxford-uuid:ec3098e5-572d-450d-809b-6efdd6f65be72024-08-27T14:52:51ZStrategy synthesis for zero-sum neuro-symbolic concurrent stochastic gamesJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:ec3098e5-572d-450d-809b-6efdd6f65be7EnglishSymplectic ElementsElsevier2024Yan, RSantos, GGethin, NParker, DKwiatkowska, MNeuro-symbolic approaches to artificial intelligence, which combine neural networks with classical symbolic techniques, are growing in prominence, necessitating formal approaches to reason about their correctness. We propose a novel modelling formalism called neuro-symbolic concurrent stochastic games (NS-CSGs), which comprise two probabilistic finite-state agents interacting in a shared continuous-state environment. Each agent observes the environment using a neural perception mechanism, which converts inputs such as images into symbolic percepts, and makes decisions symbolically. We focus on the class of NS-CSGs with Borel state spaces and prove the existence and measurability of the value function for zero-sum discounted cumulative rewards under piecewise-constant restrictions. To compute values and synthesise strategies, we first introduce a Borel measurable piecewise-constant (B-PWC) representation of value functions and propose a B-PWC value iteration. Second, we introduce two novel representations for the value functions and strategies, and propose a minimax-action-free policy iteration based on alternating player choices.
spellingShingle	Yan, R Santos, G Gethin, N Parker, D Kwiatkowska, M Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games
title	Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games
title_full	Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games
title_fullStr	Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games
title_full_unstemmed	Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games
title_short	Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games
title_sort	strategy synthesis for zero sum neuro symbolic concurrent stochastic games
work_keys_str_mv	AT yanr strategysynthesisforzerosumneurosymbolicconcurrentstochasticgames AT santosg strategysynthesisforzerosumneurosymbolicconcurrentstochasticgames AT gethinn strategysynthesisforzerosumneurosymbolicconcurrentstochasticgames AT parkerd strategysynthesisforzerosumneurosymbolicconcurrentstochasticgames AT kwiatkowskam strategysynthesisforzerosumneurosymbolicconcurrentstochasticgames

Strategy synthesis for zero-sum neuro-symbolic concurrent stochastic games

相似书籍