Trace refinement in labelled Markov decision processes

Given two labelled Markov decision processes (MDPs), the trace-refinement problem asks whether for all strategies of the first MDP there exists a strategy of the second MDP such that the induced labelled Markov chains are trace-equivalent. We show that this problem is decidable in polynomial time if...

وصف كامل

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	Fijalkow, N, Kiefer, S, Shirmohammadi, M
التنسيق:	Journal article
اللغة:	English
منشور في:	Logical Methods in Computer Science 2020

_version_	1826283611454701568
author	Fijalkow, N Kiefer, S Shirmohammadi, M
author_facet	Fijalkow, N Kiefer, S Shirmohammadi, M
author_sort	Fijalkow, N
collection	OXFORD
description	Given two labelled Markov decision processes (MDPs), the trace-refinement problem asks whether for all strategies of the first MDP there exists a strategy of the second MDP such that the induced labelled Markov chains are trace-equivalent. We show that this problem is decidable in polynomial time if the second MDP is a Markov chain. The algorithm is based on new results on a particular notion of bisimulation between distributions over the states. However, we show that the general trace-refinement problem is undecidable, even if the first MDP is a Markov chain. Decidability of those problems was stated as open in 2008. We further study the decidability and complexity of the trace-refinement problem provided that the strategies are restricted to be memoryless.
first_indexed	2024-03-07T01:01:28Z
format	Journal article
id	oxford-uuid:89e1915f-bd6e-4281-971b-6133f078a040
institution	University of Oxford
language	English
last_indexed	2024-03-07T01:01:28Z
publishDate	2020
publisher	Logical Methods in Computer Science
record_format	dspace
spelling	oxford-uuid:89e1915f-bd6e-4281-971b-6133f078a0402022-03-26T22:27:33ZTrace refinement in labelled Markov decision processesJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:89e1915f-bd6e-4281-971b-6133f078a040EnglishSymplectic ElementsLogical Methods in Computer Science2020Fijalkow, NKiefer, SShirmohammadi, MGiven two labelled Markov decision processes (MDPs), the trace-refinement problem asks whether for all strategies of the first MDP there exists a strategy of the second MDP such that the induced labelled Markov chains are trace-equivalent. We show that this problem is decidable in polynomial time if the second MDP is a Markov chain. The algorithm is based on new results on a particular notion of bisimulation between distributions over the states. However, we show that the general trace-refinement problem is undecidable, even if the first MDP is a Markov chain. Decidability of those problems was stated as open in 2008. We further study the decidability and complexity of the trace-refinement problem provided that the strategies are restricted to be memoryless.
spellingShingle	Fijalkow, N Kiefer, S Shirmohammadi, M Trace refinement in labelled Markov decision processes
title	Trace refinement in labelled Markov decision processes
title_full	Trace refinement in labelled Markov decision processes
title_fullStr	Trace refinement in labelled Markov decision processes
title_full_unstemmed	Trace refinement in labelled Markov decision processes
title_short	Trace refinement in labelled Markov decision processes
title_sort	trace refinement in labelled markov decision processes
work_keys_str_mv	AT fijalkown tracerefinementinlabelledmarkovdecisionprocesses AT kiefers tracerefinementinlabelledmarkovdecisionprocesses AT shirmohammadim tracerefinementinlabelledmarkovdecisionprocesses

Trace refinement in labelled Markov decision processes

مواد مشابهة