MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES

We study and provide efficient algorithms for multi-objective model checking problems for Markov Decision Processes (MDPs). Given an MDP, M, and given multiple linear-time (ω-regular or LTL) properties φi, and probabilities ri ∈ [0,1], i=1,...,k, we ask whether there exists a strategy σ for the cont...

Full description

Bibliographic Details
Main Authors: Etessami, K, Kwiatkowska, M, Vardi, M, Yannakakis, M
Format: Journal article
Language:English
Published: 2008
_version_ 1826258727277166592
author Etessami, K
Kwiatkowska, M
Vardi, M
Yannakakis, M
author_facet Etessami, K
Kwiatkowska, M
Vardi, M
Yannakakis, M
author_sort Etessami, K
collection OXFORD
description We study and provide efficient algorithms for multi-objective model checking problems for Markov Decision Processes (MDPs). Given an MDP, M, and given multiple linear-time (ω-regular or LTL) properties φi, and probabilities ri ∈ [0,1], i=1,...,k, we ask whether there exists a strategy σ for the controller such that, for all i, the probability that a trajectory of M controlled by σ satisfies φi is at least ri. We provide an algorithm that decides whether there exists such a strategy and if so produces it, and which runs in time polynomial in the size of the MDP. Such a strategy may require the use of both randomization and memory. We also consider more general multi-objective ω-regular queries, which we motivate with an application to assume-guarantee compositional reasoning for probabilistic systems. Note that there can be trade-offs between different properties: satisfying property φ1 with high probability may necessitate satisfying φ2 with low probability. Viewing this as a multi-objective optimization problem, we want information about the "trade-off curve" or Pareto curve for maximizing the probabilities of different properties. We show that one can compute an approximate Pareto curve with respect to a set of ω-regular properties in time polynomial in the size of the MDP. Our quantitative upper bounds use LP methods. We also study qualitative multi-objective model checking problems, and we show that these can be analysed by purely graph-theoretic methods, even though the strategies may still require both randomization and memory. © K. Etessami, M. Kwiatkowska, M. Y. Vardi, and M. Yannakakis.
first_indexed 2024-03-06T18:38:32Z
format Journal article
id oxford-uuid:0c1f87a1-3b54-4267-89c0-2fb91afc46cf
institution University of Oxford
language English
last_indexed 2024-03-06T18:38:32Z
publishDate 2008
record_format dspace
spelling oxford-uuid:0c1f87a1-3b54-4267-89c0-2fb91afc46cf2022-03-26T09:33:09ZMULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSESJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:0c1f87a1-3b54-4267-89c0-2fb91afc46cfEnglishSymplectic Elements at Oxford2008Etessami, KKwiatkowska, MVardi, MYannakakis, MWe study and provide efficient algorithms for multi-objective model checking problems for Markov Decision Processes (MDPs). Given an MDP, M, and given multiple linear-time (ω-regular or LTL) properties φi, and probabilities ri ∈ [0,1], i=1,...,k, we ask whether there exists a strategy σ for the controller such that, for all i, the probability that a trajectory of M controlled by σ satisfies φi is at least ri. We provide an algorithm that decides whether there exists such a strategy and if so produces it, and which runs in time polynomial in the size of the MDP. Such a strategy may require the use of both randomization and memory. We also consider more general multi-objective ω-regular queries, which we motivate with an application to assume-guarantee compositional reasoning for probabilistic systems. Note that there can be trade-offs between different properties: satisfying property φ1 with high probability may necessitate satisfying φ2 with low probability. Viewing this as a multi-objective optimization problem, we want information about the "trade-off curve" or Pareto curve for maximizing the probabilities of different properties. We show that one can compute an approximate Pareto curve with respect to a set of ω-regular properties in time polynomial in the size of the MDP. Our quantitative upper bounds use LP methods. We also study qualitative multi-objective model checking problems, and we show that these can be analysed by purely graph-theoretic methods, even though the strategies may still require both randomization and memory. © K. Etessami, M. Kwiatkowska, M. Y. Vardi, and M. Yannakakis.
spellingShingle Etessami, K
Kwiatkowska, M
Vardi, M
Yannakakis, M
MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES
title MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES
title_full MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES
title_fullStr MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES
title_full_unstemmed MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES
title_short MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES
title_sort multi objective model checking of markov decision processes
work_keys_str_mv AT etessamik multiobjectivemodelcheckingofmarkovdecisionprocesses
AT kwiatkowskam multiobjectivemodelcheckingofmarkovdecisionprocesses
AT vardim multiobjectivemodelcheckingofmarkovdecisionprocesses
AT yannakakism multiobjectivemodelcheckingofmarkovdecisionprocesses