Specification revision for Markov decision processes with optimal trade-off

Optimal control policy synthesis for probabilistic systems from high-level specifications is increasingly often studied. One major question that is commonly faced, however, is what to do when the optimal probability of achieving the specification is not satisfactory? We address this question by view...

Full description

Bibliographic Details
Main Authors: Lahijanian, M, Kwiatkowska, M
Format: Conference item
Published: Institute of Electrical and Electronics Engineers 2016
_version_ 1797079770488373248
author Lahijanian, M
Kwiatkowska, M
author_facet Lahijanian, M
Kwiatkowska, M
author_sort Lahijanian, M
collection OXFORD
description Optimal control policy synthesis for probabilistic systems from high-level specifications is increasingly often studied. One major question that is commonly faced, however, is what to do when the optimal probability of achieving the specification is not satisfactory? We address this question by viewing the specification as a soft constraint and present a synthesis framework for MDPs that encodes and automates specification revision in a trade-off for higher probability. The method uses co-safe LTL as the specification language and quantifies the revisions to the specification according to userdefined proposition costs. The framework computes a control policy that optimizes the trade-off between the probability of satisfaction and the cost of specification revision. The key idea of the method is a rule for the composition of the MDP, the automaton representing the specification, and the proposition costs such that all possible specification revisions along with their costs and probabilities of satisfaction are captured in one structure. The problem is then reduced to multi-objective optimization on an MDP. The power of the method is illustrated though simulations of a complex robotic scenario.
first_indexed 2024-03-07T00:50:31Z
format Conference item
id oxford-uuid:863e4a92-6a07-40bc-a7d8-dc3331718bf3
institution University of Oxford
last_indexed 2024-03-07T00:50:31Z
publishDate 2016
publisher Institute of Electrical and Electronics Engineers
record_format dspace
spelling oxford-uuid:863e4a92-6a07-40bc-a7d8-dc3331718bf32022-03-26T22:02:38ZSpecification revision for Markov decision processes with optimal trade-offConference itemhttp://purl.org/coar/resource_type/c_5794uuid:863e4a92-6a07-40bc-a7d8-dc3331718bf3Symplectic Elements at OxfordInstitute of Electrical and Electronics Engineers2016Lahijanian, MKwiatkowska, MOptimal control policy synthesis for probabilistic systems from high-level specifications is increasingly often studied. One major question that is commonly faced, however, is what to do when the optimal probability of achieving the specification is not satisfactory? We address this question by viewing the specification as a soft constraint and present a synthesis framework for MDPs that encodes and automates specification revision in a trade-off for higher probability. The method uses co-safe LTL as the specification language and quantifies the revisions to the specification according to userdefined proposition costs. The framework computes a control policy that optimizes the trade-off between the probability of satisfaction and the cost of specification revision. The key idea of the method is a rule for the composition of the MDP, the automaton representing the specification, and the proposition costs such that all possible specification revisions along with their costs and probabilities of satisfaction are captured in one structure. The problem is then reduced to multi-objective optimization on an MDP. The power of the method is illustrated though simulations of a complex robotic scenario.
spellingShingle Lahijanian, M
Kwiatkowska, M
Specification revision for Markov decision processes with optimal trade-off
title Specification revision for Markov decision processes with optimal trade-off
title_full Specification revision for Markov decision processes with optimal trade-off
title_fullStr Specification revision for Markov decision processes with optimal trade-off
title_full_unstemmed Specification revision for Markov decision processes with optimal trade-off
title_short Specification revision for Markov decision processes with optimal trade-off
title_sort specification revision for markov decision processes with optimal trade off
work_keys_str_mv AT lahijanianm specificationrevisionformarkovdecisionprocesseswithoptimaltradeoff
AT kwiatkowskam specificationrevisionformarkovdecisionprocesseswithoptimaltradeoff