Robust Markov decision processes: a place where AI and formal methods meet

Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the restrictive assumption that the transition probabilities need to...

Full description

Bibliographic Details
Main Authors:	Suilen, M, Badings, T, Bovy, EM, Parker, D, Jansen, N
Format:	Book section
Language:	English
Published:	Springer 2024

_version_	1824458679590584320
author	Suilen, M Badings, T Bovy, EM Parker, D Jansen, N
author_facet	Suilen, M Badings, T Bovy, EM Parker, D Jansen, N
author_sort	Suilen, M
collection	OXFORD
description	Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the restrictive assumption that the transition probabilities need to be precisely known. Robust MDPs (RMDPs) overcome this assumption by instead defining the transition probabilities to belong to some uncertainty set. We present a gentle survey on RMDPs, providing a tutorial covering their fundamentals. In particular, we discuss RMDP semantics and how to solve them by extending standard MDP methods such as value iteration and policy iteration. We also discuss how RMDPs relate to other models and how they are used in several contexts, including reinforcement learning and abstraction techniques. We conclude with some challenges for future work on RMDPs.
first_indexed	2025-02-19T04:29:44Z
format	Book section
id	oxford-uuid:4be22ef5-3a3f-4344-b370-a1336a0a1dc3
institution	University of Oxford
language	English
last_indexed	2025-02-19T04:29:44Z
publishDate	2024
publisher	Springer
record_format	dspace
spelling	oxford-uuid:4be22ef5-3a3f-4344-b370-a1336a0a1dc32024-12-16T12:36:49ZRobust Markov decision processes: a place where AI and formal methods meetBook sectionhttp://purl.org/coar/resource_type/c_3248uuid:4be22ef5-3a3f-4344-b370-a1336a0a1dc3EnglishSymplectic ElementsSpringer2024Suilen, MBadings, TBovy, EMParker, DJansen, NMarkov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the restrictive assumption that the transition probabilities need to be precisely known. Robust MDPs (RMDPs) overcome this assumption by instead defining the transition probabilities to belong to some uncertainty set. We present a gentle survey on RMDPs, providing a tutorial covering their fundamentals. In particular, we discuss RMDP semantics and how to solve them by extending standard MDP methods such as value iteration and policy iteration. We also discuss how RMDPs relate to other models and how they are used in several contexts, including reinforcement learning and abstraction techniques. We conclude with some challenges for future work on RMDPs.
spellingShingle	Suilen, M Badings, T Bovy, EM Parker, D Jansen, N Robust Markov decision processes: a place where AI and formal methods meet
title	Robust Markov decision processes: a place where AI and formal methods meet
title_full	Robust Markov decision processes: a place where AI and formal methods meet
title_fullStr	Robust Markov decision processes: a place where AI and formal methods meet
title_full_unstemmed	Robust Markov decision processes: a place where AI and formal methods meet
title_short	Robust Markov decision processes: a place where AI and formal methods meet
title_sort	robust markov decision processes a place where ai and formal methods meet
work_keys_str_mv	AT suilenm robustmarkovdecisionprocessesaplacewhereaiandformalmethodsmeet AT badingst robustmarkovdecisionprocessesaplacewhereaiandformalmethodsmeet AT bovyem robustmarkovdecisionprocessesaplacewhereaiandformalmethodsmeet AT parkerd robustmarkovdecisionprocessesaplacewhereaiandformalmethodsmeet AT jansenn robustmarkovdecisionprocessesaplacewhereaiandformalmethodsmeet

Robust Markov decision processes: a place where AI and formal methods meet

Similar Items