Planning for risk-aversion and expected value in MDPs

Planning in Markov decision processes (MDPs) typically optimises the expected cost. However, optimising the expectation does not consider the risk that for any given run of the MDP, the total cost received may be unacceptably high. An alternative approach is to find a policy which optimises a risk-a...

Full description

Bibliographic Details
Main Authors: Rigter, M, Duckworth, P, Lacerda, B, Hawes, N
Format: Conference item
Language:English
Published: Association for the Advancement of Artificial Intelligence 2022