Robust anytime learning of Markov decision processes

Markov decision processes (MDPs) are formal models commonly used in sequential decision-making. MDPs capture the stochasticity that may arise, for instance, from imprecise actuators via probabilities in the transition function. However, in data-driven applications, deriving precise probabilities fro...

Full description

Bibliographic Details
Main Authors: Suilen, M, Simão, TD, Jansen, N, Parker, D
Format: Conference item
Language:English
Published: Curran Associates 2023