Robust anytime learning of Markov decision processes
Markov decision processes (MDPs) are formal models commonly used in sequential decision-making. MDPs capture the stochasticity that may arise, for instance, from imprecise actuators via probabilities in the transition function. However, in data-driven applications, deriving precise probabilities fro...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
Curran Associates
2023
|