On the complexity of value iteration

<p>Value iteration is a fundamental algorithm for solving Markov Decision Processes (MDPs). It computes the maximal <em>n</em>-step payoff by iterating <em>n</em> times a recurrence equation which is naturally associated to the MDP. At the same time, value iteration pro...

Full description

Bibliographic Details
Main Authors: Balaji, N, Kiefer, S, Novotny, P, Perez, G, Shirmohammadi, M
Format: Conference item
Published: Schloss Dagstuhl 2019