On the complexity of value iteration
<p>Value iteration is a fundamental algorithm for solving Markov Decision Processes (MDPs). It computes the maximal <em>n</em>-step payoff by applying, <em>n</em> times, a recurrence equation that is naturally associated with the MDP. At the same time, value iteration pro...
Main Authors: | , , , , |
---|---|
Format: | Conference item |
Published: | Schloss Dagstuhl, 2019 |
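
The recurrence mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes the common formulation v_0(s) = 0 and v_{k+1}(s) = max_a [ r(s, a) + Σ_t P(t | s, a) · v_k(t) ], and the function name `value_iteration`, the dictionary layout, and the two-state example MDP are all hypothetical choices made for illustration. Exact rationals are used so that the computed values are not affected by rounding.

```python
from fractions import Fraction


def value_iteration(states, actions, reward, trans, n):
    """Return the maximal n-step payoff from every state.

    Assumed (hypothetical) input layout:
      actions[s]      -> list of actions available in state s
      reward[(s, a)]  -> immediate reward for playing a in s
      trans[(s, a)]   -> dict mapping successor state t to P(t | s, a)
    """
    values = {s: Fraction(0) for s in states}  # v_0 is identically zero
    for _ in range(n):
        # One application of the recurrence: v_{k+1} from v_k.
        values = {
            s: max(
                reward[(s, a)]
                + sum(p * values[t] for t, p in trans[(s, a)].items())
                for a in actions[s]
            )
            for s in states
        }
    return values


# Hypothetical two-state MDP used only to exercise the sketch.
states = ["s0", "s1"]
actions = {"s0": ["stay", "go"], "s1": ["stay"]}
reward = {("s0", "stay"): 1, ("s0", "go"): 0, ("s1", "stay"): 2}
half = Fraction(1, 2)
trans = {
    ("s0", "stay"): {"s0": Fraction(1)},
    ("s0", "go"): {"s0": half, "s1": half},
    ("s1", "stay"): {"s1": Fraction(1)},
}

four_step = value_iteration(states, actions, reward, trans, 4)
```

In this toy instance, staying in `s0` is at least as good as moving for horizons up to 3, while at horizon 4 the action `go` becomes strictly better, so the best first move depends on the number of remaining steps.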