On the complexity of value iteration
<p>Value iteration is a fundamental algorithm for solving Markov Decision Processes (MDPs). It computes the maximal <em>n</em>-step payoff by iterating a recurrence equation naturally associated with the MDP <em>n</em> times. At the same time, value iteration pro...
Main authors: | , , , , |
---|---|
Format: | Conference item |
Published: | Schloss Dagstuhl, 2019 |