On the complexity of value iteration

Value iteration is a fundamental algorithm for solving Markov Decision Processes (MDPs). It computes the maximal n-step payoff by iterating n times a recurrence equation which is naturally associated to the MDP. At the same time, value iteration pro...

Ամբողջական նկարագրություն

Մատենագիտական մանրամասներ
Հիմնական հեղինակներ:	Balaji, N, Kiefer, S, Novotny, P, Perez, G, Shirmohammadi, M
Ձևաչափ:	Conference item
Հրապարակվել է:	Schloss Dagstuhl 2019

On the complexity of value iteration

Նմանատիպ նյութեր