Büchi objectives in countable MDPs

<p>We study countably infinite Markov decision processes with Büchi objectives, which ask to visit a given subset F of states infinitely often. A question left open by T.P. Hill in 1979 [10] is whether there always exist ε-optimal Markov strategies, i.e., strategies that base decisions only on...

詳細記述

書誌詳細
主要な著者:	Kiefer, S, Mayr, R, Shirmohammadi, M, Totzke, P
フォーマット:	Conference item
出版事項:	Schloss Dagstuhl 2019

その他の書誌記述
要約:	<p>We study countably infinite Markov decision processes with Büchi objectives, which ask to visit a given subset F of states infinitely often. A question left open by T.P. Hill in 1979 [10] is whether there always exist ε-optimal Markov strategies, i.e., strategies that base decisions only on the current state and the number of steps taken so far. We provide a negative answer to this question by constructing a non-trivial counterexample. On the other hand, we show that Markov strategies with only 1 bit of extra memory are sufficient.</p>

Büchi objectives in countable MDPs

類似資料