Transience in countable MDPs

The Transience objective is not to visit any state infinitely often. While this is not possible in any finite Markov Decision Process (MDP), it can be satisfied in countably infinite ones, e.g., if the transition graph is acyclic. We prove the following fundamental properties of Transience in counta...

Bibliographic Details
Main Authors: Kiefer, SM, Mayr, R, Shirmohammadi, M, Totzke, P
Format: Conference item
Language: English
Published: Schloss Dagstuhl 2021