Transience in countable MDPs

The Transience objective is not to visit any state infinitely often. While this is not possible in any finite Markov Decision Process (MDP), it can be satisfied in countably infinite ones, e.g., if the transition graph is acyclic. We prove the following fundamental properties of Transience in counta...

Bibliographic Details
Main Authors: Kiefer, SM; Mayr, R; Shirmohammadi, M; Totzke, P
Format: Conference item
Language: English
Published: Schloss Dagstuhl 2021