Regular decision processes

We introduce and study Regular Decision Processes (RDPs), a new, compact model for domains with non-Markovian dynamics and rewards, in which the dependence on the past is regular, in the language theoretic sense. RDPs are an intermediate model between MDPs and POMDPs. They generalize k-order MDPs an...

詳細記述

書誌詳細
主要な著者: Brafman, RI, De Giacomo, G
フォーマット: Journal article
言語:English
出版事項: Elsevier 2024