Bottom-up learning of hierarchical models in a class of deterministic POMDP environments

The theory of partially observable Markov decision processes (POMDPs) is a useful tool for developing various intelligent agents, and learning hierarchical POMDP models is one of the key approaches for building such agents when the environments of the agents are unknown and large. To learn hierarchi...

Full description

Bibliographic Details
Main Authors: Itoh Hideaki, Fukumoto Hisao, Wakuya Hiroshi, Furukawa Tatsuya
Format: Article
Language:English
Published: Sciendo 2015-09-01
Series:International Journal of Applied Mathematics and Computer Science
Subjects:
Online Access:https://doi.org/10.1515/amcs-2015-0044