Bayesian Policy Search with Policy Priors
We consider the problem of learning to act in partially observable, continuous-state-and-action worlds where we have abstract prior knowledge about the structure of the optimal policy in the form of a distribution over policies. Using ideas from planning-as-inference reductions and Bayesian unsuperv...
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | en_US |
Published: |
International Joint Conference on Artificial Intelligence (IJCAI)
2014
|
Online Access: | http://hdl.handle.net/1721.1/87054 https://orcid.org/0000-0002-1925-2035 https://orcid.org/0000-0001-6054-7145 |