Bayesian Policy Search with Policy Priors

We consider the problem of learning to act in partially observable, continuous-state-and-action worlds where we have abstract prior knowledge about the structure of the optimal policy in the form of a distribution over policies. Using ideas from planning-as-inference reductions and Bayesian unsuperv...

Full description

Bibliographic Details
Main Authors:	Wingate, David, Goodman, Noah D., Roy, Daniel M., Kaelbling, Leslie P., Tenenbaum, Joshua B.
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format:	Article
Language:	en_US
Published:	International Joint Conference on Artificial Intelligence (IJCAI) 2014
Online Access:	http://hdl.handle.net/1721.1/87054 https://orcid.org/0000-0002-1925-2035 https://orcid.org/0000-0001-6054-7145

Internet

http://hdl.handle.net/1721.1/87054
https://orcid.org/0000-0002-1925-2035
https://orcid.org/0000-0001-6054-7145

Bayesian Policy Search with Policy Priors

Internet

Similar Items