Compositional Policy Priors
This paper describes a probabilistic framework for incorporating structured inductive biases into reinforcement learning. These inductive biases arise from policy priors, probability distributions over optimal policies. Borrowing recent ideas from computational linguistics and Bayesian nonparametric...
Main Authors: | , , , , |
---|---|
Published: | 2013 |
Online Access: | http://hdl.handle.net/1721.1/78573 |