Bayesian Bellman operators
We introduce a novel perspective on Bayesian reinforcement learning (RL); whereas existing approaches infer a posterior over the transition distribution or Q-function, we characterise the uncertainty in the Bellman operator. Our Bayesian Bellman operator (BBO) framework is motivated by the insight t...
Main Authors: | , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
NeurIPS
2022
|