Bayesian Bellman operators

We introduce a novel perspective on Bayesian reinforcement learning (RL); whereas existing approaches infer a posterior over the transition distribution or Q-function, we characterise the uncertainty in the Bellman operator. Our Bayesian Bellman operator (BBO) framework is motivated by the insight t...

Bibliographic Details
Main Authors: Fellows, M; Hartikainen, K; Whiteson, S
Format: Conference item
Language: English
Published: NeurIPS 2022