Bayesian Bellman operators

We introduce a novel perspective on Bayesian reinforcement learning (RL); whereas existing approaches infer a posterior over the transition distribution or Q-function, we characterise the uncertainty in the Bellman operator. Our Bayesian Bellman operator (BBO) framework is motivated by the insight t...

Bibliographic Details
Main Authors:	Fellows, M; Hartikainen, K; Whiteson, S
Format:	Conference item
Language:	English
Published:	NeurIPS 2022

Similar Items