Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning

Bibliographic Details
Main Authors: Farquhar, G, Whiteson, S, Foerster, J
Format: Conference item
Published: Neural Information Processing Systems Foundation 2019