Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning

Bibliographic details
Main Authors: Farquhar, G, Whiteson, S, Foerster, J
Format: Conference item
Published: Neural Information Processing Systems Foundation 2019