Learning dynamics and generalization in reinforcement learning

Solving a reinforcement learning (RL) problem poses two competing challenges: fitting a potentially discontinuous value function, and generalizing well to new observations. In this paper, we analyze the learning dynamics of temporal difference algorithms to gain novel insight into the tension betwee...

Бүрэн тодорхойлолт

Номзүйн дэлгэрэнгүй
Үндсэн зохиолчид: Lyle, C, Rowland, M, Dabney, W, Kwiatkowska, M, Gal, Y
Формат: Conference item
Хэл сонгох:English
Хэвлэсэн: Journal of Machine Learning Research 2022