Deep variational reinforcement learning for POMDPs

Many real-world sequential decision making problems are partially observable by nature, and the environment model is typically unknown. Consequently, there is great need for reinforcement learning methods that can tackle such problems given only a stream of incomplete and noisy observations. In this...

Бүрэн тодорхойлолт

Номзүйн дэлгэрэнгүй
Үндсэн зохиолчид: Igl, M, Zintgraf, L, Le, T, Wood, F, Whiteson, S
Формат: Conference item
Хэвлэсэн: Journal of Machine Learning Research 2018

Ижил төстэй зүйлс