Robust reinforcement learning with Bayesian optimisation and quadrature

Bayesian optimisation has been successfully applied to a variety of reinforcement learning problems. However, the traditional approach for learning optimal policies in simulators does not utilise the opportunity to improve learning by adjusting certain environment variables: state features that are...

Bibliographic details
Main authors:	Paul, S, Chatzilygeroudis, K, Ciosek, K, Mouret, J-B, Osborne, MA, Whiteson, S
Format:	Journal article
Language:	English
Published:	Journal of Machine Learning Research 2020
