Robust reinforcement learning with Bayesian optimisation and quadrature
Bayesian optimisation has been successfully applied to a variety of reinforcement learning problems. However, the traditional approach for learning optimal policies in simulators does not utilise the opportunity to improve learning by adjusting certain environment variables: state features that are...
Main authors: | , , , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: | Journal of Machine Learning Research, 2020 |