OFFER: Off-environment reinforcement learning
Policy gradient methods have been widely applied in reinforcement learning. For reasons of safety and cost, learning is often conducted using a simulator. However, learning in simulation does not traditionally utilise the opportunity to improve learning by adjusting certain environment variables - s...
Main authors: | , |
---|---|
Format: | Conference item |
Language: | English |
Published: | AAAI Press, 2017 |