OFFER: Off-environment reinforcement learning

Policy gradient methods have been widely applied in reinforcement learning. For reasons of safety and cost, learning is often conducted using a simulator. However, learning in simulation does not traditionally utilise the opportunity to improve learning by adjusting certain environment variables - s...

সম্পূর্ণ বিবরণ

গ্রন্থ-পঞ্জীর বিবরন
প্রধান লেখক: Ciosek, K, Whiteson, S
বিন্যাস: Conference item
ভাষা:English
প্রকাশিত: AAAI Press 2017

অনুরূপ উপাদানগুলি