Exploration in approximate hyper-state space for meta reinforcement learning
To rapidly learn a new task, it is often essential for agents to explore efficiently - especially when performance matters from the first timestep. One way to learn such behaviour is via meta-learning. Many existing methods however rely on dense rewards for meta-training, and can fail catastrophical...
Main Authors: | , , , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
PMLR
2021
|