Sample efficient model-free reinforcement learning from LTL specifications with optimality guarantees
Linear Temporal Logic (LTL) is widely used to specify high-level objectives for system policies, and it is highly desirable for autonomous systems to learn the optimal policy with respect to such specifications. However, learning the optimal policy from LTL specifications is not trivial. We present...
Main Authors: | , |
---|---|
Format: | Conference item |
Language: | English |
Published: | IJCAI 2023 |