Sample efficient model-free reinforcement learning from LTL specifications with optimality guarantees

Linear Temporal Logic (LTL) is widely used to specify high-level objectives for system policies, and it is highly desirable for autonomous systems to learn the optimal policy with respect to such specifications. However, learning the optimal policy from LTL specifications is not trivial. We present...

Full description

Bibliographic Details
Main Authors: Shao, D, Kwiatkowska, M
Format: Conference item
Language:English
Published: IJCAI 2023