Clock specifications for temporal tasks in planning and learning

Recently, Linear Temporal Logics on finite traces, such as LTL<sub>𝑓</sub> (or LDL<sub>𝑓</sub> ), have been advocated as high-level formalisms to express dynamic properties, such as goals in planning domains or rewards in Reinforcement Learning (RL). This paper addresses the...

Volledige beschrijving

Bibliografische gegevens
Hoofdauteurs: De Giacomo, G, Favorito, M, Patrizi, F
Formaat: Conference item
Taal:English
Gepubliceerd in: CEUR Workshop Proceedings 2024
Omschrijving
Samenvatting:Recently, Linear Temporal Logics on finite traces, such as LTL<sub>𝑓</sub> (or LDL<sub>𝑓</sub> ), have been advocated as high-level formalisms to express dynamic properties, such as goals in planning domains or rewards in Reinforcement Learning (RL). This paper addresses the challenge of separating high-level temporal specifications from the low-level details of the underlying environment (domain or MDP), by allowing for expressing the specifications at a different time granularity than the environment. We study the notion of a clock which progresses the high-level LTL<sub>𝑓</sub> specification, whose ticks are triggered by dynamic (low-level) properties defined on the underlying environment. The obtained separation enables terse high-level specifications while allowing for very expressive forms of clock expressed as general LTL<sub>𝑓</sub> properties over low-level features, such as counting or occurrence/alternation of special events. We devise an automata-based construction to compile away the clock into a deterministic automaton that is polynomial in the size of the automata characterizing the high-level and clock specifications. We show the correctness of the approach and discuss its application in several contexts, including FOND planning, RL with LTL<sub>𝑓</sub> Restraining Bolts, and Reward Machines.