Pure-past action masking

We present Pure-Past Action Masking (PPAM), a lightweight approach to action masking for safe reinforcement learning. In PPAM, actions are disallowed (“masked”) according to specifications expressed in Pure-Past Linear Temporal Logic (PPLTL). PPAM can enforce non-Markovian constraints, i.e., constra...

Bibliographic Details
Main Authors: Varricchione, G, Alechina, N, Dastani, M, De Giacomo, G, Logan, B, Perelli, G
Format: Conference item
Language: English
Published: AAAI Press 2024

Similar Items