Pure-past action masking

We present Pure-Past Action Masking (PPAM), a lightweight approach to action masking for safe reinforcement learning. In PPAM, actions are disallowed (“masked”) according to specifications expressed in Pure-Past Linear Temporal Logic (PPLTL). PPAM can enforce non-Markovian constraints, i.e., constra...

Full description

Bibliographic Details
Main Authors:	Varricchione, G, Alechina, N, Dastani, M, De Giacomo, G, Logan, B, Perelli, G
Format:	Conference item
Language:	English
Published:	AAAI Press 2024

Pure-past action masking

Similar Items