Pure-past action masking
We present Pure-Past Action Masking (PPAM), a lightweight approach to action masking for safe reinforcement learning. In PPAM, actions are disallowed (“masked”) according to specifications expressed in Pure-Past Linear Temporal Logic (PPLTL). PPAM can enforce non-Markovian constraints, i.e., constra...
Main Authors: | , , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
AAAI Press
2024
|