Safe POMDP online planning via shielding

Partially observable Markov decision processes (POMDPs) have been widely used in many robotic applications for sequential decision-making under uncertainty. POMDP online planning algorithms such as Partially Observable Monte-Carlo Planning (POMCP) can solve very large POMDPs with the goal of maximiz...

Full description

Bibliographic Details
Main Authors: Sheng, S, Parker, D, Feng, L
Format: Conference item
Language:English
Published: IEEE 2024