Safe POMDP online planning via shielding

Partially observable Markov decision processes (POMDPs) have been widely used in many robotic applications for sequential decision-making under uncertainty. POMDP online planning algorithms such as Partially Observable Monte-Carlo Planning (POMCP) can solve very large POMDPs with the goal of maximiz...

Deskribapen osoa

Xehetasun bibliografikoak
Egile Nagusiak: Sheng, S, Parker, D, Feng, L
Formatua: Conference item
Hizkuntza:English
Argitaratua: IEEE 2024