Planning with hidden parameter polynomial MDPs

For many applications of Markov Decision Processes (MDPs), the transition function cannot be specified exactly. Bayes-Adaptive MDPs (BAMDPs) extend MDPs to consider transition probabilities governed by latent parameters. To act optimally in BAMDPs, one must maintain a belief distribution over the la...

Bibliographic Details
Main Authors: Costen, C; Rigter, M; Lacerda, B; Hawes, N
Format: Conference item
Language: English
Published: Association for the Advancement of Artificial Intelligence, 2023