Monte-Carlo planning in large POMDPs

This paper introduces a Monte-Carlo algorithm for online planning in large POMDPs. The algorithm combines a Monte-Carlo update of the agent's belief state with a Monte-Carlo tree search from the current belief state. The new algorithm, POMCP, has two important properties. First, Monte-Carlo sam...

Bibliographic Details
Main Authors: Silver, David; Veness, Joel
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language: en_US
Published: Neural Information Processing Systems 2015
Online Access: http://hdl.handle.net/1721.1/100395