Monte Carlo tree search with Boltzmann exploration

Monte-Carlo Tree Search (MCTS) methods, such as Upper Confidence Bound applied to Trees (UCT), are instrumental to automated planning techniques. However, UCT can be slow to explore an optimal action when it initially appears inferior to other actions. Maximum ENtropy Tree-Search (MENTS) incorporate...

Bibliographic Details
Main Authors: Painter, M; Baioumy, M; Hawes, N; Lacerda, B
Format: Conference item
Language: English
Published: Curran Associates, 2023