Monte Carlo tree search with Boltzmann exploration
Monte-Carlo Tree Search (MCTS) methods, such as Upper Confidence Bound applied to Trees (UCT), are instrumental to automated planning techniques. However, UCT can be slow to explore an optimal action when it initially appears inferior to other actions. Maximum ENtropy Tree-Search (MENTS) incorporate...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
Curran Associates
2023
|