Multi-Agent Planning under Uncertainty with Monte Carlo Q-Value Function

Decentralized partially observable Markov decision processes (Dec-POMDPs) are general multi-agent models for planning under uncertainty, but are intractable to solve. Doubly exponential growth of the search space as the horizon increases makes a brute-force search impossible. Heuristic methods can g...

Full description

Bibliographic Details
Main Authors: Jian Zhang, Yaozong Pan, Ruili Wang, Yuqiang Fang, Haitao Yang
Format: Article
Language:English
Published: MDPI AG 2019-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/9/7/1430