Actor-Critic Policy Learning in Cooperative Planning
In this paper, we introduce a method for learning and adapting cooperative control strategies in real-time stochastic domains. Our framework is an instance of the intelligent cooperative control architecture (iCCA)[superscript 1]. The agent starts by following the "safe" plan calculated by...
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | en_US |
Published: |
American Institute of Aeronautics and Astronautics
2013
|
Online Access: | http://hdl.handle.net/1721.1/81477 https://orcid.org/0000-0002-2508-1957 https://orcid.org/0000-0001-8576-1930 |