Actor-Critic Policy Learning in Cooperative Planning

In this paper, we introduce a method for learning and adapting cooperative control strategies in real-time stochastic domains. Our framework is an instance of the intelligent cooperative control architecture (iCCA)[superscript 1]. The agent starts by following the "safe" plan calculated by...

Full description

Bibliographic Details
Main Authors: Redding, Joshua, Geramifard, Alborz, Choi, Han-Lim, How, Jonathan P.
Other Authors: Massachusetts Institute of Technology. Aerospace Controls Laboratory
Format: Article
Language:en_US
Published: American Institute of Aeronautics and Astronautics 2013
Online Access:http://hdl.handle.net/1721.1/81477
https://orcid.org/0000-0002-2508-1957
https://orcid.org/0000-0001-8576-1930