Deterministic and discriminative imitation (D2-imitation): revisiting adversarial imitation for sample efficiency

Sample efficiency is crucial for imitation learning methods to be applicable in real-world applications. Many studies improve sample efficiency by extending adversarial imitation to be off-policy regardless of the fact that these off-policy extensions could either change the original objective or in...

Full description

Bibliographic Details
Main Authors: Sun, M, Devlin, S, Hofmann, K, Whiteson, S
Format: Conference item
Language:English
Published: Association for the Advancement of Artificial Intelligence 2022