Admissible policy teaching through reward design

We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy from a set of admissible policies. The goal of the reward designer is to modify the underlying reward function cost-efficiently while ensuring that any approximately optimal deterministic policy unde...

Full description

Bibliographic Details
Main Authors: Banihashem, K, Singla, A, Gan, J, Radanovic, G
Format: Conference item
Language:English
Published: Association for the Advancement of Artificial Intelligence 2022