Improving the efficiency of Bayesian inverse reinforcement learning
Inverse reinforcement learning (IRL) is the task of learning the reward function of a Markov Decision Process (MDP) given knowledge of the transition function and a set of expert demonstrations. While many IRL algorithms exist, Bayesian IRL [1] provides a general and principled method of reward lear...
المؤلفون الرئيسيون: | , |
---|---|
مؤلفون آخرون: | |
التنسيق: | مقال |
اللغة: | en_US |
منشور في: |
Institute of Electrical and Electronics Engineers (IEEE)
2013
|
الوصول للمادة أونلاين: | http://hdl.handle.net/1721.1/81489 https://orcid.org/0000-0001-8576-1930 |