Learning dynamics and generalization in reinforcement learning
Solving a reinforcement learning (RL) problem poses two competing challenges: fitting a potentially discontinuous value function, and generalizing well to new observations. In this paper, we analyze the learning dynamics of temporal difference algorithms to gain novel insight into the tension betwee...
主要な著者: | , , , , |
---|---|
フォーマット: | Conference item |
言語: | English |
出版事項: |
Journal of Machine Learning Research
2022
|