A Sample Aggregation Approach to Experiences Replay of Dyna-Q Learning

In a complex environment, the learning efficiency of reinforcement learning methods always decreases due to large-scale or continuous spaces problems, which can cause the well-known curse of dimensionality. To deal with this problem and enhance learning efficiency, this paper introduces an aggregati...

Full description

Bibliographic Details
Main Authors: Haobin Shi, Shike Yang, Kao-Shing Hwang, Jialin Chen, Mengkai Hu, Hengsheng Zhang
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8383982/