A Sample Aggregation Approach to Experiences Replay of Dyna-Q Learning

In a complex environment, the learning efficiency of reinforcement learning methods always decreases due to large-scale or continuous spaces problems, which can cause the well-known curse of dimensionality. To deal with this problem and enhance learning efficiency, this paper introduces an aggregati...

Full description

Bibliographic Details
Main Authors:	Haobin Shi, Shike Yang, Kao-Shing Hwang, Jialin Chen, Mengkai Hu, Hengsheng Zhang
Format:	Article
Language:	English
Published:	IEEE 2018-01-01
Series:	IEEE Access
Subjects:	Dyna-Q Minhash Chinese restaurant process FSA-CRP model prediction
Online Access:	https://ieeexplore.ieee.org/document/8383982/

Internet

https://ieeexplore.ieee.org/document/8383982/

A Sample Aggregation Approach to Experiences Replay of Dyna-Q Learning

Internet

Similar Items