A Self-Adaptive Reinforcement-Exploration Q-Learning Algorithm

Directing at various problems of the traditional Q-Learning algorithm, such as heavy repetition and disequilibrium of explorations, the reinforcement-exploration strategy was used to replace the decayed ε-greedy strategy in the traditional Q-Learning algorithm, and thus a novel self-adaptive reinfor...

Full description

Bibliographic Details
Main Authors: Lieping Zhang, Liu Tang, Shenglan Zhang, Zhengzhong Wang, Xianhao Shen, Zuqiong Zhang
Format: Article
Language:English
Published: MDPI AG 2021-06-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/13/6/1057