Sparse Reward Exploration Method Based on Trajectory Perception

When dealing with sparse reward problems, existing deep RL algorithms often face hard exploration because they rely only on the pre-designed environment reward, so it is difficult to achieve good results. In this situation, it is necessary to design rewards more carefully, make more accurate judgments a...

Bibliographic Details
Main Authors: ZHANG Qiyang, CHEN Xiliang, ZHANG Qiao
Format: Article
Language: zho
Published: Editorial office of Computer Science 2023-01-01
Series: Jisuanji kexue
Online Access: https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2023-50-1-262.pdf