Off-policy Maximum Entropy Deep Reinforcement Learning Algorithm Based on RandomlyWeighted Triple Q -Learning

Reinforcement learning is an important branch of machine learning.With the development of deep learning,deep reinforcement learning research has gradually developed into the focus of reinforcement learning research.Model-free off-policy deep reinforcement learning algorithms for continuous control a...

Full description

Bibliographic Details
Main Author:	FAN Jing-yu, LIU Quan
Format:	Article
Language:	zho
Published:	Editorial office of Computer Science 2022-06-01
Series:	Jisuanji kexue
Subjects:	q-learning\|deep learning\|off-policy reinforcement learning\|continuous action space\|maximum entropy\|soft actor critic algorithm
Online Access:	https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-6-335.pdf

Internet

https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-6-335.pdf

Off-policy Maximum Entropy Deep Reinforcement Learning Algorithm Based on RandomlyWeighted Triple Q -Learning

Internet

Similar Items