Improved Dyna-Q: A Reinforcement Learning Method Focused via Heuristic Graph for AGV Path Planning in Dynamic Environments

Dyna-Q is a reinforcement learning method widely used in AGV path planning. However, in large complex dynamic environments, due to the sparse reward function of Dyna-Q and the large searching space, this method has the problems of low search efficiency, slow convergence speed, and even inability to...

Full description

Bibliographic Details
Main Authors:	Yiyang Liu, Shuaihua Yan, Yang Zhao, Chunhe Song, Fei Li
Format:	Article
Language:	English
Published:	MDPI AG 2022-11-01
Series:	Drones
Subjects:	path planning complex dynamic environment Dyna-Q reinforcement learning
Online Access:	https://www.mdpi.com/2504-446X/6/11/365

Internet

https://www.mdpi.com/2504-446X/6/11/365

Improved Dyna-Q: A Reinforcement Learning Method Focused via Heuristic Graph for AGV Path Planning in Dynamic Environments

Internet

Similar Items