Improved Dyna-Q: A Reinforcement Learning Method Focused via Heuristic Graph for AGV Path Planning in Dynamic Environments
Dyna-Q is a reinforcement learning method widely used in AGV path planning. However, in large complex dynamic environments, due to the sparse reward function of Dyna-Q and the large searching space, this method has the problems of low search efficiency, slow convergence speed, and even inability to...
Main Authors: | Yiyang Liu, Shuaihua Yan, Yang Zhao, Chunhe Song, Fei Li |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-11-01
|
Series: | Drones |
Subjects: | |
Online Access: | https://www.mdpi.com/2504-446X/6/11/365 |
Similar Items
-
Anti-Jamming Path Selection Method in a Wireless Communication Network Based on Dyna-Q
by: Guoliang Zhang, et al.
Published: (2022-07-01) -
A Dyna_Q-learning Algorithm Used in Underground Path Planning
by: ZHU Mei-qiang, et al.
Published: (2012-12-01) -
Deep Dyna-Q for Rapid Learning and Improved Formation Achievement in Cooperative Transportation
by: Almira Budiyanto, et al.
Published: (2023-07-01) -
A Dyna-Q-Based Solution for UAV Networks Against Smart Jamming Attacks
by: Zhiwei Li, et al.
Published: (2019-05-01) -
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning
by: Xuecheng Niu, et al.
Published: (2024-01-01)