Double Broad Reinforcement Learning Based on Hindsight Experience Replay for Collision Avoidance of Unmanned Surface Vehicles

Although broad reinforcement learning (BRL) provides a more intelligent autonomous decision-making method for the collision avoidance problem of unmanned surface vehicles (USVs), the algorithm still has the problem of over-estimation and has difficulty converging quickly due to the sparse reward pro...

Full description

Bibliographic Details
Main Authors: Jiabao Yu, Jiawei Chen, Ying Chen, Zhiguo Zhou, Junwei Duan
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Journal of Marine Science and Engineering
Subjects:
Online Access:https://www.mdpi.com/2077-1312/10/12/2026