Buoy and Winch Collaborative Control System Based on Deep Reinforcement Learning
The improved control performance of the buoy and winch collaborative control system can enhance the stability of the connection between underwater robots and ground industrial control equipment. To overcome the challenge of mathematical modeling of this control system, this research introduces reinforcement learning and transformer models in the design process.
Main Authors: | Yang Gu, Jianjun Ni, Zaiming Geng, Bing Zhao, Haowen Yang |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2025-02-01 |
Series: | Journal of Marine Science and Engineering |
Subjects: | buoy and winch control system; soft actor-critic; transformer; behavior cloning |
Online Access: | https://www.mdpi.com/2077-1312/13/2/326 |
_version_ | 1826582229050982400 |
---|---|
author | Yang Gu; Jianjun Ni; Zaiming Geng; Bing Zhao; Haowen Yang |
author_sort | Yang Gu |
collection | DOAJ |
description | The improved control performance of the buoy and winch collaborative control system can enhance the stability of the connection between underwater robots and ground industrial control equipment. To overcome the challenge of mathematical modeling of this control system, this research introduces reinforcement learning and transformer models in the design process. The main contributions include the development of two simulation environments for training DRL agents, designing a reward function to guide the exploration process, proposing a buoy control algorithm based on the discrete soft actor-critic (SAC) algorithm, and proposing a winch cable length prediction algorithm based on a lightweight transformer model. The experiment results demonstrated significant improvements in rewards diagrams, buoy control trajectories, and winch model performance, showcasing the effectiveness of our proposed system. The average error of the buoy tracking trajectories induced by different policies trained in the two environments is less than 0.05, and the evaluation error of the behavior cloning lightweight transformer model is less than 0.03. |
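The description above names a behavior-cloned lightweight transformer for winch cable length prediction, but the record does not include the paper's code or architecture details. The following is a minimal illustrative sketch in PyTorch of how such a model could be set up and trained by behavior cloning on demonstration data; the state dimension, layer sizes, and the `bc_step` helper are assumptions made for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn

class CableLengthTransformer(nn.Module):
    """Small transformer encoder that regresses winch cable length
    from a short window of buoy/winch state vectors (hypothetical layout)."""

    def __init__(self, state_dim=6, d_model=32, nhead=4, num_layers=2):
        super().__init__()
        self.embed = nn.Linear(state_dim, d_model)  # project raw states into the model width
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=nhead, dim_feedforward=64, batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=num_layers)
        self.head = nn.Linear(d_model, 1)  # scalar cable-length output

    def forward(self, states):  # states: (batch, time, state_dim)
        h = self.encoder(self.embed(states))
        return self.head(h[:, -1])  # predict from the last time step

model = CableLengthTransformer()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

def bc_step(states, target_length):
    """One behavior-cloning update toward the demonstrated cable length."""
    optimizer.zero_grad()
    loss = loss_fn(model(states), target_length)
    loss.backward()
    optimizer.step()
    return loss.item()

# Example call with random placeholder data (batch of 8, window of 20 steps):
loss = bc_step(torch.randn(8, 20, 6), torch.randn(8, 1))
```

In the paper's setting, the state sequences would come from the simulation environments driven by the discrete SAC buoy controller; here they are stand-in tensors only.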
first_indexed | 2025-03-14T15:02:38Z |
format | Article |
id | doaj.art-2047dfb43000449cb26d7143b1320f4b |
institution | Directory Open Access Journal |
issn | 2077-1312 |
language | English |
last_indexed | 2025-03-14T15:02:38Z |
publishDate | 2025-02-01 |
publisher | MDPI AG |
record_format | Article |
series | Journal of Marine Science and Engineering |
spelling | doaj.art-2047dfb43000449cb26d7143b1320f4b (indexed 2025-02-25T13:33:45Z). English. MDPI AG. Journal of Marine Science and Engineering, ISSN 2077-1312, 2025-02-01, vol. 13, no. 2, art. 326. DOI: 10.3390/jmse13020326. "Buoy and Winch Collaborative Control System Based on Deep Reinforcement Learning." Authors and affiliations: Yang Gu and Jianjun Ni (College of Artificial Intelligence and Automation, Hohai University, Changzhou 213200, China); Zaiming Geng (China Yangtze Power Co., Ltd., Yichang 443002, China); Bing Zhao (Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China); Haowen Yang (School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China). Online access: https://www.mdpi.com/2077-1312/13/2/326. Keywords: buoy and winch control system; soft actor-critic; transformer; behavior cloning. |
title | Buoy and Winch Collaborative Control System Based on Deep Reinforcement Learning |
topic | buoy and winch control system; soft actor-critic; transformer; behavior cloning |
url | https://www.mdpi.com/2077-1312/13/2/326 |