Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following

Unmanned surface vessels (USVs) are required to follow a path during a task. This is essential for the USV, especially when following a curvilinear path or considering the interference of waves, and this work has been proven to be complicated. In this paper, a PID parameter tuning and optimizing met...

Full description

Bibliographic Details
Main Authors:	Lifei Song, Chuanyi Xu, Le Hao, Jianxi Yao, Rong Guo
Format:	Article
Language:	English
Published:	MDPI AG 2022-12-01
Series:	Journal of Marine Science and Engineering
Subjects:	unmanned surface vehicle deep reinforcement learning parameter tuning path-following control
Online Access:	https://www.mdpi.com/2077-1312/10/12/1847

_version_	1797456937813540864
author	Lifei Song Chuanyi Xu Le Hao Jianxi Yao Rong Guo
author_facet	Lifei Song Chuanyi Xu Le Hao Jianxi Yao Rong Guo
author_sort	Lifei Song
collection	DOAJ
description	Unmanned surface vessels (USVs) are required to follow a path during a task. This is essential for the USV, especially when following a curvilinear path or considering the interference of waves, and this work has been proven to be complicated. In this paper, a PID parameter tuning and optimizing method based on deep reinforcement learning were proposed to control the USV heading. Firstly, the Abkowite dynamics model with three degrees of freedom (DOF) is established. Secondly, the guidance law on the line-of-sight (LOS) method and the USV heading control system of the PID controller are designed. To satisfy the time-varying demand of PID parameters for guiding control, especially when the USV moves in waves, the soft actor–critic auto (SAC-auto) method is presented to adjust the PID parameters automatically. Thirdly, the state, action, and reward functions of the agent are designed for training and learning. Finally, numerical simulations are performed, and the results validated the feasibility and validity of the feasibility and effectiveness of the proposed method.
first_indexed	2024-03-09T16:15:02Z
format	Article
id	doaj.art-8d60148136844b9c81389c3334d16123
institution	Directory Open Access Journal
issn	2077-1312
language	English
last_indexed	2024-03-09T16:15:02Z
publishDate	2022-12-01
publisher	MDPI AG
record_format	Article
series	Journal of Marine Science and Engineering
spelling	doaj.art-8d60148136844b9c81389c3334d161232023-11-24T15:55:21ZengMDPI AGJournal of Marine Science and Engineering2077-13122022-12-011012184710.3390/jmse10121847Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path FollowingLifei Song0Chuanyi Xu1Le Hao2Jianxi Yao3Rong Guo4Key Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaUnmanned surface vessels (USVs) are required to follow a path during a task. This is essential for the USV, especially when following a curvilinear path or considering the interference of waves, and this work has been proven to be complicated. In this paper, a PID parameter tuning and optimizing method based on deep reinforcement learning were proposed to control the USV heading. Firstly, the Abkowite dynamics model with three degrees of freedom (DOF) is established. Secondly, the guidance law on the line-of-sight (LOS) method and the USV heading control system of the PID controller are designed. To satisfy the time-varying demand of PID parameters for guiding control, especially when the USV moves in waves, the soft actor–critic auto (SAC-auto) method is presented to adjust the PID parameters automatically. Thirdly, the state, action, and reward functions of the agent are designed for training and learning. Finally, numerical simulations are performed, and the results validated the feasibility and validity of the feasibility and effectiveness of the proposed method.https://www.mdpi.com/2077-1312/10/12/1847unmanned surface vehicledeep reinforcement learningparameter tuningpath-following control
spellingShingle	Lifei Song Chuanyi Xu Le Hao Jianxi Yao Rong Guo Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following Journal of Marine Science and Engineering unmanned surface vehicle deep reinforcement learning parameter tuning path-following control
title	Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_full	Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_fullStr	Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_full_unstemmed	Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_short	Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_sort	research on pid parameter tuning and optimization based on sac auto for usv path following
topic	unmanned surface vehicle deep reinforcement learning parameter tuning path-following control
url	https://www.mdpi.com/2077-1312/10/12/1847
work_keys_str_mv	AT lifeisong researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing AT chuanyixu researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing AT lehao researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing AT jianxiyao researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing AT rongguo researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing

Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following

Similar Items