Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following

Unmanned surface vessels (USVs) are required to follow a path during a task. This is essential for the USV, especially when following a curvilinear path or considering the interference of waves, and this work has been proven to be complicated. In this paper, a PID parameter tuning and optimizing met...

Full description

Bibliographic Details
Main Authors: Lifei Song, Chuanyi Xu, Le Hao, Jianxi Yao, Rong Guo
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Journal of Marine Science and Engineering
Subjects:
Online Access:https://www.mdpi.com/2077-1312/10/12/1847
_version_ 1797456937813540864
author Lifei Song
Chuanyi Xu
Le Hao
Jianxi Yao
Rong Guo
author_facet Lifei Song
Chuanyi Xu
Le Hao
Jianxi Yao
Rong Guo
author_sort Lifei Song
collection DOAJ
description Unmanned surface vessels (USVs) are required to follow a path during a task. This is essential for the USV, especially when following a curvilinear path or considering the interference of waves, and this work has been proven to be complicated. In this paper, a PID parameter tuning and optimizing method based on deep reinforcement learning were proposed to control the USV heading. Firstly, the Abkowite dynamics model with three degrees of freedom (DOF) is established. Secondly, the guidance law on the line-of-sight (LOS) method and the USV heading control system of the PID controller are designed. To satisfy the time-varying demand of PID parameters for guiding control, especially when the USV moves in waves, the soft actor–critic auto (SAC-auto) method is presented to adjust the PID parameters automatically. Thirdly, the state, action, and reward functions of the agent are designed for training and learning. Finally, numerical simulations are performed, and the results validated the feasibility and validity of the feasibility and effectiveness of the proposed method.
first_indexed 2024-03-09T16:15:02Z
format Article
id doaj.art-8d60148136844b9c81389c3334d16123
institution Directory Open Access Journal
issn 2077-1312
language English
last_indexed 2024-03-09T16:15:02Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Journal of Marine Science and Engineering
spelling doaj.art-8d60148136844b9c81389c3334d161232023-11-24T15:55:21ZengMDPI AGJournal of Marine Science and Engineering2077-13122022-12-011012184710.3390/jmse10121847Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path FollowingLifei Song0Chuanyi Xu1Le Hao2Jianxi Yao3Rong Guo4Key Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaKey Laboratory of High Performance Ship Technology, Wuhan University of Technology, Ministry of Education, Wuhan 430070, ChinaUnmanned surface vessels (USVs) are required to follow a path during a task. This is essential for the USV, especially when following a curvilinear path or considering the interference of waves, and this work has been proven to be complicated. In this paper, a PID parameter tuning and optimizing method based on deep reinforcement learning were proposed to control the USV heading. Firstly, the Abkowite dynamics model with three degrees of freedom (DOF) is established. Secondly, the guidance law on the line-of-sight (LOS) method and the USV heading control system of the PID controller are designed. To satisfy the time-varying demand of PID parameters for guiding control, especially when the USV moves in waves, the soft actor–critic auto (SAC-auto) method is presented to adjust the PID parameters automatically. Thirdly, the state, action, and reward functions of the agent are designed for training and learning. Finally, numerical simulations are performed, and the results validated the feasibility and validity of the feasibility and effectiveness of the proposed method.https://www.mdpi.com/2077-1312/10/12/1847unmanned surface vehicledeep reinforcement learningparameter tuningpath-following control
spellingShingle Lifei Song
Chuanyi Xu
Le Hao
Jianxi Yao
Rong Guo
Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
Journal of Marine Science and Engineering
unmanned surface vehicle
deep reinforcement learning
parameter tuning
path-following control
title Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_full Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_fullStr Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_full_unstemmed Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_short Research on PID Parameter Tuning and Optimization Based on SAC-Auto for USV Path Following
title_sort research on pid parameter tuning and optimization based on sac auto for usv path following
topic unmanned surface vehicle
deep reinforcement learning
parameter tuning
path-following control
url https://www.mdpi.com/2077-1312/10/12/1847
work_keys_str_mv AT lifeisong researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing
AT chuanyixu researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing
AT lehao researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing
AT jianxiyao researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing
AT rongguo researchonpidparametertuningandoptimizationbasedonsacautoforusvpathfollowing