Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning

Biological fish often swim in a schooling manner, the mechanism of which comes from the fact that these schooling movements can improve the fishes’ hydrodynamic efficiency. Inspired by this phenomenon, a target-following control framework for a biomimetic autonomous system is proposed in this paper....

Full description

Bibliographic Details
Main Authors: Yu Wang, Jian Wang, Song Kang, Junzhi Yu
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Biomimetics
Subjects:
Online Access:https://www.mdpi.com/2313-7673/9/1/33
_version_ 1797344588225052672
author Yu Wang
Jian Wang
Song Kang
Junzhi Yu
author_facet Yu Wang
Jian Wang
Song Kang
Junzhi Yu
author_sort Yu Wang
collection DOAJ
description Biological fish often swim in a schooling manner, the mechanism of which comes from the fact that these schooling movements can improve the fishes’ hydrodynamic efficiency. Inspired by this phenomenon, a target-following control framework for a biomimetic autonomous system is proposed in this paper. Firstly, a following motion model is established based on the mechanism of fish schooling swimming, in which the follower robotic fish keeps a certain distance and orientation from the leader robotic fish. Second, by incorporating a predictive concept into reinforcement learning, a predictive deep deterministic policy gradient-following controller is provided with the normalized state space, action space, reward, and prediction design. It can avoid overshoot to a certain extent. A nonlinear model predictive controller is designed and can be selected for the follower robotic fish, together with the predictive reinforcement learning. Finally, extensive simulations are conducted, including the fix point and dynamic target following for single robotic fish, as well as cooperative following with the leader robotic fish. The obtained results indicate the effectiveness of the proposed methods, providing a valuable sight for the cooperative control of underwater robots to explore the ocean.
first_indexed 2024-03-08T11:04:54Z
format Article
id doaj.art-f4c0682501bf4c15a40b8ead0b3b807b
institution Directory Open Access Journal
issn 2313-7673
language English
last_indexed 2024-03-08T11:04:54Z
publishDate 2024-01-01
publisher MDPI AG
record_format Article
series Biomimetics
spelling doaj.art-f4c0682501bf4c15a40b8ead0b3b807b2024-01-26T15:15:50ZengMDPI AGBiomimetics2313-76732024-01-01913310.3390/biomimetics9010033Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement LearningYu Wang0Jian Wang1Song Kang2Junzhi Yu3Department of Automation, Tsinghua University, Beijing 100084, ChinaThe Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, ChinaThe Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, ChinaThe Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, ChinaBiological fish often swim in a schooling manner, the mechanism of which comes from the fact that these schooling movements can improve the fishes’ hydrodynamic efficiency. Inspired by this phenomenon, a target-following control framework for a biomimetic autonomous system is proposed in this paper. Firstly, a following motion model is established based on the mechanism of fish schooling swimming, in which the follower robotic fish keeps a certain distance and orientation from the leader robotic fish. Second, by incorporating a predictive concept into reinforcement learning, a predictive deep deterministic policy gradient-following controller is provided with the normalized state space, action space, reward, and prediction design. It can avoid overshoot to a certain extent. A nonlinear model predictive controller is designed and can be selected for the follower robotic fish, together with the predictive reinforcement learning. Finally, extensive simulations are conducted, including the fix point and dynamic target following for single robotic fish, as well as cooperative following with the leader robotic fish. The obtained results indicate the effectiveness of the proposed methods, providing a valuable sight for the cooperative control of underwater robots to explore the ocean.https://www.mdpi.com/2313-7673/9/1/33biomimetic motionbiomimetic autonomous systemtarget followingdeep reinforcement learningpredictive control
spellingShingle Yu Wang
Jian Wang
Song Kang
Junzhi Yu
Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
Biomimetics
biomimetic motion
biomimetic autonomous system
target following
deep reinforcement learning
predictive control
title Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_full Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_fullStr Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_full_unstemmed Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_short Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_sort target following control of a biomimetic autonomous system based on predictive reinforcement learning
topic biomimetic motion
biomimetic autonomous system
target following
deep reinforcement learning
predictive control
url https://www.mdpi.com/2313-7673/9/1/33
work_keys_str_mv AT yuwang targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning
AT jianwang targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning
AT songkang targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning
AT junzhiyu targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning