Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning

Biological fish often swim in a schooling manner, the mechanism of which comes from the fact that these schooling movements can improve the fishes’ hydrodynamic efficiency. Inspired by this phenomenon, a target-following control framework for a biomimetic autonomous system is proposed in this paper....

Full description

Bibliographic Details
Main Authors:	Yu Wang, Jian Wang, Song Kang, Junzhi Yu
Format:	Article
Language:	English
Published:	MDPI AG 2024-01-01
Series:	Biomimetics
Subjects:	biomimetic motion biomimetic autonomous system target following deep reinforcement learning predictive control
Online Access:	https://www.mdpi.com/2313-7673/9/1/33

_version_	1797344588225052672
author	Yu Wang Jian Wang Song Kang Junzhi Yu
author_facet	Yu Wang Jian Wang Song Kang Junzhi Yu
author_sort	Yu Wang
collection	DOAJ
description	Biological fish often swim in a schooling manner, the mechanism of which comes from the fact that these schooling movements can improve the fishes’ hydrodynamic efficiency. Inspired by this phenomenon, a target-following control framework for a biomimetic autonomous system is proposed in this paper. Firstly, a following motion model is established based on the mechanism of fish schooling swimming, in which the follower robotic fish keeps a certain distance and orientation from the leader robotic fish. Second, by incorporating a predictive concept into reinforcement learning, a predictive deep deterministic policy gradient-following controller is provided with the normalized state space, action space, reward, and prediction design. It can avoid overshoot to a certain extent. A nonlinear model predictive controller is designed and can be selected for the follower robotic fish, together with the predictive reinforcement learning. Finally, extensive simulations are conducted, including the fix point and dynamic target following for single robotic fish, as well as cooperative following with the leader robotic fish. The obtained results indicate the effectiveness of the proposed methods, providing a valuable sight for the cooperative control of underwater robots to explore the ocean.
first_indexed	2024-03-08T11:04:54Z
format	Article
id	doaj.art-f4c0682501bf4c15a40b8ead0b3b807b
institution	Directory Open Access Journal
issn	2313-7673
language	English
last_indexed	2024-03-08T11:04:54Z
publishDate	2024-01-01
publisher	MDPI AG
record_format	Article
series	Biomimetics
spelling	doaj.art-f4c0682501bf4c15a40b8ead0b3b807b2024-01-26T15:15:50ZengMDPI AGBiomimetics2313-76732024-01-01913310.3390/biomimetics9010033Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement LearningYu Wang0Jian Wang1Song Kang2Junzhi Yu3Department of Automation, Tsinghua University, Beijing 100084, ChinaThe Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, ChinaThe Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, ChinaThe Laboratory of Cognitive and Decision Intelligence for Complex System, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, ChinaBiological fish often swim in a schooling manner, the mechanism of which comes from the fact that these schooling movements can improve the fishes’ hydrodynamic efficiency. Inspired by this phenomenon, a target-following control framework for a biomimetic autonomous system is proposed in this paper. Firstly, a following motion model is established based on the mechanism of fish schooling swimming, in which the follower robotic fish keeps a certain distance and orientation from the leader robotic fish. Second, by incorporating a predictive concept into reinforcement learning, a predictive deep deterministic policy gradient-following controller is provided with the normalized state space, action space, reward, and prediction design. It can avoid overshoot to a certain extent. A nonlinear model predictive controller is designed and can be selected for the follower robotic fish, together with the predictive reinforcement learning. Finally, extensive simulations are conducted, including the fix point and dynamic target following for single robotic fish, as well as cooperative following with the leader robotic fish. The obtained results indicate the effectiveness of the proposed methods, providing a valuable sight for the cooperative control of underwater robots to explore the ocean.https://www.mdpi.com/2313-7673/9/1/33biomimetic motionbiomimetic autonomous systemtarget followingdeep reinforcement learningpredictive control
spellingShingle	Yu Wang Jian Wang Song Kang Junzhi Yu Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning Biomimetics biomimetic motion biomimetic autonomous system target following deep reinforcement learning predictive control
title	Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_full	Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_fullStr	Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_full_unstemmed	Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_short	Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning
title_sort	target following control of a biomimetic autonomous system based on predictive reinforcement learning
topic	biomimetic motion biomimetic autonomous system target following deep reinforcement learning predictive control
url	https://www.mdpi.com/2313-7673/9/1/33
work_keys_str_mv	AT yuwang targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning AT jianwang targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning AT songkang targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning AT junzhiyu targetfollowingcontrolofabiomimeticautonomoussystembasedonpredictivereinforcementlearning

Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning

Similar Items