Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions

This paper presents a decentralized zero-sum optimal control method for MRMs with environmental collisions via an actor-critic-identifier (ACI) structure-based adaptive dynamic programming (ADP) algorithm. The dynamic model of the MRMs is formulated via a novel collision identification method that i...

Full description

Bibliographic Details
Main Authors:	Bo Dong, Tianjiao An, Fan Zhou, Keping Liu, Weibo Yu, Yuanchun Li
Format:	Article
Language:	English
Published:	IEEE 2019-01-01
Series:	IEEE Access
Subjects:	Adaptive dynamic programming collision identification decentralized optimal control modular robot manipulators zero-sum game
Online Access:	https://ieeexplore.ieee.org/document/8758094/

_version_	1819073655972298752
author	Bo Dong Tianjiao An Fan Zhou Keping Liu Weibo Yu Yuanchun Li
author_facet	Bo Dong Tianjiao An Fan Zhou Keping Liu Weibo Yu Yuanchun Li
author_sort	Bo Dong
collection	DOAJ
description	This paper presents a decentralized zero-sum optimal control method for MRMs with environmental collisions via an actor-critic-identifier (ACI) structure-based adaptive dynamic programming (ADP) algorithm. The dynamic model of the MRMs is formulated via a novel collision identification method that is deployed for each joint module, in which the local position and torque information are used to design the model compensation controller. A neural network (NN) identifier is developed to compensate the model uncertainties and then, the optimal control problem of the MRMs with environmental collisions can be transformed into a two-player zero-sum optimal control one. Based on the ADP algorithm, the Hamilton-Jacobi-Isaacs (HJI) equation is solved by constructing the actor-critic NNs, thus making the derivation of the approximate optimal control policy feasible. Based on the Lyapunov theory, the closed-loop robotic system is proved to be asymptotically stable. Finally, the experiments are conducted to verify the effectiveness and advantages of the proposed method.
first_indexed	2024-12-21T17:57:05Z
format	Article
id	doaj.art-4245bf84974744e8a73a24676685443d
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-21T17:57:05Z
publishDate	2019-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-4245bf84974744e8a73a24676685443d2022-12-21T18:55:10ZengIEEEIEEE Access2169-35362019-01-017961489616510.1109/ACCESS.2019.29275118758094Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental CollisionsBo Dong0Tianjiao An1Fan Zhou2Keping Liu3Weibo Yu4Yuanchun Li5https://orcid.org/0000-0002-0989-6312Department of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaThis paper presents a decentralized zero-sum optimal control method for MRMs with environmental collisions via an actor-critic-identifier (ACI) structure-based adaptive dynamic programming (ADP) algorithm. The dynamic model of the MRMs is formulated via a novel collision identification method that is deployed for each joint module, in which the local position and torque information are used to design the model compensation controller. A neural network (NN) identifier is developed to compensate the model uncertainties and then, the optimal control problem of the MRMs with environmental collisions can be transformed into a two-player zero-sum optimal control one. Based on the ADP algorithm, the Hamilton-Jacobi-Isaacs (HJI) equation is solved by constructing the actor-critic NNs, thus making the derivation of the approximate optimal control policy feasible. Based on the Lyapunov theory, the closed-loop robotic system is proved to be asymptotically stable. Finally, the experiments are conducted to verify the effectiveness and advantages of the proposed method.https://ieeexplore.ieee.org/document/8758094/Adaptive dynamic programmingcollision identificationdecentralized optimal controlmodular robot manipulatorszero-sum game
spellingShingle	Bo Dong Tianjiao An Fan Zhou Keping Liu Weibo Yu Yuanchun Li Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions IEEE Access Adaptive dynamic programming collision identification decentralized optimal control modular robot manipulators zero-sum game
title	Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_full	Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_fullStr	Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_full_unstemmed	Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_short	Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_sort	actor critic identifier structure based decentralized neuro optimal control of modular robot manipulators with environmental collisions
topic	Adaptive dynamic programming collision identification decentralized optimal control modular robot manipulators zero-sum game
url	https://ieeexplore.ieee.org/document/8758094/
work_keys_str_mv	AT bodong actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions AT tianjiaoan actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions AT fanzhou actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions AT kepingliu actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions AT weiboyu actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions AT yuanchunli actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions

Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions

Similar Items