Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions

This paper presents a decentralized zero-sum optimal control method for MRMs with environmental collisions via an actor-critic-identifier (ACI) structure-based adaptive dynamic programming (ADP) algorithm. The dynamic model of the MRMs is formulated via a novel collision identification method that i...

Full description

Bibliographic Details
Main Authors: Bo Dong, Tianjiao An, Fan Zhou, Keping Liu, Weibo Yu, Yuanchun Li
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8758094/
_version_ 1819073655972298752
author Bo Dong
Tianjiao An
Fan Zhou
Keping Liu
Weibo Yu
Yuanchun Li
author_facet Bo Dong
Tianjiao An
Fan Zhou
Keping Liu
Weibo Yu
Yuanchun Li
author_sort Bo Dong
collection DOAJ
description This paper presents a decentralized zero-sum optimal control method for MRMs with environmental collisions via an actor-critic-identifier (ACI) structure-based adaptive dynamic programming (ADP) algorithm. The dynamic model of the MRMs is formulated via a novel collision identification method that is deployed for each joint module, in which the local position and torque information are used to design the model compensation controller. A neural network (NN) identifier is developed to compensate the model uncertainties and then, the optimal control problem of the MRMs with environmental collisions can be transformed into a two-player zero-sum optimal control one. Based on the ADP algorithm, the Hamilton-Jacobi-Isaacs (HJI) equation is solved by constructing the actor-critic NNs, thus making the derivation of the approximate optimal control policy feasible. Based on the Lyapunov theory, the closed-loop robotic system is proved to be asymptotically stable. Finally, the experiments are conducted to verify the effectiveness and advantages of the proposed method.
first_indexed 2024-12-21T17:57:05Z
format Article
id doaj.art-4245bf84974744e8a73a24676685443d
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-21T17:57:05Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-4245bf84974744e8a73a24676685443d2022-12-21T18:55:10ZengIEEEIEEE Access2169-35362019-01-017961489616510.1109/ACCESS.2019.29275118758094Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental CollisionsBo Dong0Tianjiao An1Fan Zhou2Keping Liu3Weibo Yu4Yuanchun Li5https://orcid.org/0000-0002-0989-6312Department of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaDepartment of Control Science and Engineering, Changchun University of Technology, Changchun, ChinaThis paper presents a decentralized zero-sum optimal control method for MRMs with environmental collisions via an actor-critic-identifier (ACI) structure-based adaptive dynamic programming (ADP) algorithm. The dynamic model of the MRMs is formulated via a novel collision identification method that is deployed for each joint module, in which the local position and torque information are used to design the model compensation controller. A neural network (NN) identifier is developed to compensate the model uncertainties and then, the optimal control problem of the MRMs with environmental collisions can be transformed into a two-player zero-sum optimal control one. Based on the ADP algorithm, the Hamilton-Jacobi-Isaacs (HJI) equation is solved by constructing the actor-critic NNs, thus making the derivation of the approximate optimal control policy feasible. Based on the Lyapunov theory, the closed-loop robotic system is proved to be asymptotically stable. Finally, the experiments are conducted to verify the effectiveness and advantages of the proposed method.https://ieeexplore.ieee.org/document/8758094/Adaptive dynamic programmingcollision identificationdecentralized optimal controlmodular robot manipulatorszero-sum game
spellingShingle Bo Dong
Tianjiao An
Fan Zhou
Keping Liu
Weibo Yu
Yuanchun Li
Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
IEEE Access
Adaptive dynamic programming
collision identification
decentralized optimal control
modular robot manipulators
zero-sum game
title Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_full Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_fullStr Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_full_unstemmed Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_short Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions
title_sort actor critic identifier structure based decentralized neuro optimal control of modular robot manipulators with environmental collisions
topic Adaptive dynamic programming
collision identification
decentralized optimal control
modular robot manipulators
zero-sum game
url https://ieeexplore.ieee.org/document/8758094/
work_keys_str_mv AT bodong actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions
AT tianjiaoan actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions
AT fanzhou actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions
AT kepingliu actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions
AT weiboyu actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions
AT yuanchunli actorcriticidentifierstructurebaseddecentralizedneurooptimalcontrolofmodularrobotmanipulatorswithenvironmentalcollisions