Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment

When a lunar assisted robot helps an astronaut turn over or transports the astronaut from the ground, the trajectory of the robot’s dual arms should be automatically planned according to the unstructured environment on the lunar surface. In this paper, a dual-arm control strategy model of a lunar as...

Full description

Bibliographic Details
Main Authors:	Weiyan Ren, Dapeng Han, Zhaokui Wang
Format:	Article
Language:	English
Published:	MDPI AG 2022-06-01
Series:	Aerospace
Subjects:	lunar assisted robot hierarchical reinforcement learning dual-arm control reward functions
Online Access:	https://www.mdpi.com/2226-4310/9/6/315

_version_	1797491047373209600
author	Weiyan Ren Dapeng Han Zhaokui Wang
author_facet	Weiyan Ren Dapeng Han Zhaokui Wang
author_sort	Weiyan Ren
collection	DOAJ
description	When a lunar assisted robot helps an astronaut turn over or transports the astronaut from the ground, the trajectory of the robot’s dual arms should be automatically planned according to the unstructured environment on the lunar surface. In this paper, a dual-arm control strategy model of a lunar assisted robot based on hierarchical reinforcement learning is proposed, and the trajectory planning problem is modeled as a two-layer Markov decision process. In the training process, a reward function design method based on the idea of the artificial potential field method is proposed, and the reward information is fed back in a dense reward method, which significantly reduces the invalid exploration space and improves the learning efficiency. Large-scale tests are carried out in both simulated and physical environments, and the results demonstrate the effectiveness of the method proposed in this paper. This research is of great significance in respect of human–robot interaction, environmental interaction, and intelligent control of robots.
first_indexed	2024-03-10T00:41:49Z
format	Article
id	doaj.art-059cbb95cb7e492bbfd79f0db7bef09b
institution	Directory Open Access Journal
issn	2226-4310
language	English
last_indexed	2024-03-10T00:41:49Z
publishDate	2022-06-01
publisher	MDPI AG
record_format	Article
series	Aerospace
spelling	doaj.art-059cbb95cb7e492bbfd79f0db7bef09b2023-11-23T15:05:38ZengMDPI AGAerospace2226-43102022-06-019631510.3390/aerospace9060315Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured EnvironmentWeiyan Ren0Dapeng Han1Zhaokui Wang2School of Aerospace Engineering, Tsinghua University, Beijing 100084, ChinaSchool of Aerospace Engineering, Tsinghua University, Beijing 100084, ChinaSchool of Aerospace Engineering, Tsinghua University, Beijing 100084, ChinaWhen a lunar assisted robot helps an astronaut turn over or transports the astronaut from the ground, the trajectory of the robot’s dual arms should be automatically planned according to the unstructured environment on the lunar surface. In this paper, a dual-arm control strategy model of a lunar assisted robot based on hierarchical reinforcement learning is proposed, and the trajectory planning problem is modeled as a two-layer Markov decision process. In the training process, a reward function design method based on the idea of the artificial potential field method is proposed, and the reward information is fed back in a dense reward method, which significantly reduces the invalid exploration space and improves the learning efficiency. Large-scale tests are carried out in both simulated and physical environments, and the results demonstrate the effectiveness of the method proposed in this paper. This research is of great significance in respect of human–robot interaction, environmental interaction, and intelligent control of robots.https://www.mdpi.com/2226-4310/9/6/315lunar assisted robothierarchical reinforcement learningdual-arm controlreward functions
spellingShingle	Weiyan Ren Dapeng Han Zhaokui Wang Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment Aerospace lunar assisted robot hierarchical reinforcement learning dual-arm control reward functions
title	Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment
title_full	Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment
title_fullStr	Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment
title_full_unstemmed	Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment
title_short	Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment
title_sort	research on dual arm control of lunar assisted robot based on hierarchical reinforcement learning under unstructured environment
topic	lunar assisted robot hierarchical reinforcement learning dual-arm control reward functions
url	https://www.mdpi.com/2226-4310/9/6/315
work_keys_str_mv	AT weiyanren researchondualarmcontroloflunarassistedrobotbasedonhierarchicalreinforcementlearningunderunstructuredenvironment AT dapenghan researchondualarmcontroloflunarassistedrobotbasedonhierarchicalreinforcementlearningunderunstructuredenvironment AT zhaokuiwang researchondualarmcontroloflunarassistedrobotbasedonhierarchicalreinforcementlearningunderunstructuredenvironment

Research on Dual-Arm Control of Lunar Assisted Robot Based on Hierarchical Reinforcement Learning under Unstructured Environment

Similar Items