Research on game strategy of spacecraft chase and escape based on adaptive augmented random search

To solve the problem of the survival differential policy interception between a spacecraft and a non-cooperative target pursuit game, the pursuit game policy is studied based on reinforcement learning, and the adaptive-augmented random search algorithm is proposed. Firstly, to solve the sparse rewar...

Full description

Bibliographic Details
Main Authors:	JIAO Jie, GOU Yongjie, WU Wenbo, PAN Binfeng
Format:	Article
Language:	zho
Published:	EDP Sciences 2024-02-01
Series:	Xibei Gongye Daxue Xuebao
Subjects:	非合作目标追逃博弈微分对策强化学习稀疏奖励
Online Access:	https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html

_version_	1797221500409872384
author	JIAO Jie GOU Yongjie WU Wenbo PAN Binfeng
author_facet	JIAO Jie GOU Yongjie WU Wenbo PAN Binfeng
author_sort	JIAO Jie
collection	DOAJ
description	To solve the problem of the survival differential policy interception between a spacecraft and a non-cooperative target pursuit game, the pursuit game policy is studied based on reinforcement learning, and the adaptive-augmented random search algorithm is proposed. Firstly, to solve the sparse reward problem of sequential decision making, an exploration method based on the spatial perturbation of parameters of the policy is designed, thus accelerating its convergence speed. Secondly, to avoid the possibility of falling into local optimum prematurely, a novelty degree function is designed to guide the policy update, enhancing the efficiency of data utilization. Finally, the effectiveness and advancement of the exploration method are verified with numerical simulations and compared with those of the augmented random search algorithm, the proximal policy optimization algorithm and the deep deterministic policy gradient algorithm.
first_indexed	2024-04-24T13:06:25Z
format	Article
id	doaj.art-85ea93878bcc4b82b5dd2bc5e1c38cf2
institution	Directory Open Access Journal
issn	1000-2758 2609-7125
language	zho
last_indexed	2024-04-24T13:06:25Z
publishDate	2024-02-01
publisher	EDP Sciences
record_format	Article
series	Xibei Gongye Daxue Xuebao
spelling	doaj.art-85ea93878bcc4b82b5dd2bc5e1c38cf22024-04-05T07:31:28ZzhoEDP SciencesXibei Gongye Daxue Xuebao1000-27582609-71252024-02-0142111712810.1051/jnwpu/20244210117jnwpu2024421p117Research on game strategy of spacecraft chase and escape based on adaptive augmented random searchJIAO Jie0GOU Yongjie1WU Wenbo2PAN Binfeng3School of Astronautics, Northwestern Polytechnical UniversityShanghai Aerospace Systems Engineering InstituteSchool of Astronautics, Northwestern Polytechnical UniversitySchool of Astronautics, Northwestern Polytechnical UniversityTo solve the problem of the survival differential policy interception between a spacecraft and a non-cooperative target pursuit game, the pursuit game policy is studied based on reinforcement learning, and the adaptive-augmented random search algorithm is proposed. Firstly, to solve the sparse reward problem of sequential decision making, an exploration method based on the spatial perturbation of parameters of the policy is designed, thus accelerating its convergence speed. Secondly, to avoid the possibility of falling into local optimum prematurely, a novelty degree function is designed to guide the policy update, enhancing the efficiency of data utilization. Finally, the effectiveness and advancement of the exploration method are verified with numerical simulations and compared with those of the augmented random search algorithm, the proximal policy optimization algorithm and the deep deterministic policy gradient algorithm.https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html非合作目标追逃博弈微分对策强化学习稀疏奖励
spellingShingle	JIAO Jie GOU Yongjie WU Wenbo PAN Binfeng Research on game strategy of spacecraft chase and escape based on adaptive augmented random search Xibei Gongye Daxue Xuebao 非合作目标追逃博弈微分对策强化学习稀疏奖励
title	Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
title_full	Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
title_fullStr	Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
title_full_unstemmed	Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
title_short	Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
title_sort	research on game strategy of spacecraft chase and escape based on adaptive augmented random search
topic	非合作目标追逃博弈微分对策强化学习稀疏奖励
url	https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html
work_keys_str_mv	AT jiaojie researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch AT gouyongjie researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch AT wuwenbo researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch AT panbinfeng researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch

Research on game strategy of spacecraft chase and escape based on adaptive augmented random search

Similar Items