Research on game strategy of spacecraft chase and escape based on adaptive augmented random search

To address the interception problem in a pursuit-evasion game between a spacecraft and a non-cooperative target, formulated as a survival-type differential game, the pursuit strategy is studied using reinforcement learning, and an adaptive augmented random search algorithm is proposed. Firstly, to solve the sparse rewar...

Bibliographic Details
Main Authors: JIAO Jie, GOU Yongjie, WU Wenbo, PAN Binfeng
Format: Article
Language: Chinese (zho)
Published: EDP Sciences 2024-02-01
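Series: Xibei Gongye Daxue Xuebao
Subjects: non-cooperative target (非合作目标); pursuit-evasion game (追逃博弈); differential game (微分对策); reinforcement learning (强化学习); sparse reward (稀疏奖励)
Online Access: https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html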
_version_ 1797221500409872384
author JIAO Jie
GOU Yongjie
WU Wenbo
PAN Binfeng
collection DOAJ
description To address the interception problem in a pursuit-evasion game between a spacecraft and a non-cooperative target, formulated as a survival-type differential game, the pursuit strategy is studied using reinforcement learning, and an adaptive augmented random search algorithm is proposed. First, to address the sparse-reward problem in sequential decision making, an exploration method based on perturbing the policy in parameter space is designed, which accelerates convergence. Second, to avoid premature convergence to a local optimum, a novelty function is designed to guide policy updates, which improves data efficiency. Finally, numerical simulations verify the effectiveness and advantages of the proposed method in comparison with the augmented random search, proximal policy optimization, and deep deterministic policy gradient algorithms.
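The description names the augmented random search (ARS) algorithm as the baseline that the proposed method extends. As a rough illustration of what exploring by perturbing policy parameters means, the sketch below implements one basic ARS-style update in plain Python. It is not the authors' adaptive variant: the record does not specify the novelty function or the adaptive rule, so neither is shown. The names `ars_step`, `toy_reward`, and `train`, the quadratic toy objective, and all hyperparameter values are assumptions made for this example only.

```python
import random
import statistics


def ars_step(theta, reward_fn, rng, n_dirs=8, top_b=4, step_size=0.05, noise=0.1):
    """One basic ARS-style update (illustrative, not the paper's adaptive variant)."""
    candidates = []
    for _ in range(n_dirs):
        # Exploration happens in parameter space: perturb theta, not the actions.
        delta = [rng.gauss(0.0, 1.0) for _ in theta]
        r_plus = reward_fn([t + noise * d for t, d in zip(theta, delta)])
        r_minus = reward_fn([t - noise * d for t, d in zip(theta, delta)])
        candidates.append((r_plus, r_minus, delta))

    # Keep the top_b directions, ranked by the better of their two rewards.
    candidates.sort(key=lambda c: max(c[0], c[1]), reverse=True)
    best = candidates[:top_b]

    # Normalize the step by the standard deviation of the rewards that are used.
    rewards = [r for c in best for r in c[:2]]
    sigma = statistics.pstdev(rewards) or 1.0

    new_theta = list(theta)
    for r_plus, r_minus, delta in best:
        scale = step_size / (top_b * sigma) * (r_plus - r_minus)
        for i, d in enumerate(delta):
            new_theta[i] += scale * d
    return new_theta


def toy_reward(theta):
    """Hypothetical stand-in for an episode return: peak reward at a fixed target."""
    target = [1.0, -2.0, 0.5]
    return -sum((t - g) ** 2 for t, g in zip(theta, target))


def train(iterations=300, seed=0):
    """Run repeated ARS updates from a zero-initialized parameter vector."""
    rng = random.Random(seed)
    theta = [0.0, 0.0, 0.0]
    for _ in range(iterations):
        theta = ars_step(theta, toy_reward, rng)
    return theta


if __name__ == "__main__":
    final = train()
    print("final parameters:", final)
    print("final reward:", toy_reward(final))
```

In a real pursuit-evasion setting, `reward_fn` would run a simulated episode with the perturbed policy and return its total reward; the toy objective here only keeps the example self-contained.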
first_indexed 2024-04-24T13:06:25Z
format Article
id doaj.art-85ea93878bcc4b82b5dd2bc5e1c38cf2
institution Directory Open Access Journal
issn 1000-2758
2609-7125
language zho
last_indexed 2024-04-24T13:06:25Z
publishDate 2024-02-01
publisher EDP Sciences
record_format Article
series Xibei Gongye Daxue Xuebao
spelling doaj.art-85ea93878bcc4b82b5dd2bc5e1c38cf2
2024-04-05T07:31:28Z
zho
EDP Sciences
Xibei Gongye Daxue Xuebao
1000-2758
2609-7125
2024-02-01
Vol. 42, No. 1, pp. 117-128
10.1051/jnwpu/20244210117
jnwpu2024421p117
Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
JIAO Jie (School of Astronautics, Northwestern Polytechnical University)
GOU Yongjie (Shanghai Aerospace Systems Engineering Institute)
WU Wenbo (School of Astronautics, Northwestern Polytechnical University)
PAN Binfeng (School of Astronautics, Northwestern Polytechnical University)
https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html
non-cooperative target (非合作目标)
pursuit-evasion game (追逃博弈)
differential game (微分对策)
reinforcement learning (强化学习)
sparse reward (稀疏奖励)
title Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
title_sort research on game strategy of spacecraft chase and escape based on adaptive augmented random search
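topic non-cooperative target (非合作目标)
pursuit-evasion game (追逃博弈)
differential game (微分对策)
reinforcement learning (强化学习)
sparse reward (稀疏奖励)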
url https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html