Research on game strategy of spacecraft chase and escape based on adaptive augmented random search
To solve the problem of the survival differential policy interception between a spacecraft and a non-cooperative target pursuit game, the pursuit game policy is studied based on reinforcement learning, and the adaptive-augmented random search algorithm is proposed. Firstly, to solve the sparse rewar...
Main Authors: | JIAO Jie, GOU Yongjie, WU Wenbo, PAN Binfeng |
---|---|
Format: | Article |
Language: | zho |
Published: | EDP Sciences, 2024-02-01 |
Series: | Xibei Gongye Daxue Xuebao |
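Subjects: | non-cooperative target; pursuit-evasion game; differential game; reinforcement learning; sparse reward |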
Online Access: | https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html |
_version_ | 1797221500409872384 |
---|---|
author | JIAO Jie GOU Yongjie WU Wenbo PAN Binfeng |
author_facet | JIAO Jie GOU Yongjie WU Wenbo PAN Binfeng |
author_sort | JIAO Jie |
collection | DOAJ |
description | To address the survival-type differential game interception problem in a pursuit-evasion game between a spacecraft and a non-cooperative target, the pursuit-evasion strategy is studied using reinforcement learning, and an adaptive augmented random search algorithm is proposed. First, to address the sparse reward problem in sequential decision making, an exploration method based on perturbations in the policy parameter space is designed, which accelerates convergence. Second, to avoid premature convergence to a local optimum, a novelty function is designed to guide the policy update, which improves data efficiency. Finally, numerical simulations verify the effectiveness and advantages of the exploration method and compare it with the augmented random search algorithm, the proximal policy optimization algorithm, and the deep deterministic policy gradient algorithm. |
first_indexed | 2024-04-24T13:06:25Z |
format | Article |
id | doaj.art-85ea93878bcc4b82b5dd2bc5e1c38cf2 |
institution | Directory Open Access Journal |
issn | 1000-2758 2609-7125 |
language | zho |
last_indexed | 2024-04-24T13:06:25Z |
publishDate | 2024-02-01 |
publisher | EDP Sciences |
record_format | Article |
series | Xibei Gongye Daxue Xuebao |
spelling | doaj.art-85ea93878bcc4b82b5dd2bc5e1c38cf2; 2024-04-05T07:31:28Z; zho; EDP Sciences; Xibei Gongye Daxue Xuebao; ISSN 1000-2758, 2609-7125; 2024-02-01; vol. 42, no. 1, pp. 117-128; DOI 10.1051/jnwpu/20244210117; jnwpu2024421p117; Research on game strategy of spacecraft chase and escape based on adaptive augmented random search; JIAO Jie (School of Astronautics, Northwestern Polytechnical University), GOU Yongjie (Shanghai Aerospace Systems Engineering Institute), WU Wenbo (School of Astronautics, Northwestern Polytechnical University), PAN Binfeng (School of Astronautics, Northwestern Polytechnical University); https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html; non-cooperative target; pursuit-evasion game; differential game; reinforcement learning; sparse reward |
title | Research on game strategy of spacecraft chase and escape based on adaptive augmented random search |
topic | non-cooperative target; pursuit-evasion game; differential game; reinforcement learning; sparse reward |
url | https://www.jnwpu.org/articles/jnwpu/full_html/2024/01/jnwpu2024421p117/jnwpu2024421p117.html |
work_keys_str_mv | AT jiaojie researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch AT gouyongjie researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch AT wuwenbo researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch AT panbinfeng researchongamestrategyofspacecraftchaseandescapebasedonadaptiveaugmentedrandomsearch |
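
The description above names two ingredients: exploration by perturbing the policy parameters, and a novelty function that guides the policy update when rewards are sparse. The record does not give the authors' formulas, so the following is only a minimal sketch of the general idea, not the published algorithm. It combines a basic augmented-random-search update with a nearest-neighbour novelty bonus. The toy planar pursuit task, the function names (`rollout`, `novelty`, `train`), the choice of final relative position as behaviour descriptor, and every hyperparameter are assumptions made for illustration.

```python
import numpy as np

def rollout(theta, horizon=50, dt=0.1, capture_radius=0.2):
    """Toy planar pursuit (assumed task): returns (sparse reward, final relative position)."""
    rel = np.array([2.0, 1.0])          # evader position minus pursuer position
    evader_vel = np.array([0.3, 0.0])   # evader drifts at constant velocity
    for _ in range(horizon):
        u = np.clip(theta @ rel, -1.0, 1.0)   # linear policy, bounded pursuer velocity
        rel = rel + (evader_vel - u) * dt
        if np.linalg.norm(rel) < capture_radius:
            return 1.0, rel                   # reward is given only on capture
    return 0.0, rel

def novelty(descriptor, archive, k=5):
    """Mean distance from a behaviour descriptor to its k nearest archive entries."""
    if len(archive) == 0:
        return 0.0
    dists = np.linalg.norm(np.asarray(archive) - descriptor, axis=1)
    return float(np.sort(dists)[:k].mean())

def train(iterations=30, n_dirs=8, top_b=4, step=0.5, noise=0.3, w_novel=0.5, seed=0):
    """Random search in parameter space, scored by a reward/novelty mix (assumed weighting)."""
    rng = np.random.default_rng(seed)
    theta = np.zeros((2, 2))
    archive = []                                   # descriptors seen in earlier iterations
    for _ in range(iterations):
        deltas = rng.standard_normal((n_dirs, 2, 2))
        scores = np.zeros((n_dirs, 2))             # column 0: theta + delta, column 1: theta - delta
        new_desc = []
        for i, d in enumerate(deltas):
            for j, sign in enumerate((1.0, -1.0)):
                r, desc = rollout(theta + sign * noise * d)
                scores[i, j] = (1 - w_novel) * r + w_novel * novelty(desc, archive)
                new_desc.append(desc)
        archive.extend(new_desc)
        best = np.argsort(scores.max(axis=1))[-top_b:]   # keep the top-b directions
        sigma = scores[best].std() + 1e-8                # scale step by score spread
        grad = sum((scores[i, 0] - scores[i, 1]) * deltas[i] for i in best)
        theta = theta + step / (top_b * sigma) * grad
    return theta
```

Calling `train()` returns a 2x2 gain matrix for the linear policy; `rollout(train())` then reports whether that policy captures the evader in the toy task. When no perturbed policy earns any reward, the novelty term still distinguishes the perturbation directions, which is the motivation for using it under sparse rewards.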