Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control

In this paper, we present a novel guidance scheme based on model-based deep reinforcement learning (RL) technique. With model-based deep RL method, a deep neural network is trained as a predictive model of guidance dynamics which is incorporated into a model predictive path integral (MPPI) control f...

Full description

Bibliographic Details
Main Authors: Chen Liang, Weihong Wang, Zhenghua Liu, Chao Lai, Benchun Zhou
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8682051/
_version_ 1819276091229995008
author Chen Liang
Weihong Wang
Zhenghua Liu
Chao Lai
Benchun Zhou
author_facet Chen Liang
Weihong Wang
Zhenghua Liu
Chao Lai
Benchun Zhou
author_sort Chen Liang
collection DOAJ
description In this paper, we present a novel guidance scheme based on model-based deep reinforcement learning (RL) technique. With model-based deep RL method, a deep neural network is trained as a predictive model of guidance dynamics which is incorporated into a model predictive path integral (MPPI) control framework. However, the traditional MPPI framework assumes the actual environment similar to the training dataset for the deep neural network which is impractical in practice with different maneuvering of the target, other perturbations, and actuator failures. To address this problem, our method utilizes meta-learning technique to make the deep neural dynamics model adapt to such changes online. With this approach, we can alleviate the performance deterioration of standard MPPI control caused by the difference between the actual environment and training data. Then, a novel guidance law for a varying velocity interceptor intercepting maneuvering target with desired terminal impact angle under actuator failure is constructed based on the aforementioned techniques. The simulation and experiment results under different cases show the effectiveness and robustness of the proposed guidance law in achieving successful interceptions of maneuvering target.
first_indexed 2024-12-23T23:34:42Z
format Article
id doaj.art-b1f68edf997e48c4a102079545bb0894
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-23T23:34:42Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-b1f68edf997e48c4a102079545bb08942022-12-21T17:25:54ZengIEEEIEEE Access2169-35362019-01-017473534736510.1109/ACCESS.2019.29095798682051Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral ControlChen Liang0https://orcid.org/0000-0002-1991-7586Weihong Wang1Zhenghua Liu2Chao Lai3https://orcid.org/0000-0003-2758-4125Benchun Zhou4School of Automation Science and Electrical Engineering, Beihang University, Beijing, ChinaSchool of Automation Science and Electrical Engineering, Beihang University, Beijing, ChinaSchool of Automation Science and Electrical Engineering, Beihang University, Beijing, ChinaNavigation and Control Technology Research Institute, China North Industries Group Corporation, Beijing, ChinaSchool of Automation Science and Electrical Engineering, Beihang University, Beijing, ChinaIn this paper, we present a novel guidance scheme based on model-based deep reinforcement learning (RL) technique. With model-based deep RL method, a deep neural network is trained as a predictive model of guidance dynamics which is incorporated into a model predictive path integral (MPPI) control framework. However, the traditional MPPI framework assumes the actual environment similar to the training dataset for the deep neural network which is impractical in practice with different maneuvering of the target, other perturbations, and actuator failures. To address this problem, our method utilizes meta-learning technique to make the deep neural dynamics model adapt to such changes online. With this approach, we can alleviate the performance deterioration of standard MPPI control caused by the difference between the actual environment and training data. Then, a novel guidance law for a varying velocity interceptor intercepting maneuvering target with desired terminal impact angle under actuator failure is constructed based on the aforementioned techniques. The simulation and experiment results under different cases show the effectiveness and robustness of the proposed guidance law in achieving successful interceptions of maneuvering target.https://ieeexplore.ieee.org/document/8682051/Missile guidancemodel predictive controlmeta-learningdeep reinforcement learningimpact angle constraint
spellingShingle Chen Liang
Weihong Wang
Zhenghua Liu
Chao Lai
Benchun Zhou
Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
IEEE Access
Missile guidance
model predictive control
meta-learning
deep reinforcement learning
impact angle constraint
title Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
title_full Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
title_fullStr Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
title_full_unstemmed Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
title_short Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
title_sort learning to guide guidance law based on deep meta learning and model predictive path integral control
topic Missile guidance
model predictive control
meta-learning
deep reinforcement learning
impact angle constraint
url https://ieeexplore.ieee.org/document/8682051/
work_keys_str_mv AT chenliang learningtoguideguidancelawbasedondeepmetalearningandmodelpredictivepathintegralcontrol
AT weihongwang learningtoguideguidancelawbasedondeepmetalearningandmodelpredictivepathintegralcontrol
AT zhenghualiu learningtoguideguidancelawbasedondeepmetalearningandmodelpredictivepathintegralcontrol
AT chaolai learningtoguideguidancelawbasedondeepmetalearningandmodelpredictivepathintegralcontrol
AT benchunzhou learningtoguideguidancelawbasedondeepmetalearningandmodelpredictivepathintegralcontrol