Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum

Collaborative autonomous landing of a quadrotor Unmanned Aerial Vehicle (UAV) on a moving Unmanned Ground Vehicle (UGV) presents challenges due to the need for accurate real-time tracking of the UGV and the adjustment for the landing policy. To address this challenge, we propose a progressive learni...

Full description

Bibliographic Details
Main Authors: Chang Wang, Jiaqing Wang, Changyun Wei, Yi Zhu, Dong Yin, Jie Li
Format: Article
Language:English
Published: MDPI AG 2023-11-01
Series:Drones
Subjects:
Online Access:https://www.mdpi.com/2504-446X/7/11/676
_version_ 1797459510656237568
author Chang Wang
Jiaqing Wang
Changyun Wei
Yi Zhu
Dong Yin
Jie Li
author_facet Chang Wang
Jiaqing Wang
Changyun Wei
Yi Zhu
Dong Yin
Jie Li
author_sort Chang Wang
collection DOAJ
description Collaborative autonomous landing of a quadrotor Unmanned Aerial Vehicle (UAV) on a moving Unmanned Ground Vehicle (UGV) presents challenges due to the need for accurate real-time tracking of the UGV and the adjustment for the landing policy. To address this challenge, we propose a progressive learning framework for generating an optimal landing policy based on vision without the need of communication between the UAV and the UGV. First, we propose the Landing Vision System (LVS) to offer rapid localization and pose estimation of the UGV. Then, we design an Automatic Curriculum Learning (ACL) approach to learn the landing tasks under different conditions of UGV motions and wind interference. Specifically, we introduce a neural network-based difficulty discriminator to schedule the landing tasks according to their levels of difficulty. Our method achieves a higher landing success rate and accuracy compared with the state-of-the-art TD3 reinforcement learning algorithm.
first_indexed 2024-03-09T16:53:24Z
format Article
id doaj.art-0421c960b49c43bcb324651413e7d547
institution Directory Open Access Journal
issn 2504-446X
language English
last_indexed 2024-03-09T16:53:24Z
publishDate 2023-11-01
publisher MDPI AG
record_format Article
series Drones
spelling doaj.art-0421c960b49c43bcb324651413e7d5472023-11-24T14:38:12ZengMDPI AGDrones2504-446X2023-11-0171167610.3390/drones7110676Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic CurriculumChang Wang0Jiaqing Wang1Changyun Wei2Yi Zhu3Dong Yin4Jie Li5College of Intelligence Science and Technology, National University of Defense Technology, Changsha 410073, ChinaCollege of Mechanical and Electrical Engineering, Hohai University, Changzhou 213022, ChinaCollege of Mechanical and Electrical Engineering, Hohai University, Changzhou 213022, ChinaSchool of Computer Science, Nanjing Audit University, Nanjing 211800, ChinaCollege of Intelligence Science and Technology, National University of Defense Technology, Changsha 410073, ChinaCollege of Intelligence Science and Technology, National University of Defense Technology, Changsha 410073, ChinaCollaborative autonomous landing of a quadrotor Unmanned Aerial Vehicle (UAV) on a moving Unmanned Ground Vehicle (UGV) presents challenges due to the need for accurate real-time tracking of the UGV and the adjustment for the landing policy. To address this challenge, we propose a progressive learning framework for generating an optimal landing policy based on vision without the need of communication between the UAV and the UGV. First, we propose the Landing Vision System (LVS) to offer rapid localization and pose estimation of the UGV. Then, we design an Automatic Curriculum Learning (ACL) approach to learn the landing tasks under different conditions of UGV motions and wind interference. Specifically, we introduce a neural network-based difficulty discriminator to schedule the landing tasks according to their levels of difficulty. Our method achieves a higher landing success rate and accuracy compared with the state-of-the-art TD3 reinforcement learning algorithm.https://www.mdpi.com/2504-446X/7/11/676deep reinforcement learningautomatic curriculum learningUAV landing
spellingShingle Chang Wang
Jiaqing Wang
Changyun Wei
Yi Zhu
Dong Yin
Jie Li
Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum
Drones
deep reinforcement learning
automatic curriculum learning
UAV landing
title Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum
title_full Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum
title_fullStr Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum
title_full_unstemmed Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum
title_short Vision-Based Deep Reinforcement Learning of UAV-UGV Collaborative Landing Policy Using Automatic Curriculum
title_sort vision based deep reinforcement learning of uav ugv collaborative landing policy using automatic curriculum
topic deep reinforcement learning
automatic curriculum learning
UAV landing
url https://www.mdpi.com/2504-446X/7/11/676
work_keys_str_mv AT changwang visionbaseddeepreinforcementlearningofuavugvcollaborativelandingpolicyusingautomaticcurriculum
AT jiaqingwang visionbaseddeepreinforcementlearningofuavugvcollaborativelandingpolicyusingautomaticcurriculum
AT changyunwei visionbaseddeepreinforcementlearningofuavugvcollaborativelandingpolicyusingautomaticcurriculum
AT yizhu visionbaseddeepreinforcementlearningofuavugvcollaborativelandingpolicyusingautomaticcurriculum
AT dongyin visionbaseddeepreinforcementlearningofuavugvcollaborativelandingpolicyusingautomaticcurriculum
AT jieli visionbaseddeepreinforcementlearningofuavugvcollaborativelandingpolicyusingautomaticcurriculum