Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs

Ducted-fan tail-sitter unmanned aerial vehicles (UAVs) provide versatility and unique benefits, attracting significant attention in various applications. This study focuses on developing a safe reinforcement learning method for back-transition control between level flight mode and hover mode for duc...

Full description

Bibliographic Details
Main Authors:	Yanbo Fu, Wenjie Zhao, Liu Liu
Format:	Article
Language:	English
Published:	MDPI AG 2023-05-01
Series:	Drones
Subjects:	safe reinforcement learning ducted fan transition control unmanned aerial vehicle (UAV)
Online Access:	https://www.mdpi.com/2504-446X/7/5/332

_version_	1797600446909513728
author	Yanbo Fu Wenjie Zhao Liu Liu
author_facet	Yanbo Fu Wenjie Zhao Liu Liu
author_sort	Yanbo Fu
collection	DOAJ
description	Ducted-fan tail-sitter unmanned aerial vehicles (UAVs) provide versatility and unique benefits, attracting significant attention in various applications. This study focuses on developing a safe reinforcement learning method for back-transition control between level flight mode and hover mode for ducted-fan tail-sitter UAVs. Our method enables transition control with a minimal altitude change and transition time while adhering to the velocity constraint. We employ the Trust Region Policy Optimization, Proximal Policy Optimization with Lagrangian, and Constrained Policy Optimization (CPO) algorithms for controller training, showcasing the superiority of the CPO algorithm and the necessity of the velocity constraint. The transition trajectory achieved using the CPO algorithm closely resembles the optimal trajectory obtained via the well-known GPOPS-II software with the SNOPT solver. Meanwhile, the CPO algorithm also exhibits strong robustness under unknown perturbations of UAV model parameters and wind disturbance.
first_indexed	2024-03-11T03:48:15Z
format	Article
id	doaj.art-f933d9b8812d4c11b1dbcdb7211e366c
institution	Directory Open Access Journal
issn	2504-446X
language	English
last_indexed	2024-03-11T03:48:15Z
publishDate	2023-05-01
publisher	MDPI AG
record_format	Article
series	Drones
spelling	doaj.art-f933d9b8812d4c11b1dbcdb7211e366c2023-11-18T01:07:27ZengMDPI AGDrones2504-446X2023-05-017533210.3390/drones7050332Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVsYanbo Fu0Wenjie Zhao1Liu Liu2School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, ChinaSchool of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, ChinaSchool of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, ChinaDucted-fan tail-sitter unmanned aerial vehicles (UAVs) provide versatility and unique benefits, attracting significant attention in various applications. This study focuses on developing a safe reinforcement learning method for back-transition control between level flight mode and hover mode for ducted-fan tail-sitter UAVs. Our method enables transition control with a minimal altitude change and transition time while adhering to the velocity constraint. We employ the Trust Region Policy Optimization, Proximal Policy Optimization with Lagrangian, and Constrained Policy Optimization (CPO) algorithms for controller training, showcasing the superiority of the CPO algorithm and the necessity of the velocity constraint. The transition trajectory achieved using the CPO algorithm closely resembles the optimal trajectory obtained via the well-known GPOPS-II software with the SNOPT solver. Meanwhile, the CPO algorithm also exhibits strong robustness under unknown perturbations of UAV model parameters and wind disturbance.https://www.mdpi.com/2504-446X/7/5/332safe reinforcement learningducted fantransition controlunmanned aerial vehicle (UAV)
spellingShingle	Yanbo Fu Wenjie Zhao Liu Liu Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs Drones safe reinforcement learning ducted fan transition control unmanned aerial vehicle (UAV)
title	Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs
title_full	Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs
title_fullStr	Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs
title_full_unstemmed	Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs
title_short	Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs
title_sort	safe reinforcement learning for transition control of ducted fan uavs
topic	safe reinforcement learning ducted fan transition control unmanned aerial vehicle (UAV)
url	https://www.mdpi.com/2504-446X/7/5/332
work_keys_str_mv	AT yanbofu safereinforcementlearningfortransitioncontrolofductedfanuavs AT wenjiezhao safereinforcementlearningfortransitioncontrolofductedfanuavs AT liuliu safereinforcementlearningfortransitioncontrolofductedfanuavs

Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs

Similar Items