Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement Learning

Soaring birds can use thermal updrafts in natural environments to fly for long periods or distances. The flight strategy of soaring birds can be implemented to gliders to increase their flight time. Currently, studies on soaring flight strategies focus on the turbulent nature of updrafts while negle...

Full description

Bibliographic Details
Main Authors: Yunxiang Cui, De Yan, Zhiqiang Wan
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Aerospace
Subjects:
Online Access:https://www.mdpi.com/2226-4310/10/10/834
_version_ 1797575078991364096
author Yunxiang Cui
De Yan
Zhiqiang Wan
author_facet Yunxiang Cui
De Yan
Zhiqiang Wan
author_sort Yunxiang Cui
collection DOAJ
description Soaring birds can use thermal updrafts in natural environments to fly for long periods or distances. The flight strategy of soaring birds can be implemented to gliders to increase their flight time. Currently, studies on soaring flight strategies focus on the turbulent nature of updrafts while neglecting the random characteristics of its generation and disappearance. In addition, most flight strategies only focus on utilizing updrafts while neglecting how to explore it. Therefore, in this paper, a complete flight strategy that seeks and uses random location thermal updrafts is mainly emphasized and developed. Moreover, through the derivation of flight dynamics and related formulas, the principle of gliders acquiring energy from thermal updrafts is explained through energy concepts. This concept lays a theoretical foundation for research on soaring flight strategies. Furthermore, the method of reinforcement learning is adopted, and a perception strategy suitable for gliders that considers the vertical ground speed, vertical ground speed change rate, heading angle, and heading angle change as the main perception factors is developed. Meanwhile, an area exploring strategy was trained by reinforcement learning, and the two strategies were combined into a complete flight strategy that seeks and uses updrafts. Finally, based on the guidance of the soaring strategy, the flight of the glider in the simulation environment is tested. The soaring strategy is verified to significantly improve the flight time lengths of gliders.
first_indexed 2024-03-10T21:31:14Z
format Article
id doaj.art-57ebf6456c334f0091da18cf880cb7db
institution Directory Open Access Journal
issn 2226-4310
language English
last_indexed 2024-03-10T21:31:14Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Aerospace
spelling doaj.art-57ebf6456c334f0091da18cf880cb7db2023-11-19T15:16:48ZengMDPI AGAerospace2226-43102023-09-01101083410.3390/aerospace10100834Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement LearningYunxiang Cui0De Yan1Zhiqiang Wan2School of Aeronautic Science and Engineering, Beihang University, Beijing 100191, ChinaSchool of Aeronautic Science and Engineering, Beihang University, Beijing 100191, ChinaSchool of Aeronautic Science and Engineering, Beihang University, Beijing 100191, ChinaSoaring birds can use thermal updrafts in natural environments to fly for long periods or distances. The flight strategy of soaring birds can be implemented to gliders to increase their flight time. Currently, studies on soaring flight strategies focus on the turbulent nature of updrafts while neglecting the random characteristics of its generation and disappearance. In addition, most flight strategies only focus on utilizing updrafts while neglecting how to explore it. Therefore, in this paper, a complete flight strategy that seeks and uses random location thermal updrafts is mainly emphasized and developed. Moreover, through the derivation of flight dynamics and related formulas, the principle of gliders acquiring energy from thermal updrafts is explained through energy concepts. This concept lays a theoretical foundation for research on soaring flight strategies. Furthermore, the method of reinforcement learning is adopted, and a perception strategy suitable for gliders that considers the vertical ground speed, vertical ground speed change rate, heading angle, and heading angle change as the main perception factors is developed. Meanwhile, an area exploring strategy was trained by reinforcement learning, and the two strategies were combined into a complete flight strategy that seeks and uses updrafts. Finally, based on the guidance of the soaring strategy, the flight of the glider in the simulation environment is tested. The soaring strategy is verified to significantly improve the flight time lengths of gliders.https://www.mdpi.com/2226-4310/10/10/834soaring strategythermal updraftreinforcement learninggliderlong-endurance
spellingShingle Yunxiang Cui
De Yan
Zhiqiang Wan
Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement Learning
Aerospace
soaring strategy
thermal updraft
reinforcement learning
glider
long-endurance
title Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement Learning
title_full Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement Learning
title_fullStr Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement Learning
title_full_unstemmed Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement Learning
title_short Study on the Glider Soaring Strategy in Random Location Thermal Updraft via Reinforcement Learning
title_sort study on the glider soaring strategy in random location thermal updraft via reinforcement learning
topic soaring strategy
thermal updraft
reinforcement learning
glider
long-endurance
url https://www.mdpi.com/2226-4310/10/10/834
work_keys_str_mv AT yunxiangcui studyontheglidersoaringstrategyinrandomlocationthermalupdraftviareinforcementlearning
AT deyan studyontheglidersoaringstrategyinrandomlocationthermalupdraftviareinforcementlearning
AT zhiqiangwan studyontheglidersoaringstrategyinrandomlocationthermalupdraftviareinforcementlearning