Deep Reinforcement Learning for Drone Delivery

Drones are expected to be used extensively for delivery tasks in the future. In the absence of obstacles, satellite based navigation from departure to the geo-located destination is a simple task. When obstacles are known to be in the path, pilots must build a flight plan to avoid them. However, whe...

Full description

Bibliographic Details
Main Authors: Guillem Muñoz, Cristina Barrado, Ender Çetin, Esther Salami
Format: Article
Language:English
Published: MDPI AG 2019-09-01
Series:Drones
Subjects:
Online Access:https://www.mdpi.com/2504-446X/3/3/72
_version_ 1818161926396444672
author Guillem Muñoz
Cristina Barrado
Ender Çetin
Esther Salami
author_facet Guillem Muñoz
Cristina Barrado
Ender Çetin
Esther Salami
author_sort Guillem Muñoz
collection DOAJ
description Drones are expected to be used extensively for delivery tasks in the future. In the absence of obstacles, satellite based navigation from departure to the geo-located destination is a simple task. When obstacles are known to be in the path, pilots must build a flight plan to avoid them. However, when they are unknown, there are too many or they are in places that are not fixed positions, then to build a safe flight plan becomes very challenging. Moreover, in a weak satellite signal environment, such as indoors, under trees canopy or in urban canyons, the current drone navigation systems may fail. Artificial intelligence, a research area with increasing activity, can be used to overcome such challenges. Initially focused on robots and now mostly applied to ground vehicles, artificial intelligence begins to be used also to train drones. Reinforcement learning is the branch of artificial intelligence able to train machines. The application of reinforcement learning to drones will provide them with more intelligence, eventually converting drones in fully-autonomous machines. In this work, reinforcement learning is studied for drone delivery. As sensors, the drone only has a stereo-vision front camera, from which depth information is obtained. The drone is trained to fly to a destination in a neighborhood environment that has plenty of obstacles such as trees, cables, cars and houses. The flying area is also delimited by a geo-fence; this is a virtual (non-visible) fence that prevents the drone from entering or leaving a defined area. The drone has to avoid visible obstacles and has to reach a goal. Results show that, in comparison with the previous results, the new algorithms have better results, not only with a better reward, but also with a reduction of its variance. The second contribution is the checkpoints. They consist of saving a trained model every time a better reward is achieved. Results show how checkpoints improve the test results.
first_indexed 2024-12-11T16:25:32Z
format Article
id doaj.art-78e7fa2e8fde49f4b25b948f606e86de
institution Directory Open Access Journal
issn 2504-446X
language English
last_indexed 2024-12-11T16:25:32Z
publishDate 2019-09-01
publisher MDPI AG
record_format Article
series Drones
spelling doaj.art-78e7fa2e8fde49f4b25b948f606e86de2022-12-22T00:58:44ZengMDPI AGDrones2504-446X2019-09-01337210.3390/drones3030072drones3030072Deep Reinforcement Learning for Drone DeliveryGuillem Muñoz0Cristina Barrado1Ender Çetin2Esther Salami3Computer Architecture Department, UPC BarcelonaTECH, Esteve Terrades 7, 08860 Castelldefels, SpainComputer Architecture Department, UPC BarcelonaTECH, Esteve Terrades 7, 08860 Castelldefels, SpainComputer Architecture Department, UPC BarcelonaTECH, Esteve Terrades 7, 08860 Castelldefels, SpainComputer Architecture Department, UPC BarcelonaTECH, Esteve Terrades 7, 08860 Castelldefels, SpainDrones are expected to be used extensively for delivery tasks in the future. In the absence of obstacles, satellite based navigation from departure to the geo-located destination is a simple task. When obstacles are known to be in the path, pilots must build a flight plan to avoid them. However, when they are unknown, there are too many or they are in places that are not fixed positions, then to build a safe flight plan becomes very challenging. Moreover, in a weak satellite signal environment, such as indoors, under trees canopy or in urban canyons, the current drone navigation systems may fail. Artificial intelligence, a research area with increasing activity, can be used to overcome such challenges. Initially focused on robots and now mostly applied to ground vehicles, artificial intelligence begins to be used also to train drones. Reinforcement learning is the branch of artificial intelligence able to train machines. The application of reinforcement learning to drones will provide them with more intelligence, eventually converting drones in fully-autonomous machines. In this work, reinforcement learning is studied for drone delivery. As sensors, the drone only has a stereo-vision front camera, from which depth information is obtained. The drone is trained to fly to a destination in a neighborhood environment that has plenty of obstacles such as trees, cables, cars and houses. The flying area is also delimited by a geo-fence; this is a virtual (non-visible) fence that prevents the drone from entering or leaving a defined area. The drone has to avoid visible obstacles and has to reach a goal. Results show that, in comparison with the previous results, the new algorithms have better results, not only with a better reward, but also with a reduction of its variance. The second contribution is the checkpoints. They consist of saving a trained model every time a better reward is achieved. Results show how checkpoints improve the test results.https://www.mdpi.com/2504-446X/3/3/72dronesdeep learningreinforcement learningQ-learningDQN, JNN
spellingShingle Guillem Muñoz
Cristina Barrado
Ender Çetin
Esther Salami
Deep Reinforcement Learning for Drone Delivery
Drones
drones
deep learning
reinforcement learning
Q-learning
DQN, JNN
title Deep Reinforcement Learning for Drone Delivery
title_full Deep Reinforcement Learning for Drone Delivery
title_fullStr Deep Reinforcement Learning for Drone Delivery
title_full_unstemmed Deep Reinforcement Learning for Drone Delivery
title_short Deep Reinforcement Learning for Drone Delivery
title_sort deep reinforcement learning for drone delivery
topic drones
deep learning
reinforcement learning
Q-learning
DQN, JNN
url https://www.mdpi.com/2504-446X/3/3/72
work_keys_str_mv AT guillemmunoz deepreinforcementlearningfordronedelivery
AT cristinabarrado deepreinforcementlearningfordronedelivery
AT endercetin deepreinforcementlearningfordronedelivery
AT esthersalami deepreinforcementlearningfordronedelivery