Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation

Online food delivery services today are considered an essential service that gets significant attention worldwide. Many companies and individuals are involved in this field as it offers good income and numerous jobs to the community. In this research, we consider the problem of online food delivery...

Full description

Bibliographic Details
Main Authors: Suleiman Abahussein, Dayong Ye, Congcong Zhu, Zishuo Cheng, Umer Siddique, Sheng Shen
Format: Article
Language:English
Published: MDPI AG 2023-11-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/14/11/597
_version_ 1797458920775614464
author Suleiman Abahussein
Dayong Ye
Congcong Zhu
Zishuo Cheng
Umer Siddique
Sheng Shen
author_facet Suleiman Abahussein
Dayong Ye
Congcong Zhu
Zishuo Cheng
Umer Siddique
Sheng Shen
author_sort Suleiman Abahussein
collection DOAJ
description Online food delivery services today are considered an essential service that gets significant attention worldwide. Many companies and individuals are involved in this field as it offers good income and numerous jobs to the community. In this research, we consider the problem of online food delivery services and how we can increase the number of received orders by couriers and thereby increase their income. Multi-agent reinforcement learning (MARL) is employed to guide the couriers to areas with high demand for food delivery requests. A map of the city is divided into small grids, and each grid represents a small area of the city that has different demand for online food delivery orders. The MARL agent trains and learns which grid has the highest demand and then selects it. Thus, couriers can get more food delivery orders and thereby increase long-term income. While increasing the number of received orders is important, protecting customer location is also essential. Therefore, the Protect User Location Method (PULM) is proposed in this research in order to protect customer location information. The PULM injects differential privacy (DP) Laplace noise based on two parameters: city area size and customer frequency of online food delivery orders. We use two datasets—Shenzhen, China, and Iowa, USA—to demonstrate the results of our experiments. The results show an increase in the number of received orders in the Shenzhen and Iowa City datasets. We also show the similarity and data utility of courier trajectories after we use our obfuscation (PULM) method.
first_indexed 2024-03-09T16:44:04Z
format Article
id doaj.art-7c679db56151419d87780f1d1c1c9200
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-09T16:44:04Z
publishDate 2023-11-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-7c679db56151419d87780f1d1c1c92002023-11-24T14:48:13ZengMDPI AGInformation2078-24892023-11-01141159710.3390/info14110597Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy PreservationSuleiman Abahussein0Dayong Ye1Congcong Zhu2Zishuo Cheng3Umer Siddique4Sheng Shen5Computer Science School, The University of Technology Sydney, Sydney, NSW 2007, AustraliaComputer Science School, The University of Technology Sydney, Sydney, NSW 2007, AustraliaFaculty of Data Science, City University of Macau, Macau 999078, ChinaSchool of Electrical and Information Engineering, The University of Sydney, Camperdown, NSW 2050, AustraliaDepartment of Computer Science, University of Texas at San Antonio, San Antonio, TX 78249, USASchool of Electrical and Information Engineering, The University of Sydney, Camperdown, NSW 2050, AustraliaOnline food delivery services today are considered an essential service that gets significant attention worldwide. Many companies and individuals are involved in this field as it offers good income and numerous jobs to the community. In this research, we consider the problem of online food delivery services and how we can increase the number of received orders by couriers and thereby increase their income. Multi-agent reinforcement learning (MARL) is employed to guide the couriers to areas with high demand for food delivery requests. A map of the city is divided into small grids, and each grid represents a small area of the city that has different demand for online food delivery orders. The MARL agent trains and learns which grid has the highest demand and then selects it. Thus, couriers can get more food delivery orders and thereby increase long-term income. While increasing the number of received orders is important, protecting customer location is also essential. Therefore, the Protect User Location Method (PULM) is proposed in this research in order to protect customer location information. The PULM injects differential privacy (DP) Laplace noise based on two parameters: city area size and customer frequency of online food delivery orders. We use two datasets—Shenzhen, China, and Iowa, USA—to demonstrate the results of our experiments. The results show an increase in the number of received orders in the Shenzhen and Iowa City datasets. We also show the similarity and data utility of courier trajectories after we use our obfuscation (PULM) method.https://www.mdpi.com/2078-2489/14/11/597privacydifferential privacyonline food deliverydeep reinforcement learningmulti-agent reinforcement learning
spellingShingle Suleiman Abahussein
Dayong Ye
Congcong Zhu
Zishuo Cheng
Umer Siddique
Sheng Shen
Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation
Information
privacy
differential privacy
online food delivery
deep reinforcement learning
multi-agent reinforcement learning
title Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation
title_full Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation
title_fullStr Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation
title_full_unstemmed Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation
title_short Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation
title_sort multi agent reinforcement learning for online food delivery with location privacy preservation
topic privacy
differential privacy
online food delivery
deep reinforcement learning
multi-agent reinforcement learning
url https://www.mdpi.com/2078-2489/14/11/597
work_keys_str_mv AT suleimanabahussein multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation
AT dayongye multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation
AT congcongzhu multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation
AT zishuocheng multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation
AT umersiddique multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation
AT shengshen multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation