Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation
Online food delivery services today are considered an essential service that gets significant attention worldwide. Many companies and individuals are involved in this field as it offers good income and numerous jobs to the community. In this research, we consider the problem of online food delivery...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-11-01
|
Series: | Information |
Subjects: | |
Online Access: | https://www.mdpi.com/2078-2489/14/11/597 |
_version_ | 1797458920775614464 |
---|---|
author | Suleiman Abahussein Dayong Ye Congcong Zhu Zishuo Cheng Umer Siddique Sheng Shen |
author_facet | Suleiman Abahussein Dayong Ye Congcong Zhu Zishuo Cheng Umer Siddique Sheng Shen |
author_sort | Suleiman Abahussein |
collection | DOAJ |
description | Online food delivery services today are considered an essential service that gets significant attention worldwide. Many companies and individuals are involved in this field as it offers good income and numerous jobs to the community. In this research, we consider the problem of online food delivery services and how we can increase the number of received orders by couriers and thereby increase their income. Multi-agent reinforcement learning (MARL) is employed to guide the couriers to areas with high demand for food delivery requests. A map of the city is divided into small grids, and each grid represents a small area of the city that has different demand for online food delivery orders. The MARL agent trains and learns which grid has the highest demand and then selects it. Thus, couriers can get more food delivery orders and thereby increase long-term income. While increasing the number of received orders is important, protecting customer location is also essential. Therefore, the Protect User Location Method (PULM) is proposed in this research in order to protect customer location information. The PULM injects differential privacy (DP) Laplace noise based on two parameters: city area size and customer frequency of online food delivery orders. We use two datasets—Shenzhen, China, and Iowa, USA—to demonstrate the results of our experiments. The results show an increase in the number of received orders in the Shenzhen and Iowa City datasets. We also show the similarity and data utility of courier trajectories after we use our obfuscation (PULM) method. |
first_indexed | 2024-03-09T16:44:04Z |
format | Article |
id | doaj.art-7c679db56151419d87780f1d1c1c9200 |
institution | Directory Open Access Journal |
issn | 2078-2489 |
language | English |
last_indexed | 2024-03-09T16:44:04Z |
publishDate | 2023-11-01 |
publisher | MDPI AG |
record_format | Article |
series | Information |
spelling | doaj.art-7c679db56151419d87780f1d1c1c92002023-11-24T14:48:13ZengMDPI AGInformation2078-24892023-11-01141159710.3390/info14110597Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy PreservationSuleiman Abahussein0Dayong Ye1Congcong Zhu2Zishuo Cheng3Umer Siddique4Sheng Shen5Computer Science School, The University of Technology Sydney, Sydney, NSW 2007, AustraliaComputer Science School, The University of Technology Sydney, Sydney, NSW 2007, AustraliaFaculty of Data Science, City University of Macau, Macau 999078, ChinaSchool of Electrical and Information Engineering, The University of Sydney, Camperdown, NSW 2050, AustraliaDepartment of Computer Science, University of Texas at San Antonio, San Antonio, TX 78249, USASchool of Electrical and Information Engineering, The University of Sydney, Camperdown, NSW 2050, AustraliaOnline food delivery services today are considered an essential service that gets significant attention worldwide. Many companies and individuals are involved in this field as it offers good income and numerous jobs to the community. In this research, we consider the problem of online food delivery services and how we can increase the number of received orders by couriers and thereby increase their income. Multi-agent reinforcement learning (MARL) is employed to guide the couriers to areas with high demand for food delivery requests. A map of the city is divided into small grids, and each grid represents a small area of the city that has different demand for online food delivery orders. The MARL agent trains and learns which grid has the highest demand and then selects it. Thus, couriers can get more food delivery orders and thereby increase long-term income. While increasing the number of received orders is important, protecting customer location is also essential. Therefore, the Protect User Location Method (PULM) is proposed in this research in order to protect customer location information. The PULM injects differential privacy (DP) Laplace noise based on two parameters: city area size and customer frequency of online food delivery orders. We use two datasets—Shenzhen, China, and Iowa, USA—to demonstrate the results of our experiments. The results show an increase in the number of received orders in the Shenzhen and Iowa City datasets. We also show the similarity and data utility of courier trajectories after we use our obfuscation (PULM) method.https://www.mdpi.com/2078-2489/14/11/597privacydifferential privacyonline food deliverydeep reinforcement learningmulti-agent reinforcement learning |
spellingShingle | Suleiman Abahussein Dayong Ye Congcong Zhu Zishuo Cheng Umer Siddique Sheng Shen Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation Information privacy differential privacy online food delivery deep reinforcement learning multi-agent reinforcement learning |
title | Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation |
title_full | Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation |
title_fullStr | Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation |
title_full_unstemmed | Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation |
title_short | Multi-Agent Reinforcement Learning for Online Food Delivery with Location Privacy Preservation |
title_sort | multi agent reinforcement learning for online food delivery with location privacy preservation |
topic | privacy differential privacy online food delivery deep reinforcement learning multi-agent reinforcement learning |
url | https://www.mdpi.com/2078-2489/14/11/597 |
work_keys_str_mv | AT suleimanabahussein multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation AT dayongye multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation AT congcongzhu multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation AT zishuocheng multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation AT umersiddique multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation AT shengshen multiagentreinforcementlearningforonlinefooddeliverywithlocationprivacypreservation |