Oblivious Statistic Collection With Local Differential Privacy in Mutual Distrust

Location data is valuable for various applications such as epidemiology, natural disasters, and urban planning but causes exposure of sensitive information, e.g., home or work place, from collected data in a datastore. Local Differential Privacy (LDP)-based data collection is a promising technology...

Full description

Bibliographic Details
Main Authors: Taisho Sasada, Yuzo Taenaka, Youki Kadobayashi
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10057407/
_version_ 1811158492171468800
author Taisho Sasada
Yuzo Taenaka
Youki Kadobayashi
author_facet Taisho Sasada
Yuzo Taenaka
Youki Kadobayashi
author_sort Taisho Sasada
collection DOAJ
description Location data is valuable for various applications such as epidemiology, natural disasters, and urban planning but causes exposure of sensitive information, e.g., home or work place, from collected data in a datastore. Local Differential Privacy (LDP)-based data collection is a promising technology to protect sensitive information. A mobile device modify data to make each piece of data indistinguishable from others but keep its intrinsic value for statistical characteristics in data. Although LDP fundamentally protects the privacy exposure from a data store, a datastore suffer a shortcomings on it; as a datastore can never validate the modified data due to concealed raw data, that allows anyone to tamper with one’s data or inject any amount of data, and thus manipulate the statistics of the whole data in a datastore, called data poisoning attack. As a device does not disclose raw data and a datastore cannot collaborate to validate data with a device who may be an adversary on this mutual distrust relationship, data collection needs an ability to avoid the effect of data poisoning.. The cause of data poisoning is the direct relationship between data volume and statistic; the more data a device sends gives more statistical changes on merged data in a datastore. In this paper, we propose to decouple statistical characteristics from data volumes on LDP-based data collection process to minimize the effect of poisoned data on a datastore. We utilize Oblivious Transfer (OT) protocol to retrieve only statistic characteristics of receiving data at a datastore. As OT protocol inevitably strengthen privacy protection on LDP-based data collection and accordingly drops statistic characteristics of data, We adjust LDP processing to collaboratively work with OT protocol. The proposed adjustment method adapts the protection strength of LDP to OT protocol behavior so that a data store receives data containing sufficient statistical characteristics. We conduct qualitative and experimental overhead analysis and show that our method decouples the relationship between statistical characteristics from data volume. Our experimental result also prove that the overhead can be acceptable on devices such as smartphones and IoT.
first_indexed 2024-04-10T05:25:37Z
format Article
id doaj.art-415c15ac7df44872a89e0b81a7116f55
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-10T05:25:37Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-415c15ac7df44872a89e0b81a7116f552023-03-08T00:00:38ZengIEEEIEEE Access2169-35362023-01-0111213742138610.1109/ACCESS.2023.325156010057407Oblivious Statistic Collection With Local Differential Privacy in Mutual DistrustTaisho Sasada0https://orcid.org/0000-0003-2144-4949Yuzo Taenaka1Youki Kadobayashi2Graduate School of Science and Technology, Nara Institute Science and Technology, Ikoma, JapanGraduate School of Science and Technology, Nara Institute Science and Technology, Ikoma, JapanGraduate School of Science and Technology, Nara Institute Science and Technology, Ikoma, JapanLocation data is valuable for various applications such as epidemiology, natural disasters, and urban planning but causes exposure of sensitive information, e.g., home or work place, from collected data in a datastore. Local Differential Privacy (LDP)-based data collection is a promising technology to protect sensitive information. A mobile device modify data to make each piece of data indistinguishable from others but keep its intrinsic value for statistical characteristics in data. Although LDP fundamentally protects the privacy exposure from a data store, a datastore suffer a shortcomings on it; as a datastore can never validate the modified data due to concealed raw data, that allows anyone to tamper with one’s data or inject any amount of data, and thus manipulate the statistics of the whole data in a datastore, called data poisoning attack. As a device does not disclose raw data and a datastore cannot collaborate to validate data with a device who may be an adversary on this mutual distrust relationship, data collection needs an ability to avoid the effect of data poisoning.. The cause of data poisoning is the direct relationship between data volume and statistic; the more data a device sends gives more statistical changes on merged data in a datastore. In this paper, we propose to decouple statistical characteristics from data volumes on LDP-based data collection process to minimize the effect of poisoned data on a datastore. We utilize Oblivious Transfer (OT) protocol to retrieve only statistic characteristics of receiving data at a datastore. As OT protocol inevitably strengthen privacy protection on LDP-based data collection and accordingly drops statistic characteristics of data, We adjust LDP processing to collaboratively work with OT protocol. The proposed adjustment method adapts the protection strength of LDP to OT protocol behavior so that a data store receives data containing sufficient statistical characteristics. We conduct qualitative and experimental overhead analysis and show that our method decouples the relationship between statistical characteristics from data volume. Our experimental result also prove that the overhead can be acceptable on devices such as smartphones and IoT.https://ieeexplore.ieee.org/document/10057407/Local differential privacyoblivious transfer protocollocation dataprivacy-preserving data miningdata security
spellingShingle Taisho Sasada
Yuzo Taenaka
Youki Kadobayashi
Oblivious Statistic Collection With Local Differential Privacy in Mutual Distrust
IEEE Access
Local differential privacy
oblivious transfer protocol
location data
privacy-preserving data mining
data security
title Oblivious Statistic Collection With Local Differential Privacy in Mutual Distrust
title_full Oblivious Statistic Collection With Local Differential Privacy in Mutual Distrust
title_fullStr Oblivious Statistic Collection With Local Differential Privacy in Mutual Distrust
title_full_unstemmed Oblivious Statistic Collection With Local Differential Privacy in Mutual Distrust
title_short Oblivious Statistic Collection With Local Differential Privacy in Mutual Distrust
title_sort oblivious statistic collection with local differential privacy in mutual distrust
topic Local differential privacy
oblivious transfer protocol
location data
privacy-preserving data mining
data security
url https://ieeexplore.ieee.org/document/10057407/
work_keys_str_mv AT taishosasada obliviousstatisticcollectionwithlocaldifferentialprivacyinmutualdistrust
AT yuzotaenaka obliviousstatisticcollectionwithlocaldifferentialprivacyinmutualdistrust
AT youkikadobayashi obliviousstatisticcollectionwithlocaldifferentialprivacyinmutualdistrust