Set-valued data collection with local differential privacy based on category hierarchy

Set-valued data is extremely important and widely used in sensor technology and application. Recently, privacy protection for set-valued data under differential privacy (DP) has become a research hotspot. However, the DP model assumes that the data center is trustworthy, consequently, increasingly a...

Full description

Bibliographic Details
Main Authors: Jia Ouyang, Yinyin Xiao, Shaopeng Liu, Zhenghong Xiao, Xiuxiu Liao
Format: Article
Language:English
Published: AIMS Press 2021-04-01
Series:Mathematical Biosciences and Engineering
Subjects:
Online Access:http://www.aimspress.com/article/doi/10.3934/mbe.2021139?viewType=HTML
_version_ 1818925612591480832
author Jia Ouyang
Yinyin Xiao
Shaopeng Liu
Zhenghong Xiao
Xiuxiu Liao
author_facet Jia Ouyang
Yinyin Xiao
Shaopeng Liu
Zhenghong Xiao
Xiuxiu Liao
author_sort Jia Ouyang
collection DOAJ
description Set-valued data is extremely important and widely used in sensor technology and application. Recently, privacy protection for set-valued data under differential privacy (DP) has become a research hotspot. However, the DP model assumes that the data center is trustworthy, consequently, increasingly attention has been paid to the application of the local differential privacy model (LDP) for set-valued data. Constrained by the local differential privacy model, most methods randomly respond to the subset of set-valued data, and the data collector conducts statistics on the received data. There are two main problems with this kind of method: one is that the utility function used in the random response loses too much information; the other is that the privacy protection of the set-valued data category is usually ignored. To solve these problems, this paper proposes a set-valued data collection method (SetLDP) based on the category hierarchy under the local differential privacy model. The core idea is to first make a random response to the existence of the category, continue to disturb the item count if the category exists, and finally randomly respond to a candidate itemset based on the new utility function. Theory analysis and experimental results show that the SetLDP can not only preserve more information, but also protect the category private information in set-valued data.
first_indexed 2024-12-20T02:44:00Z
format Article
id doaj.art-aaf02af43ada472b943366eb5928f2c2
institution Directory Open Access Journal
issn 1551-0018
language English
last_indexed 2024-12-20T02:44:00Z
publishDate 2021-04-01
publisher AIMS Press
record_format Article
series Mathematical Biosciences and Engineering
spelling doaj.art-aaf02af43ada472b943366eb5928f2c22022-12-21T19:56:15ZengAIMS PressMathematical Biosciences and Engineering1551-00182021-04-011832733276310.3934/mbe.2021139Set-valued data collection with local differential privacy based on category hierarchyJia Ouyang0Yinyin Xiao1Shaopeng Liu2Zhenghong Xiao3Xiuxiu Liao 41. School of Cyber Security, Guangdong Polytechnic Normal University, Guangzhou 510665, China1. School of Cyber Security, Guangdong Polytechnic Normal University, Guangzhou 510665, China2. College of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665, China2. College of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665, China2. College of Computer Science, Guangdong Polytechnic Normal University, Guangzhou 510665, ChinaSet-valued data is extremely important and widely used in sensor technology and application. Recently, privacy protection for set-valued data under differential privacy (DP) has become a research hotspot. However, the DP model assumes that the data center is trustworthy, consequently, increasingly attention has been paid to the application of the local differential privacy model (LDP) for set-valued data. Constrained by the local differential privacy model, most methods randomly respond to the subset of set-valued data, and the data collector conducts statistics on the received data. There are two main problems with this kind of method: one is that the utility function used in the random response loses too much information; the other is that the privacy protection of the set-valued data category is usually ignored. To solve these problems, this paper proposes a set-valued data collection method (SetLDP) based on the category hierarchy under the local differential privacy model. The core idea is to first make a random response to the existence of the category, continue to disturb the item count if the category exists, and finally randomly respond to a candidate itemset based on the new utility function. Theory analysis and experimental results show that the SetLDP can not only preserve more information, but also protect the category private information in set-valued data.http://www.aimspress.com/article/doi/10.3934/mbe.2021139?viewType=HTMLprivacy preservationset-valued datalocal differential privacyutility functiondata collection
spellingShingle Jia Ouyang
Yinyin Xiao
Shaopeng Liu
Zhenghong Xiao
Xiuxiu Liao
Set-valued data collection with local differential privacy based on category hierarchy
Mathematical Biosciences and Engineering
privacy preservation
set-valued data
local differential privacy
utility function
data collection
title Set-valued data collection with local differential privacy based on category hierarchy
title_full Set-valued data collection with local differential privacy based on category hierarchy
title_fullStr Set-valued data collection with local differential privacy based on category hierarchy
title_full_unstemmed Set-valued data collection with local differential privacy based on category hierarchy
title_short Set-valued data collection with local differential privacy based on category hierarchy
title_sort set valued data collection with local differential privacy based on category hierarchy
topic privacy preservation
set-valued data
local differential privacy
utility function
data collection
url http://www.aimspress.com/article/doi/10.3934/mbe.2021139?viewType=HTML
work_keys_str_mv AT jiaouyang setvalueddatacollectionwithlocaldifferentialprivacybasedoncategoryhierarchy
AT yinyinxiao setvalueddatacollectionwithlocaldifferentialprivacybasedoncategoryhierarchy
AT shaopengliu setvalueddatacollectionwithlocaldifferentialprivacybasedoncategoryhierarchy
AT zhenghongxiao setvalueddatacollectionwithlocaldifferentialprivacybasedoncategoryhierarchy
AT xiuxiuliao setvalueddatacollectionwithlocaldifferentialprivacybasedoncategoryhierarchy