Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big Data

<i>Introduction:</i> Recently, the tendency of artificial intelligence (AI) and big data use/applications has been rapidly expanding across the globe, improving people’s lifestyles with data-driven services (i.e., recommendations, smart healthcare, etc.). The synergy between AI and big d...

Full description

Bibliographic Details
Main Authors: Abdul Majeed, Safiullah Khan, Seong Oun Hwang
Format: Article
Language:English
Published: MDPI AG 2022-04-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/11/9/1449
_version_ 1797505103225159680
author Abdul Majeed
Safiullah Khan
Seong Oun Hwang
author_facet Abdul Majeed
Safiullah Khan
Seong Oun Hwang
author_sort Abdul Majeed
collection DOAJ
description <i>Introduction:</i> Recently, the tendency of artificial intelligence (AI) and big data use/applications has been rapidly expanding across the globe, improving people’s lifestyles with data-driven services (i.e., recommendations, smart healthcare, etc.). The synergy between AI and big data has become imperative considering the drastic growth in personal data stemming from diverse sources (cloud computing, IoT, social networks, etc.). However, when data meet AI at some central place, it invites unimaginable privacy issues, and one of those issues is group privacy. Despite being the most significant problem, group privacy has not yet received the attention of the research community it is due. <i>Problem Statement:</i> We study how to preserve the privacy of particular groups (a community of people with some common attributes/properties) rather than an individual in personal data handling (i.e., sharing, aggregating, and/or performing analytics, etc.), especially when we talk about groups purposely made by two or more people (with clear group identifying markers), for whom we need to protect their privacy as a group. <i>Aims/Objectives:</i> With this technical letter, our aim is to introduce a new dimension of privacy (e.g., group privacy) from technical perspectives to the research community. The main objective is to advocate the possibility of group privacy breaches when big data meet AI in real-world scenarios. <i>Methodology:</i> We set a hypothesis that group privacy (extracting group-level information) is a genuine problem, and can likely occur when AI-based techniques meet high dimensional and large-scale datasets. To prove our hypothesis, we conducted a substantial number of experiments on two real-world benchmark datasets using AI techniques. Based on the experimental analysis, we found that the likelihood of privacy breaches occurring at the group level by using AI techniques is very high when data are sufficiently large. Apart from that, we tested the parameter effect of AI techniques and found that some parameters’ combinations can help to extract more and fine-grained data about groups. <i>Findings:</i> Based on experimental analysis, we found that vulnerability of group privacy can likely increase with the data size and capacity of the AI method. We found that some attributes of people can act as catalysts in compromising group privacy. We suggest that group privacy should also be given due attention as individual privacy is, and robust tools are imperative to restrict implications (i.e., biased decision making, denial of accommodation, hate speech, etc.) of group privacy. <i>Significance of results:</i> The obtained results are the first step towards responsible data science, and can pave the way to understanding the phenomenon of group privacy. Furthermore, the results contribute towards the protection of motives/goals/practices of minor communities in any society. <i>Concluding statement:</i> Due to the significant rise in digitation, privacy issues are mutating themselves. Hence, it is vital to quickly pinpoint emerging privacy threats and suggest practical remedies for them in order to mitigate their consequences on human beings.
first_indexed 2024-03-10T04:13:49Z
format Article
id doaj.art-2829b546740d43eeae0c54141dbfc9c4
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-10T04:13:49Z
publishDate 2022-04-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-2829b546740d43eeae0c54141dbfc9c42023-11-23T08:03:46ZengMDPI AGElectronics2079-92922022-04-01119144910.3390/electronics11091449Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big DataAbdul Majeed0Safiullah Khan1Seong Oun Hwang2Department of Computer Engineering, Gachon University, Seongnam 13120, KoreaDepartment of IT Convergence Engineering, Gachon University, Seongnam 13120, KoreaDepartment of Computer Engineering, Gachon University, Seongnam 13120, Korea<i>Introduction:</i> Recently, the tendency of artificial intelligence (AI) and big data use/applications has been rapidly expanding across the globe, improving people’s lifestyles with data-driven services (i.e., recommendations, smart healthcare, etc.). The synergy between AI and big data has become imperative considering the drastic growth in personal data stemming from diverse sources (cloud computing, IoT, social networks, etc.). However, when data meet AI at some central place, it invites unimaginable privacy issues, and one of those issues is group privacy. Despite being the most significant problem, group privacy has not yet received the attention of the research community it is due. <i>Problem Statement:</i> We study how to preserve the privacy of particular groups (a community of people with some common attributes/properties) rather than an individual in personal data handling (i.e., sharing, aggregating, and/or performing analytics, etc.), especially when we talk about groups purposely made by two or more people (with clear group identifying markers), for whom we need to protect their privacy as a group. <i>Aims/Objectives:</i> With this technical letter, our aim is to introduce a new dimension of privacy (e.g., group privacy) from technical perspectives to the research community. The main objective is to advocate the possibility of group privacy breaches when big data meet AI in real-world scenarios. <i>Methodology:</i> We set a hypothesis that group privacy (extracting group-level information) is a genuine problem, and can likely occur when AI-based techniques meet high dimensional and large-scale datasets. To prove our hypothesis, we conducted a substantial number of experiments on two real-world benchmark datasets using AI techniques. Based on the experimental analysis, we found that the likelihood of privacy breaches occurring at the group level by using AI techniques is very high when data are sufficiently large. Apart from that, we tested the parameter effect of AI techniques and found that some parameters’ combinations can help to extract more and fine-grained data about groups. <i>Findings:</i> Based on experimental analysis, we found that vulnerability of group privacy can likely increase with the data size and capacity of the AI method. We found that some attributes of people can act as catalysts in compromising group privacy. We suggest that group privacy should also be given due attention as individual privacy is, and robust tools are imperative to restrict implications (i.e., biased decision making, denial of accommodation, hate speech, etc.) of group privacy. <i>Significance of results:</i> The obtained results are the first step towards responsible data science, and can pave the way to understanding the phenomenon of group privacy. Furthermore, the results contribute towards the protection of motives/goals/practices of minor communities in any society. <i>Concluding statement:</i> Due to the significant rise in digitation, privacy issues are mutating themselves. Hence, it is vital to quickly pinpoint emerging privacy threats and suggest practical remedies for them in order to mitigate their consequences on human beings.https://www.mdpi.com/2079-9292/11/9/1449group privacyartificial intelligencebig dataanalyticsprivacy-preserving data publishingutility
spellingShingle Abdul Majeed
Safiullah Khan
Seong Oun Hwang
Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big Data
Electronics
group privacy
artificial intelligence
big data
analytics
privacy-preserving data publishing
utility
title Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big Data
title_full Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big Data
title_fullStr Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big Data
title_full_unstemmed Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big Data
title_short Group Privacy: An Underrated but Worth Studying Research Problem in the Era of Artificial Intelligence and Big Data
title_sort group privacy an underrated but worth studying research problem in the era of artificial intelligence and big data
topic group privacy
artificial intelligence
big data
analytics
privacy-preserving data publishing
utility
url https://www.mdpi.com/2079-9292/11/9/1449
work_keys_str_mv AT abdulmajeed groupprivacyanunderratedbutworthstudyingresearchproblemintheeraofartificialintelligenceandbigdata
AT safiullahkhan groupprivacyanunderratedbutworthstudyingresearchproblemintheeraofartificialintelligenceandbigdata
AT seongounhwang groupprivacyanunderratedbutworthstudyingresearchproblemintheeraofartificialintelligenceandbigdata