Big Data Privacy Scenarios

This paper is the first in a series on privacy in Big Data. As an outgrowth of a series of workshops on the topic, the Big Data Privacy Working Group undertook a study of a series of use scenarios to highlight the challenges to privacy that arise in the Big Data arena. This is a report on those scen...


Bibliographic Details
Main Authors: Bruce, Elizabeth; Sollins, Karen; Vernon, Mona; Weitzner, Danny
Other Authors: Karen Sollins
Published: 2015
Subjects: Big Data; Use scenarios; Privacy
Online Access: http://hdl.handle.net/1721.1/99127
collection MIT
description This paper is the first in a series on privacy in Big Data. As an outgrowth of a series of workshops on the topic, the Big Data Privacy Working Group undertook a study of a series of use scenarios to highlight the challenges to privacy that arise in the Big Data arena. This is a report on those scenarios. The deeper question explored by this exercise is what is distinctive about privacy in the context of Big Data. In addition, we discuss an initial list of issues for privacy that derive specifically from the nature of Big Data. These derive from observations across the real world scenarios and use cases explored in this project as well as wider reading and discussions:
* Scale: The sheer size of the datasets leads to challenges in creating, managing and applying privacy policies.
* Diversity: The increased likelihood of more and more diverse participants in Big Data collection, management, and use, leads to differing agendas and objectives. By nature, this is likely to lead to contradictory agendas and objectives.
* Integration: With increased data management technologies (e.g. cloud services, data lakes, and so forth), integration across datasets, with new and often surprising opportunities for cross-product inferences, will also come new information about individuals and their behaviors.
* Impact on secondary participants: Because many pieces of information are reflective of not only the targeted subject, but secondary, often unattended, participants, the inferences and resulting information will increasingly be reflective of other people, not originally considered as the subject of privacy concerns and approaches.
* Need for emergent policies for emergent information: As inferences over merged data sets occur, emergent information or understanding will occur. Although each unique data set may have existing privacy policies and enforcement mechanisms, it is not clear that it is possible to develop the requisite and appropriate emerged privacy policies and appropriate enforcement of them automatically.
first_indexed 2024-09-23T16:28:58Z
id mit-1721.1/99127
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T16:28:58Z
publishDate 2015
record_format dspace
spelling mit-1721.1/99127; 2019-04-11T03:22:11Z; Big Data Privacy Scenarios; Bruce, Elizabeth; Sollins, Karen; Vernon, Mona; Weitzner, Danny; Karen Sollins; Danny Weitzner; Advanced Network Architecture; Decentralized Information Group; Big Data; Use scenarios; Privacy; 2015-10-02T15:45:04Z; 2015-10-01; http://hdl.handle.net/1721.1/99127; MIT-CSAIL-TR-2015-030; Creative Commons Attribution-NoDerivatives 4.0 International; http://creativecommons.org/licenses/by-nd/4.0/; 53 p.; application/pdf
title Big Data Privacy Scenarios
topic Big Data
Use scenarios
Privacy
url http://hdl.handle.net/1721.1/99127