Big Data Privacy Scenarios
This paper is the first in a series on privacy in Big Data. As an outgrowth of a series of workshops on the topic, the Big Data Privacy Working Group undertook a study of a series of use scenarios to highlight the challenges to privacy that arise in the Big Data arena. This is a report on those scen...
Main Authors: | , , , |
---|---|
Other Authors: | |
Published: |
2015
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/99127 |
_version_ | 1811095810985689088 |
---|---|
author | Bruce, Elizabeth Sollins, Karen Vernon, Mona Weitzner, Danny |
author2 | Karen Sollins |
author_facet | Karen Sollins Bruce, Elizabeth Sollins, Karen Vernon, Mona Weitzner, Danny |
author_sort | Bruce, Elizabeth |
collection | MIT |
description | This paper is the first in a series on privacy in Big Data. As an outgrowth of a series of workshops on the topic, the Big Data Privacy Working Group undertook a study of a series of use scenarios to highlight the challenges to privacy that arise in the Big Data arena. This is a report on those scenarios. The deeper question explored by this exercise is what is distinctive about privacy in the context of Big Data. In addition, we discuss an initial list of issues for privacy that derive specifically from the nature of Big Data. These derive from observations across the real world scenarios and use cases explored in this project as well as wider reading and discussions:* Scale: The sheer size of the datasets leads to challenges in creating, managing and applying privacy policies.* Diversity: The increased likelihood of more and more diverse participants in Big Data collection, management, and use, leads to differing agendas and objectives. By nature, this is likely to lead to contradictory agendas and objectives.* Integration: With increased data management technologies (e.g. cloud services, data lakes, and so forth), integration across datasets, with new and often surprising opportunities for cross-product inferences, will also come new information about individuals and their behaviors.* Impact on secondary participants: Because many pieces of information are reflective of not only the targeted subject, but secondary, often unattended, participants, the inferences and resulting information will increasingly be reflective of other people, not originally considered as the subject of privacy concerns and approaches.* Need for emergent policies for emergent information: As inferences over merged data sets occur, emergent information or understanding will occur. Although each unique data set may have existing privacy policies and enforcement mechanisms, it is not clear that it is possible to develop the requisite and appropriate emerged privacy policies and appropriate enforcement of them automatically. |
first_indexed | 2024-09-23T16:28:58Z |
id | mit-1721.1/99127 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T16:28:58Z |
publishDate | 2015 |
record_format | dspace |
spelling | mit-1721.1/991272019-04-11T03:22:11Z Big Data Privacy Scenarios Bruce, Elizabeth Sollins, Karen Vernon, Mona Weitzner, Danny Karen Sollins Danny Weitzner Advanced Network Architecture Decentralized Information Group Big Data Use scenarios Privacy This paper is the first in a series on privacy in Big Data. As an outgrowth of a series of workshops on the topic, the Big Data Privacy Working Group undertook a study of a series of use scenarios to highlight the challenges to privacy that arise in the Big Data arena. This is a report on those scenarios. The deeper question explored by this exercise is what is distinctive about privacy in the context of Big Data. In addition, we discuss an initial list of issues for privacy that derive specifically from the nature of Big Data. These derive from observations across the real world scenarios and use cases explored in this project as well as wider reading and discussions:* Scale: The sheer size of the datasets leads to challenges in creating, managing and applying privacy policies.* Diversity: The increased likelihood of more and more diverse participants in Big Data collection, management, and use, leads to differing agendas and objectives. By nature, this is likely to lead to contradictory agendas and objectives.* Integration: With increased data management technologies (e.g. cloud services, data lakes, and so forth), integration across datasets, with new and often surprising opportunities for cross-product inferences, will also come new information about individuals and their behaviors.* Impact on secondary participants: Because many pieces of information are reflective of not only the targeted subject, but secondary, often unattended, participants, the inferences and resulting information will increasingly be reflective of other people, not originally considered as the subject of privacy concerns and approaches.* Need for emergent policies for emergent information: As inferences over merged data sets occur, emergent information or understanding will occur. Although each unique data set may have existing privacy policies and enforcement mechanisms, it is not clear that it is possible to develop the requisite and appropriate emerged privacy policies and appropriate enforcement of them automatically. 2015-10-02T15:45:04Z 2015-10-02T15:45:04Z 2015-10-01 2015-10-02T15:45:04Z http://hdl.handle.net/1721.1/99127 MIT-CSAIL-TR-2015-030 Creative Commons Attribution-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nd/4.0/ 53 p. application/pdf |
spellingShingle | Big Data Use scenarios Privacy Bruce, Elizabeth Sollins, Karen Vernon, Mona Weitzner, Danny Big Data Privacy Scenarios |
title | Big Data Privacy Scenarios |
title_full | Big Data Privacy Scenarios |
title_fullStr | Big Data Privacy Scenarios |
title_full_unstemmed | Big Data Privacy Scenarios |
title_short | Big Data Privacy Scenarios |
title_sort | big data privacy scenarios |
topic | Big Data Use scenarios Privacy |
url | http://hdl.handle.net/1721.1/99127 |
work_keys_str_mv | AT bruceelizabeth bigdataprivacyscenarios AT sollinskaren bigdataprivacyscenarios AT vernonmona bigdataprivacyscenarios AT weitznerdanny bigdataprivacyscenarios |