Digital Forensics Formats: Seeking a Digital Preservation Storage Container Format for Web Archiving

In this paper we discuss archival storage container formats from the point of view of digital curation and preservation, an aspect of preservation overlooked by most other studies. Considering established approaches to data management as our jumping-off point, we selected seven container format attributes that are core to the long-term accessibility of digital materials.


Bibliographic Details
Main Authors: Yunhyong Kim, Seamus Ross
Format: Article
Language: English
Published: University of Edinburgh 2012-10-01
Series: International Journal of Digital Curation
Online Access: http://129.215.67.233:80/ijdc/article/view/227
collection DOAJ
description In this paper we discuss archival storage container formats from the point of view of digital curation and preservation, an aspect of preservation overlooked by most other studies. Considering established approaches to data management as our jumping-off point, we selected seven container format attributes that are core to the long-term accessibility of digital materials. We have labeled these core preservation attributes. These attributes are then used as evaluation criteria to compare storage container formats belonging to five common categories: formats for archiving selected content (e.g. tar, WARC), disk image formats that capture data for recovery or installation (e.g. partimage, dd raw image), these two types combined with a selected compression algorithm (e.g. tar+gzip), formats that combine packing and compression (e.g. 7-zip), and forensic file formats for data analysis in criminal investigations (e.g. aff, the Advanced Forensic File format). We present a general discussion of the storage container format landscape in terms of these attributes, and make a direct comparison between the three most promising archival formats: tar, WARC, and aff. We conclude by suggesting the next steps to take the research forward and to validate the observations we have made.
id doaj.art-40856765e99c47b9abe9b2a97dad107d
institution Directory Open Access Journal
issn 1746-8256
spelling doaj.art-40856765e99c47b9abe9b2a97dad107d; 2023-12-06T20:02:44Z; eng; University of Edinburgh; International Journal of Digital Curation; 1746-8256; 2012-10-01; 7; 2; Digital Forensics Formats: Seeking a Digital Preservation Storage Container Format for Web Archiving; Yunhyong Kim; Seamus Ross; http://129.215.67.233:80/ijdc/article/view/227