The Use of PDF/A in Digital Archives: A Case Study from Archaeology

In recent years the Portable Document Format (PDF) has become a ubiquitous format in the exchange of documents; in 2005 the PDF/A profile was defined in order to meet long term accessibility needs, and has accordingly come to be regarded as a long-term archiving strategy for PDF files. In the field...

Full description

Bibliographic Details
Main Authors: Tim N. L. Evans, Ray H. Moore
Format: Article
Language:English
Published: University of Edinburgh 2014-10-01
Series:International Journal of Digital Curation
Online Access:http://129.215.67.233/ijdc/article/view/267
_version_ 1797435006780440576
author Tim N. L. Evans
Ray H. Moore
author_facet Tim N. L. Evans
Ray H. Moore
author_sort Tim N. L. Evans
collection DOAJ
description In recent years the Portable Document Format (PDF) has become a ubiquitous format in the exchange of documents; in 2005 the PDF/A profile was defined in order to meet long term accessibility needs, and has accordingly come to be regarded as a long-term archiving strategy for PDF files. In the field of archaeology, a growing number of PDF files – containing the detailed results of fieldwork and research – are beginning to be deposited with digital archives such as the Archaeology Data Service (ADS). In the ADS’ experience, the use of PDF/A has had benefits as well as drawbacks: the majority of PDF reports are now in a standard format better suited to longer-term access, however migrating to PDF/A and managing and ensuring reuse of these files is intensive, and fraught with potential pitfalls. Of these, perhaps the most serious has been an unreliability in PDF/A conformance by the wide range of tools and software now available. There are also practical and more theoretical implications for reuse which, as our discipline of archaeology alongside so many others rapidly becomes digitized, presents us with a large corpus of ‘data’ that is human readable, but may not be amenable to machine-based technologies such as NLP. It may be argued that these factors effectively undermine some of the perceived cost benefit of moving from paper to digital, as well as the longer-term sustainability of PDF/A within digital archives.
first_indexed 2024-03-09T10:42:02Z
format Article
id doaj.art-e121516e34d04630beb65420b8ab535f
institution Directory Open Access Journal
issn 1746-8256
language English
last_indexed 2024-03-09T10:42:02Z
publishDate 2014-10-01
publisher University of Edinburgh
record_format Article
series International Journal of Digital Curation
spelling doaj.art-e121516e34d04630beb65420b8ab535f2023-12-01T14:17:40ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562014-10-0192The Use of PDF/A in Digital Archives: A Case Study from ArchaeologyTim N. L. EvansRay H. Moore In recent years the Portable Document Format (PDF) has become a ubiquitous format in the exchange of documents; in 2005 the PDF/A profile was defined in order to meet long term accessibility needs, and has accordingly come to be regarded as a long-term archiving strategy for PDF files. In the field of archaeology, a growing number of PDF files – containing the detailed results of fieldwork and research – are beginning to be deposited with digital archives such as the Archaeology Data Service (ADS). In the ADS’ experience, the use of PDF/A has had benefits as well as drawbacks: the majority of PDF reports are now in a standard format better suited to longer-term access, however migrating to PDF/A and managing and ensuring reuse of these files is intensive, and fraught with potential pitfalls. Of these, perhaps the most serious has been an unreliability in PDF/A conformance by the wide range of tools and software now available. There are also practical and more theoretical implications for reuse which, as our discipline of archaeology alongside so many others rapidly becomes digitized, presents us with a large corpus of ‘data’ that is human readable, but may not be amenable to machine-based technologies such as NLP. It may be argued that these factors effectively undermine some of the perceived cost benefit of moving from paper to digital, as well as the longer-term sustainability of PDF/A within digital archives. http://129.215.67.233/ijdc/article/view/267
spellingShingle Tim N. L. Evans
Ray H. Moore
The Use of PDF/A in Digital Archives: A Case Study from Archaeology
International Journal of Digital Curation
title The Use of PDF/A in Digital Archives: A Case Study from Archaeology
title_full The Use of PDF/A in Digital Archives: A Case Study from Archaeology
title_fullStr The Use of PDF/A in Digital Archives: A Case Study from Archaeology
title_full_unstemmed The Use of PDF/A in Digital Archives: A Case Study from Archaeology
title_short The Use of PDF/A in Digital Archives: A Case Study from Archaeology
title_sort use of pdf a in digital archives a case study from archaeology
url http://129.215.67.233/ijdc/article/view/267
work_keys_str_mv AT timnlevans theuseofpdfaindigitalarchivesacasestudyfromarchaeology
AT rayhmoore theuseofpdfaindigitalarchivesacasestudyfromarchaeology
AT timnlevans useofpdfaindigitalarchivesacasestudyfromarchaeology
AT rayhmoore useofpdfaindigitalarchivesacasestudyfromarchaeology