Data and image storage on synthetic DNA: existing solutions and challenges

Abstract Storage of digital data is becoming challenging for humanity due to the relatively short life-span of storage devices. Furthermore, the exponential increase in the generation of digital data is creating the need for constantly constructing new resources to handle the storage of this data vo...

Full description

Bibliographic Details
Main Authors: Melpomeni Dimopoulou, Marc Antonini
Format: Article
Language:English
Published: SpringerOpen 2022-10-01
Series:EURASIP Journal on Image and Video Processing
Subjects:
Online Access:https://doi.org/10.1186/s13640-022-00600-x
_version_ 1811257148627222528
author Melpomeni Dimopoulou
Marc Antonini
author_facet Melpomeni Dimopoulou
Marc Antonini
author_sort Melpomeni Dimopoulou
collection DOAJ
description Abstract Storage of digital data is becoming challenging for humanity due to the relatively short life-span of storage devices. Furthermore, the exponential increase in the generation of digital data is creating the need for constantly constructing new resources to handle the storage of this data volume. Recent studies suggest the use of the DNA molecule as a promising novel candidate which can hold 500 Gbyte/mm $$^3$$ 3 (1000 times more than HDD drives). Any digital information can be synthesized into DNA in vitro and stored in special tiny storage capsules that can promise reliability for hundreds of years. The stored DNA sequence can be retrieved whenever needed using special machines that are called sequencers. This whole process is very challenging, as the process of DNA synthesis is expensive in terms of money and sequencing is prone to errors. However, studies have shown that when respecting several rules in the encoding, the probability of sequencing error is reduced. Consequently, the encoding of digital information is not trivial, and the input data need to be efficiently compressed before encoding so that the high synthesis cost is reduced. In this paper, we present a survey on the storage of digital data in synthetic DNA, explaining the problem which is tackled by this novel field of study, present the main processes included in the storage workflow as well as the history of different studies and the most well-known algorithms that have been proposed in the bibliography on DNA data storage.
first_indexed 2024-04-12T17:51:54Z
format Article
id doaj.art-81402d9186974f918c1fd12251cc2db9
institution Directory Open Access Journal
issn 1687-5281
language English
last_indexed 2024-04-12T17:51:54Z
publishDate 2022-10-01
publisher SpringerOpen
record_format Article
series EURASIP Journal on Image and Video Processing
spelling doaj.art-81402d9186974f918c1fd12251cc2db92022-12-22T03:22:29ZengSpringerOpenEURASIP Journal on Image and Video Processing1687-52812022-10-012022111910.1186/s13640-022-00600-xData and image storage on synthetic DNA: existing solutions and challengesMelpomeni Dimopoulou0Marc Antonini1Laboratoire d’Informatique, Signaux et Systèmes de Sophia Antipolis (I3S), UMR 7271, Université Côte d’Azur, CNRSLaboratoire d’Informatique, Signaux et Systèmes de Sophia Antipolis (I3S), UMR 7271, Université Côte d’Azur, CNRSAbstract Storage of digital data is becoming challenging for humanity due to the relatively short life-span of storage devices. Furthermore, the exponential increase in the generation of digital data is creating the need for constantly constructing new resources to handle the storage of this data volume. Recent studies suggest the use of the DNA molecule as a promising novel candidate which can hold 500 Gbyte/mm $$^3$$ 3 (1000 times more than HDD drives). Any digital information can be synthesized into DNA in vitro and stored in special tiny storage capsules that can promise reliability for hundreds of years. The stored DNA sequence can be retrieved whenever needed using special machines that are called sequencers. This whole process is very challenging, as the process of DNA synthesis is expensive in terms of money and sequencing is prone to errors. However, studies have shown that when respecting several rules in the encoding, the probability of sequencing error is reduced. Consequently, the encoding of digital information is not trivial, and the input data need to be efficiently compressed before encoding so that the high synthesis cost is reduced. In this paper, we present a survey on the storage of digital data in synthetic DNA, explaining the problem which is tackled by this novel field of study, present the main processes included in the storage workflow as well as the history of different studies and the most well-known algorithms that have been proposed in the bibliography on DNA data storage.https://doi.org/10.1186/s13640-022-00600-xDNA data storageRobust encodingQuaternary code
spellingShingle Melpomeni Dimopoulou
Marc Antonini
Data and image storage on synthetic DNA: existing solutions and challenges
EURASIP Journal on Image and Video Processing
DNA data storage
Robust encoding
Quaternary code
title Data and image storage on synthetic DNA: existing solutions and challenges
title_full Data and image storage on synthetic DNA: existing solutions and challenges
title_fullStr Data and image storage on synthetic DNA: existing solutions and challenges
title_full_unstemmed Data and image storage on synthetic DNA: existing solutions and challenges
title_short Data and image storage on synthetic DNA: existing solutions and challenges
title_sort data and image storage on synthetic dna existing solutions and challenges
topic DNA data storage
Robust encoding
Quaternary code
url https://doi.org/10.1186/s13640-022-00600-x
work_keys_str_mv AT melpomenidimopoulou dataandimagestorageonsyntheticdnaexistingsolutionsandchallenges
AT marcantonini dataandimagestorageonsyntheticdnaexistingsolutionsandchallenges