Towards long double-stranded chains and robust DNA-based data storage using the random code system

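DNA has become a popular choice for next-generation storage media due to its high storage density and stability. As the storage medium of life’s information, DNA has significant storage capacity and low-cost, low-power replication and transcription capabilities. However, utilizing long double-stranded DNA for storage can introduce unstable factors that make it difficult to meet the constraints of biological systems. To address this challenge, we have designed a highly robust coding scheme called the “random code system,” inspired by the idea of fountain codes. The random code system includes the establishment of a random matrix, Gaussian preprocessing, and random equilibrium. Compared to Luby transform codes (LT codes), the random code (RC) is more robust and better able to recover lost information. In biological experiments, we successfully stored 29,390 bits of data in 25,700 bp chains, achieving a storage density of 1.78 bits per nucleotide. These results demonstrate the potential of long double-stranded DNA and the random code system for robust DNA-based data storage.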

Bibliographic Details
Main Authors: Xu Yang, Xiaolong Shi, Langwen Lai, Congzhou Chen, Huaisheng Xu, Ming Deng
Format: Article
Language: English
Published: Frontiers Media S.A. 2023-06-01
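Series: Frontiers in Genetics
Subjects: DNA-based data storage; long double-stranded chains; random matrix; random equilibrium; highly robust; random code system
Online Access: https://www.frontiersin.org/articles/10.3389/fgene.2023.1179867/full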
collection DOAJ
description DNA has become a popular choice for next-generation storage media due to its high storage density and stability. As the storage medium of life’s information, DNA has significant storage capacity and low-cost, low-power replication and transcription capabilities. However, utilizing long double-stranded DNA for storage can introduce unstable factors that make it difficult to meet the constraints of biological systems. To address this challenge, we have designed a highly robust coding scheme called the “random code system,” inspired by the idea of fountain codes. The random code system includes the establishment of a random matrix, Gaussian preprocessing, and random equilibrium. Compared to Luby transform codes (LT codes), the random code (RC) is more robust and better able to recover lost information. In biological experiments, we successfully stored 29,390 bits of data in 25,700 bp chains, achieving a storage density of 1.78 bits per nucleotide. These results demonstrate the potential of long double-stranded DNA and the random code system for robust DNA-based data storage.
first_indexed 2024-03-13T05:57:09Z
format Article
id doaj.art-a106d4bb3a7d4502a53b1795de742142
institution Directory Open Access Journal
issn 1664-8021
language English
last_indexed 2024-03-13T05:57:09Z
publishDate 2023-06-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj.art-a106d4bb3a7d4502a53b1795de742142
2023-06-13T04:31:47Z
eng
Frontiers Media S.A.
Frontiers in Genetics
1664-8021
2023-06-01
14
10.3389/fgene.2023.1179867
1179867
Towards long double-stranded chains and robust DNA-based data storage using the random code system
Xu Yang (Institute of Computing Science and Technology, Guangzhou University, Guangzhou, China)
Xiaolong Shi (Institute of Computing Science and Technology, Guangzhou University, Guangzhou, China)
Langwen Lai (Institute of Computing Science and Technology, Guangzhou University, Guangzhou, China)
Congzhou Chen (College of Information Science and Technology, Beijing University of Chemical Technology, Beijing, China)
Huaisheng Xu (Institute of Computing Science and Technology, Guangzhou University, Guangzhou, China)
Ming Deng (Institute of Computing Science and Technology, Guangzhou University, Guangzhou, China)
topic DNA-based data storage
long double-stranded chains
random matrix
random equilibrium
highly robust
random code system
url https://www.frontiersin.org/articles/10.3389/fgene.2023.1179867/full
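
The description above says the random code system is inspired by fountain codes and involves a random matrix and Gaussian preprocessing, but this record gives no further detail. As a rough illustration of the general idea only, the sketch below implements a generic random linear fountain code over GF(2): each coded packet is the XOR of a random subset of source blocks (one random row of a binary matrix), and decoding is Gaussian elimination on the packets that survive. This is not the authors' scheme; the names `encode` and `decode`, the use of Python integers as data blocks, and the seed parameter are all assumptions made for the example, and the "random equilibrium" step is not modeled.

```python
import random


def encode(blocks, n_packets, seed=0):
    """Return n_packets coded packets as (mask, payload) pairs.

    Each mask is one random nonzero row of a binary matrix: bit i set
    means source block i was XORed into the payload. (Hypothetical
    helper, not taken from the paper.)
    """
    rng = random.Random(seed)
    k = len(blocks)
    packets = []
    for _ in range(n_packets):
        mask = rng.randrange(1, 1 << k)  # nonzero k-bit row
        payload = 0
        for i, block in enumerate(blocks):
            if (mask >> i) & 1:
                payload ^= block
        packets.append((mask, payload))
    return packets


def decode(packets, k):
    """Recover the k source blocks by Gaussian elimination over GF(2).

    Returns the list of blocks, or None when the surviving packets do
    not contain k linearly independent rows.
    """
    basis = {}  # leading bit index -> (mask, payload)
    for mask, payload in packets:
        # Reduce the row against the basis until it is new or zero.
        while mask:
            lead = mask.bit_length() - 1
            if lead not in basis:
                basis[lead] = (mask, payload)
                break
            b_mask, b_payload = basis[lead]
            mask ^= b_mask
            payload ^= b_payload
    if len(basis) < k:
        return None
    # Back-substitution: the row with leading bit i only involves
    # blocks 0..i, so solve in ascending order.
    blocks = [0] * k
    for lead in range(k):
        mask, payload = basis[lead]
        for j in range(lead):
            if (mask >> j) & 1:
                payload ^= blocks[j]
        blocks[lead] = payload
    return blocks


if __name__ == "__main__":
    source = [0x41, 0x43, 0x47, 0x54]
    # Pretend the first three packets were lost in storage or sequencing.
    received = encode(source, n_packets=12, seed=1)[3:]
    # Prints the source blocks, or None if too few independent packets survived.
    print(decode(received, k=len(source)))
```

The property this illustrates is the one the description attributes to fountain-style codes: it does not matter which packets are lost, only that enough linearly independent ones remain. How the published scheme builds its matrix and satisfies biological sequence constraints is covered in the full article linked in this record.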