Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing

Abstract The storage of data in DNA typically involves encoding and synthesizing data into short oligonucleotides, followed by reading with a sequencing instrument. Major challenges include the molecular consumption of synthesized DNA, basecalling errors, and limitations with scaling up read operati...

Full description

Bibliographic Details
Main Authors: Billy Lau, Shubham Chandak, Sharmili Roy, Kedar Tatwawadi, Mary Wootters, Tsachy Weissman, Hanlee P. Ji
Format: Article
Language:English
Published: Nature Portfolio 2023-05-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-29575-z
_version_ 1827939939440918528
author Billy Lau
Shubham Chandak
Sharmili Roy
Kedar Tatwawadi
Mary Wootters
Tsachy Weissman
Hanlee P. Ji
author_facet Billy Lau
Shubham Chandak
Sharmili Roy
Kedar Tatwawadi
Mary Wootters
Tsachy Weissman
Hanlee P. Ji
author_sort Billy Lau
collection DOAJ
description Abstract The storage of data in DNA typically involves encoding and synthesizing data into short oligonucleotides, followed by reading with a sequencing instrument. Major challenges include the molecular consumption of synthesized DNA, basecalling errors, and limitations with scaling up read operations for individual data elements. Addressing these challenges, we describe a DNA storage system called MDRAM (Magnetic DNA-based Random Access Memory) that enables repetitive and efficient readouts of targeted files with nanopore-based sequencing. By conjugating synthesized DNA to magnetic agarose beads, we enabled repeated data readouts while preserving the original DNA analyte and maintaining data readout quality. MDRAM utilizes an efficient convolutional coding scheme that leverages soft information in raw nanopore sequencing signals to achieve information reading costs comparable to Illumina sequencing despite higher error rates. Finally, we demonstrate a proof-of-concept DNA-based proto-filesystem that enables an exponentially-scalable data address space using only small numbers of targeting primers for assembly and readout.
first_indexed 2024-03-13T09:02:08Z
format Article
id doaj.art-e05020c8bf364082b0fa245011539ba0
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-13T09:02:08Z
publishDate 2023-05-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-e05020c8bf364082b0fa245011539ba02023-05-28T11:17:38ZengNature PortfolioScientific Reports2045-23222023-05-0113111510.1038/s41598-023-29575-zMagnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressingBilly Lau0Shubham Chandak1Sharmili Roy2Kedar Tatwawadi3Mary Wootters4Tsachy Weissman5Hanlee P. Ji6Division of Oncology, Department of Medicine, Stanford University School of MedicineDepartment of Electrical Engineering, Stanford UniversityDivision of Oncology, Department of Medicine, Stanford University School of MedicineDepartment of Electrical Engineering, Stanford UniversityDepartment of Electrical Engineering, Stanford UniversityDepartment of Electrical Engineering, Stanford UniversityDivision of Oncology, Department of Medicine, Stanford University School of MedicineAbstract The storage of data in DNA typically involves encoding and synthesizing data into short oligonucleotides, followed by reading with a sequencing instrument. Major challenges include the molecular consumption of synthesized DNA, basecalling errors, and limitations with scaling up read operations for individual data elements. Addressing these challenges, we describe a DNA storage system called MDRAM (Magnetic DNA-based Random Access Memory) that enables repetitive and efficient readouts of targeted files with nanopore-based sequencing. By conjugating synthesized DNA to magnetic agarose beads, we enabled repeated data readouts while preserving the original DNA analyte and maintaining data readout quality. MDRAM utilizes an efficient convolutional coding scheme that leverages soft information in raw nanopore sequencing signals to achieve information reading costs comparable to Illumina sequencing despite higher error rates. Finally, we demonstrate a proof-of-concept DNA-based proto-filesystem that enables an exponentially-scalable data address space using only small numbers of targeting primers for assembly and readout.https://doi.org/10.1038/s41598-023-29575-z
spellingShingle Billy Lau
Shubham Chandak
Sharmili Roy
Kedar Tatwawadi
Mary Wootters
Tsachy Weissman
Hanlee P. Ji
Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing
Scientific Reports
title Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing
title_full Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing
title_fullStr Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing
title_full_unstemmed Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing
title_short Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing
title_sort magnetic dna random access memory with nanopore readouts and exponentially scaled combinatorial addressing
url https://doi.org/10.1038/s41598-023-29575-z
work_keys_str_mv AT billylau magneticdnarandomaccessmemorywithnanoporereadoutsandexponentiallyscaledcombinatorialaddressing
AT shubhamchandak magneticdnarandomaccessmemorywithnanoporereadoutsandexponentiallyscaledcombinatorialaddressing
AT sharmiliroy magneticdnarandomaccessmemorywithnanoporereadoutsandexponentiallyscaledcombinatorialaddressing
AT kedartatwawadi magneticdnarandomaccessmemorywithnanoporereadoutsandexponentiallyscaledcombinatorialaddressing
AT marywootters magneticdnarandomaccessmemorywithnanoporereadoutsandexponentiallyscaledcombinatorialaddressing
AT tsachyweissman magneticdnarandomaccessmemorywithnanoporereadoutsandexponentiallyscaledcombinatorialaddressing
AT hanleepji magneticdnarandomaccessmemorywithnanoporereadoutsandexponentiallyscaledcombinatorialaddressing