2021
DOI: 10.1101/2021.09.27.462049
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Expanding the Molecular Alphabet of DNA-Based Data Storage Systems with Neural Network Nanopore Readout Processing

Abstract: DNA is a promising next-generation data storage medium, but the recording latency and synthesis cost of oligos using the four natural nucleotides remain high. Here, we describe an improved DNA-based storage system that uses an extended 11-letter molecular alphabet combining natural and chemically modified nucleotides. Our extended-alphabet molecular storage paradigm offers a nearly two-fold increase in storage density and potentially the same order of reduction in the recording time. Experimental results invol… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…4,[42][43][44][45][46][47][48][49][50]36,14 While physical and logical redundancy lower the storage density of DNA, recent works have proposed to raise it by expanding the DNA alphabet using composite natural letters 7,51,52 or chemically modified nucleotides. 53 2.2. Storage and Degradation Issues.…”
Section: Sequence-based Dna Data Storage Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…4,[42][43][44][45][46][47][48][49][50]36,14 While physical and logical redundancy lower the storage density of DNA, recent works have proposed to raise it by expanding the DNA alphabet using composite natural letters 7,51,52 or chemically modified nucleotides. 53 2.2. Storage and Degradation Issues.…”
Section: Sequence-based Dna Data Storage Methodsmentioning
confidence: 99%
“…Contrary to SBS, it is therefore also able to identify modified and unnatural nucleotides such that the readout of data encoded using an expanded molecular alphabet is possible. 53,87 While nanopore sequencing improves upon several limitations of SBS for DNA data storage, as reviewed by Ceze et al, 12 two key constraints of the technology are its high error rate and the required sequence length. The high error rate of nanopore sequencing (∼10% per nt in the single read), 70,88 compared with the nearly negligible rate of errors introduced by SBS (∼0.5% per nt), 70 necessitates the clustering of sequence information, and thus, higher sequencing coverage and additional postprocessing of sequencing data.…”
Section: Readingmentioning
confidence: 99%
See 1 more Smart Citation