2021
DOI: 10.1038/s41467-021-24497-8
|View full text |Cite
|
Sign up to set email alerts
|

A deep learning model for predicting next-generation sequencing depth from DNA sequence

Abstract: Targeted high-throughput DNA sequencing is a primary approach for genomics and molecular diagnostics, and more recently as a readout for DNA information storage. Oligonucleotide probes used to enrich gene loci of interest have different hybridization kinetics, resulting in non-uniform coverage that increases sequencing costs and decreases sequencing sensitivities. Here, we present a deep learning model (DLM) for predicting Next-Generation Sequencing (NGS) depth from DNA probe sequences. Our DLM includes a bidi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
23
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 47 publications
(23 citation statements)
references
References 32 publications
0
23
0
Order By: Relevance
“…Rather than querying for only known variants with molecular probes as with digital PCR, NGS has the capacity to find genetic perturbations in a relatively unbiased fashion, since sequencing, by its very nature, will identify all the base pairs of a given DNA molecule. The flexibility in selectively capturing and/or amplifying only regions of interest has led to approaches for sequencing only parts of a genome, including so-called “exome capture” and also gene panels ( 82 ). Unfortunately, NGS is prone to sequencing errors, due to the inherent nature of strand synthesis, as well as PCR amplification.…”
Section: Analysis Of Ctdnamentioning
confidence: 99%
“…Rather than querying for only known variants with molecular probes as with digital PCR, NGS has the capacity to find genetic perturbations in a relatively unbiased fashion, since sequencing, by its very nature, will identify all the base pairs of a given DNA molecule. The flexibility in selectively capturing and/or amplifying only regions of interest has led to approaches for sequencing only parts of a genome, including so-called “exome capture” and also gene panels ( 82 ). Unfortunately, NGS is prone to sequencing errors, due to the inherent nature of strand synthesis, as well as PCR amplification.…”
Section: Analysis Of Ctdnamentioning
confidence: 99%
“…Machine learning algorithms , build a model based on sample data, known as “training data”, to make predictions or decisions without completely understanding the “black box”, which represents the mechanism and learning process during model training. These characteristics enable machine learning to be successfully applied in the fields of chemistry and biology, such as spectral prediction, cancer biomarker detection, biosensor advancement, and chemical synthesis. Despite this progress, the application of machine learning algorithms to guide CD synthesis is still limited. In our previous studies, we developed a deep convolution neural network (DCNN) model for predicting the optical properties of CDs .…”
Section: Introductionmentioning
confidence: 99%
“…Although numerous studies have been conducted with deep neural networks for natural computing, deep neural networks are still new models for the DNA data storage system. For instance, in 2021, a GRU (gated recurrent unit)-based deep learning model was presented for DNA information storage with next-generation sequence prediction [14]. Similarly, the author proposed a DeepMod system that integrates the RNN and LSTM models to perceive the DNA codes from various Oxford Nanopore sequences.…”
Section: Deep Neural Network For Dna Codesmentioning
confidence: 99%