2020
DOI: 10.1186/s12859-020-3459-0
|View full text |Cite
|
Sign up to set email alerts
|

Nanopore basecalling from a perspective of instance segmentation

Abstract: Background: Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment's pass through a nanopore, genotypes are determined. Currently, the accuracy of nanopore basecalling has a higher error rate than the basecalling of short-read sequencing. Through utilizing deep neural networks, the-state-of-the art nanopore basec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
29
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(29 citation statements)
references
References 13 publications
0
29
0
Order By: Relevance
“…The large coverage of the long read nanopore sequencing allowed for a robust first draft genome to be created, with few fragments. However, nanopore read base calling remains less accurate than Illumina, around 95% [ 82 ], and while modern base calling software harnessing neural networks are improving this, it is still necessary to polish, with higher accuracy, short read Illumina sequencing data to create a robust genome assembly, in order to remove the indels and miscalled bases present in nanopore reads that otherwise lead to frameshifts and fragmentation of genes.…”
Section: Discussionmentioning
confidence: 99%
“…The large coverage of the long read nanopore sequencing allowed for a robust first draft genome to be created, with few fragments. However, nanopore read base calling remains less accurate than Illumina, around 95% [ 82 ], and while modern base calling software harnessing neural networks are improving this, it is still necessary to polish, with higher accuracy, short read Illumina sequencing data to create a robust genome assembly, in order to remove the indels and miscalled bases present in nanopore reads that otherwise lead to frameshifts and fragmentation of genes.…”
Section: Discussionmentioning
confidence: 99%
“…The Mauler basecaller, proposed by Abbaszadegan [ 13 ], implemented a new approach using deep learning encoder–decoder with the attention model. Even more recent tools are URNano, based on U-net and proposed by Zhang et al [ 14 ], and Causalcall [ 15 ], using TCN, which is a model gaining in popularity. Mentioned tools have increased the accuracy of single strand reads basecalling to over 90–95%.…”
Section: Introductionmentioning
confidence: 99%
“…As for new conceptions in this field, both Abbaszadegan’s [ 13 ] and Zhang et al’s [ 14 ] works mention that, apart from the direct use of raw data, the processing of its segmentation might be beneficial, providing more information to the model.…”
Section: Introductionmentioning
confidence: 99%
“…Zhang et al. 40 also proposed a refined U-net model (thus, a UR-net model, an enhanced U-net model for 1D sequence segmentation) called URnano to improve previous end-to-end deep-learning models. Early neural-network base-callers (such as DeepNano 41 and BasecRAWller 42 ) relied on a preprocessing step that segmented the current measurements into discrete events that may artificially introduce errors when segmenting the ion current signals.…”
Section: Introductionmentioning
confidence: 99%