2022
DOI: 10.1186/s13007-022-00971-2
|View full text |Cite
|
Sign up to set email alerts
|

Species-specific basecallers improve actual accuracy of nanopore sequencing in plants

Abstract: Background Long-read sequencing platforms offered by Oxford Nanopore Technologies (ONT) allow native DNA containing epigenetic modifications to be directly sequenced, but can be limited by lower per-base accuracies. A key step post-sequencing is basecalling, the process of converting raw electrical signals produced by the sequencing device into nucleotide sequences. This is challenging as current basecallers are primarily based on mixtures of model species for training. Here we utilise both ONT… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 31 publications
0
15
0
Order By: Relevance
“…From our evaluation, the performance of RODAN holds up well against Guppy in terms of read accuracy, especially for organisms in its training data. As species-specific training data are known to improve performance of nanopore basecallers in those species [Wick et al, 2019, Ferguson et al, 2022, the improvements of RODAN suggest a promising direction for training species-specific basecallers also for dRNA-seq data. Lastly, the presence of the same systematic error patterns in RODAN points to more fundamental causes of errors in the raw signal data, necessitating further development of better pore chemistry to produce higher quality dRNA-seq data.…”
Section: Discussionmentioning
confidence: 99%
“…From our evaluation, the performance of RODAN holds up well against Guppy in terms of read accuracy, especially for organisms in its training data. As species-specific training data are known to improve performance of nanopore basecallers in those species [Wick et al, 2019, Ferguson et al, 2022, the improvements of RODAN suggest a promising direction for training species-specific basecallers also for dRNA-seq data. Lastly, the presence of the same systematic error patterns in RODAN points to more fundamental causes of errors in the raw signal data, necessitating further development of better pore chemistry to produce higher quality dRNA-seq data.…”
Section: Discussionmentioning
confidence: 99%
“…We performed Hi-C scaffolding, grouping, ordering, and orienting of our previously assembled E. melliodora genome into pseudo-chromosomes (Ferguson, et al, 2022). Leaves were obtained from the reference tree, and a proximity ligation library for chromosome conformation capture was created with a Phase Genomics Proximo Hi-C (Plant) Kit (version 4), according to the manufacturer’s instructions (document KT3040B).…”
Section: Methodsmentioning
confidence: 99%
“…These tools can already identify the differences between nuclear and mitochondrial genomes (Danilevsky et al 2022 ) as well as human and bacterial genomes (Bao et al 2021 ). Species-specific basecallers trained on the genomes of specific species of plants are able to outperform ‘universal’ base-callers (Ferguson et al 2022 ). Similarly, a rise in species-specific variant callers such as PEPPER-DeepVariant (Shafin et al 2021 ) suggest information rich differences exist in variants as well.…”
Section: Long-read Dna Nanopore Sequencingmentioning
confidence: 99%