2022
DOI: 10.1093/bib/bbac098
|View full text |Cite
|
Sign up to set email alerts
|

S2Snet: deep learning for low molecular weight RNA identification with nanopore

Abstract: Ribonucleic acid (RNA) is a pivotal nucleic acid that plays a crucial role in regulating many biological activities. Recently, one study utilized a machine learning algorithm to automatically classify RNA structural events generated by a Mycobacterium smegmatis porin A nanopore trap. Although it can achieve desirable classification results, compared with deep learning (DL) methods, this classic machine learning requires domain knowledge to manually extract features, which is sophisticated, labor-intensive and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(14 citation statements)
references
References 24 publications
0
14
0
Order By: Relevance
“…The input long sequence S is truncated to n sub-sequence ], and the input RNA type T is truncated to n sub-targets ]. It considers that the previous paper used the RF algorithm as the classification model to distinguish the RNA types ( Guan et al , 2022 ; Wang et al , 2021 ). Therefore, the input of RF algorithm is the feature vector ( v i ) of the sub-sequence s i extracted by feature extract methods, which contains the length, mean, standard deviation and other statistical information.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…The input long sequence S is truncated to n sub-sequence ], and the input RNA type T is truncated to n sub-targets ]. It considers that the previous paper used the RF algorithm as the classification model to distinguish the RNA types ( Guan et al , 2022 ; Wang et al , 2021 ). Therefore, the input of RF algorithm is the feature vector ( v i ) of the sub-sequence s i extracted by feature extract methods, which contains the length, mean, standard deviation and other statistical information.…”
Section: Methodsmentioning
confidence: 99%
“…In some cases, the machine learning model C is specially set as the RF algorithm in the RNA types prediction experiment, as shown in Figure 1b . Correspondingly, we set the C as the CNN model in the ONT barcode classification experiment and the RNA type classification experiment by S2Snet ( Guan et al , 2022 ). Notably, the query function Q contains six common strategies: query-by-committee (QBC) is based on the QS ( Freund et al , 1997 ), Random is the random sampling, QUerying Informative and Representative Examples (QUIRE) is the pool-based active learning strategy ( Huang et al , 2010 ), Density is the density-based sampling AL strategy ( Nguyen and Smeulders, 2004 ), EER is Expected Error Reduction ( Roy and McCallum, 2001 ), LAL is Learning Active Learning ( Konyushkova et al , 2017 ), SPAL is Self-Paced Active Learning ( Tang and Huang, 2019 ) and UNCertainty sampling (UNC) is based on the Margin Sampling ( Lewis and Gale, 1994 ) in our experimental configuration, as shown in Figure 1c .…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations