2022
DOI: 10.1109/tcbb.2020.3044575
|View full text |Cite
|
Sign up to set email alerts
|

RNN-VirSeeker: A Deep Learning Method for Identification of Short Viral Sequences From Metagenomes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
15
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 26 publications
(15 citation statements)
references
References 37 publications
0
15
0
Order By: Relevance
“…Tools such as the machine learning algorithms RNN-VirSeeker and VirFinder, along with cloud-based platforms like Serratus, have been innovatively designed to pinpoint viral sequences within metagenomic data, markedly enhancing the efficiency and precision of virus detection. [129][130][131] Despite the remarkable advancements in the construction of infectious clones for insect viruses, significant challenges persist. These challenges include the assembly or preservation of large genomic fragments in prokaryotic cells, such as Escherichia coli or Agrobacterium.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Tools such as the machine learning algorithms RNN-VirSeeker and VirFinder, along with cloud-based platforms like Serratus, have been innovatively designed to pinpoint viral sequences within metagenomic data, markedly enhancing the efficiency and precision of virus detection. [129][130][131] Despite the remarkable advancements in the construction of infectious clones for insect viruses, significant challenges persist. These challenges include the assembly or preservation of large genomic fragments in prokaryotic cells, such as Escherichia coli or Agrobacterium.…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, advancements in bioinformatics have greatly enriched the domain of virus identification. Tools such as the machine learning algorithms RNN‐VirSeeker and VirFinder, along with cloud‐based platforms like Serratus, have been innovatively designed to pinpoint viral sequences within metagenomic data, markedly enhancing the efficiency and precision of virus detection 129–131 …”
Section: Discussionmentioning
confidence: 99%
“…An alternative is to use one-dimensional (1D) convolution, which has linear complexity with respect to k , to represent kmers. Naturally, convolutional networks have been well adopted in the literature, but convolutional networks struggle to grasp extended sequence dependencies vital for DNA-related tasks. Conversely, Transformers employ a neural network design that incorporates self-attention, enabling them to understand relationships across any length of a sequence .…”
Section: Introductionmentioning
confidence: 99%
“…Long-short term memory (LSTM) network and convolutional neural network (CNN) are the most commonly used models. For example, ViraMiner [21], VirNet [22] and RNN-VirSeeker [23] utilize a single LSTM network to learn the interconnections between each part in a one-hot encoded sequence; DeepVirFinder [24], PPR-Meta [25] and CHEER [26] establish a single CNN to extract high-level features from one-hot encoded sequences automatically before a set of dense layers and a softmax layer for classification. However, the following two issues hamper the performance of deep learning models for the recognition of short viral sequences: 1) the single deep learning architecture suffers from failing to extract enough features to represent sequences; 2) the one-hot encoding strategy omits the relationship between two parts of a sequence because of its orthogonal property [27].…”
Section: Introductionmentioning
confidence: 99%
“…Universally, a confusion matrix is calculated to evaluate the performance of a classifier according to four statistics: True Positives (TP), False Positives (FP), True Negatives (TN), and False Negatives (FN)[23]. TP are examples correctly labeled as positives; FP refer to negative examples incorrectly labeled as positive; TN correspond to negatives correctly labeled as negative; and FN refer to positive examples incorrectly labeled as negative.…”
mentioning
confidence: 99%