2020
DOI: 10.1101/2020.05.23.112011
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Precise detection of Acrs in prokaryotes using only six features

Abstract: Anti-CRISPR proteins (Acrs) can suppress the activity of CRISPR-Cas systems. Some viruses depend on Acrs to expand their genetic materials into the host genome which can promote species diversity. Therefore, the identification and determination of Acrs are of vital importance. In this work we developed a random forest tree-based tool, AcrDetector, to identify Acrs in the whole genomescale using merely six features. AcrDetector can achieve a mean accuracy of 99.65%, a mean recall of 75.84%, a mean precision of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 25 publications
0
7
0
Order By: Relevance
“…Approximately 86.9% (272/313) of the 313 Acrs simultaneously predicted by the three methods were annotated as a ‘hypothetical protein’ or ‘Uncharacterized protein’ according to NCBI annotation. The proteins of unknown function were employed to predict Acrs ( 10 , 41 ), because if a protein has a validated functional annotation, it is less likely to perform an Acr function.…”
Section: Resultsmentioning
confidence: 99%
“…Approximately 86.9% (272/313) of the 313 Acrs simultaneously predicted by the three methods were annotated as a ‘hypothetical protein’ or ‘Uncharacterized protein’ according to NCBI annotation. The proteins of unknown function were employed to predict Acrs ( 10 , 41 ), because if a protein has a validated functional annotation, it is less likely to perform an Acr function.…”
Section: Resultsmentioning
confidence: 99%
“…The mathematical expressions for calculating these metrics are provided in Appendix C. Since other methods require additional information other than the sequence alone, such as gene location on chromosome [Dong et al, 2020], we compare DeepAcr with three recently proposed methods, namely PaCRISPR [Wang et al, 2020], AcRanker [Eitzinger et al, 2020], and the one from Gussow et al [2020]. All of them have shown strong Acr predicting capacities and efficiency.…”
Section: Resultsmentioning
confidence: 99%
“…By using the criteria for selecting the negative sample proteins as introduced in Wang et al [2020], we obtain 902 negative samples from the negative training dataset and 260 negative samples from the negative testing dataset for cross-dataset test. The 1094 non-redundant positive samples from Anti-CRISPRdb, and the 902 negative samples from the negative training dataset, and 260 negative samples from the cross-dataset dataset of Wang et al [2020], are combined as the whole experiments dataset, where the sequence similarity between positive samples and negative samples is also less than 40%. In the cross-dataset test, the Acrs of types I-F, II-C, and I-D are selected as positive test samples, and the resting Acrs are used as positive training samples.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations