2020
DOI: 10.1016/j.ygeno.2020.03.001
|View full text |Cite
|
Sign up to set email alerts
|

RRGPredictor, a set-theory-based tool for predicting pathogen-associated molecular pattern receptors (PRRs) and resistance (R) proteins from plants

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
17
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 18 publications
(18 citation statements)
references
References 52 publications
1
17
0
Order By: Relevance
“…The files in tabulated TSV format generated by the InterProScan were used as input for the software RRG_Predictor [ 63 ], which makes a search in two steps, the first searches for sequences with one or more domain codes associated with PRR and R genes, and the second step classifies those sequences into one of the 13 resistance gene classes in plants associated with resistance to pathogens, based on the following domain combinations: (1) Proteins with Coiled-coils ( CC ) and nucleotide binding sites (NBS) domains, (2) proteins with CC, NBS, and Leucine-rich repeat (LRR) domains, (3) proteins with Mlo-like resistance proteins, (4) proteins with just NBS domain, (5) proteins with NBS and LRR domains, (6) proteins with Receptor-like Kinase (RLK), (7) proteins with RLK and Ginkbilobin2 (GNK2) domains, (8) proteins with Receptor-like without the Kinase domain, (9) proteins with Resistance to Powdery Mildew 8 ( RPW8 ) NBS, and LRR domains, (10) proteins with Toll/interleukin-1 receptor (TIR) domain, (11) proteins with TIR and NBS domains, (12) proteins with TIR, NBS, and LRR domains, and (13) proteins with Leucine-rich repeat (LRR) domains that do not fit any other class (UNKNOWN).…”
Section: Methodsmentioning
confidence: 99%
“…The files in tabulated TSV format generated by the InterProScan were used as input for the software RRG_Predictor [ 63 ], which makes a search in two steps, the first searches for sequences with one or more domain codes associated with PRR and R genes, and the second step classifies those sequences into one of the 13 resistance gene classes in plants associated with resistance to pathogens, based on the following domain combinations: (1) Proteins with Coiled-coils ( CC ) and nucleotide binding sites (NBS) domains, (2) proteins with CC, NBS, and Leucine-rich repeat (LRR) domains, (3) proteins with Mlo-like resistance proteins, (4) proteins with just NBS domain, (5) proteins with NBS and LRR domains, (6) proteins with Receptor-like Kinase (RLK), (7) proteins with RLK and Ginkbilobin2 (GNK2) domains, (8) proteins with Receptor-like without the Kinase domain, (9) proteins with Resistance to Powdery Mildew 8 ( RPW8 ) NBS, and LRR domains, (10) proteins with Toll/interleukin-1 receptor (TIR) domain, (11) proteins with TIR and NBS domains, (12) proteins with TIR, NBS, and LRR domains, and (13) proteins with Leucine-rich repeat (LRR) domains that do not fit any other class (UNKNOWN).…”
Section: Methodsmentioning
confidence: 99%
“…For benchmarking using the RefPlantNLR dataset we used DRAGO2 (DRAGO2-API, Osuna-Cruz et al ., 2018), NLGenomeSweeper (v1.2.0, Toda et al ., 2020; dependencies: Python 3.8, NCBI-BLAST+ (v2.11.0+), MUSCLE aligner (v3.8.1551), SAMtools (v1.9-50-g18be38a), bedtools (v2.27.1-9-g5f83cacb), HMMER (v3.3.1), InterProScan (v5.47-82.0), TransDecoder (v5.5.0)), NLR-Annotator (Steuernagel et al ., 2020; dependencies: meme-suite (v5.1.1), NLR-Parser (v3;Steuernagel et al ., 2015), Oracle Java SE Development Kit 11.0.9), RGAugury (Li et al ., 2016; dependencies: CViT, HMMER, InterProScan, ncoils, NCBI-BLAST+, Pfamscan, Phobius), and RRGPredictor (Santana Silva and Micheli, 2020; dependencies: InterProScan) using either amino acid, CDS, and/or the extracted NLR genomic loci as an input. Since NLGenomeSweeper and NLR-Annotator only accept nucleotide input, while RGAugury only accepts amino acid input, we only used RefPlantNLR entries for which CDS was available in the direct comparison.…”
Section: Methodsmentioning
confidence: 99%
“…As an input these tools take either annotated genomic features and transcriptomic data, or alternatively can be run directly on the unannotated genomic sequence. NLR-Parser, RGAugury, RRGPredictor, and DRAGO2 identify transcript and protein sequences that have features of NLRs and are best described as NLR extractors (Steuernagel et al ., 2015; Li et al ., 2016; Osuna-Cruz et al ., 2018; Santana Silva and Micheli, 2020). RGAugury, RRGPredictor, and DRAGO2 also extract other classes of immune-related genes in addition to NLRs.…”
Section: Introductionmentioning
confidence: 99%
“… 495 Recently, RRGPredictor, a tool mainly based on text mining and set theory to predict new plant PRRs, makes the identification of PRRs more effective, specific, and sensitive than other available tools. 496 And a random forest-based method was proposed to identify PRRs, which is superior to other machine learning methods for PRRs. This method constructs a benchmark database, uses the amino acid composition and the composition transition distribution to formulate the sequences in the dataset, and then uses the maximum relevance maximum distance to select the best features.…”
Section: Conclusion and Future Perspectivesmentioning
confidence: 99%