2020
DOI: 10.1021/acs.jproteome.0c00385
|View full text |Cite
|
Sign up to set email alerts
|

METATRYP v 2.0: Metaproteomic Least Common Ancestor Analysis for Taxonomic Inference Using Specialized Sequence Assemblies—Standalone Software and Web Servers for Marine Microorganisms and Coronaviruses

Abstract: We present METATRYP version 2 software that identifies shared peptides across the predicted proteomes of organisms within environmental metaproteomics studies to enable accurate taxonomic attribution of peptides during protein inference. Improvements include ingestion of complex sequence assembly data categories (metagenomic and metatranscriptomic assemblies, single cell amplified genomes, and metagenome assembled genomes), prediction of the least common ancestor (LCA) for a peptide shared across multiple orga… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
21
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 19 publications
(21 citation statements)
references
References 45 publications
0
21
0
Order By: Relevance
“…This is a key question within the field of metaproteomics because of the possibility that peptide constituents of proteins could be found in multiple organisms present within an individual sample. To address this, the OPP utilizes the software tool METATRYP we previously created that searches a database of all tryptic peptides among a group of organisms specified or within meta-omic assemblies from the environment. , METATRYP then identifies peptides that are shared among multiple organisms and reports which organisms share the peptides and calculates the “Least Common Ancestor” (LCA) of the identified taxa possessing this peptide. The METATRYP databases use the NCBI Taxonomy database to identify the ancestral phylogeny of the taxa identified that possess the peptide in question.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…This is a key question within the field of metaproteomics because of the possibility that peptide constituents of proteins could be found in multiple organisms present within an individual sample. To address this, the OPP utilizes the software tool METATRYP we previously created that searches a database of all tryptic peptides among a group of organisms specified or within meta-omic assemblies from the environment. , METATRYP then identifies peptides that are shared among multiple organisms and reports which organisms share the peptides and calculates the “Least Common Ancestor” (LCA) of the identified taxa possessing this peptide. The METATRYP databases use the NCBI Taxonomy database to identify the ancestral phylogeny of the taxa identified that possess the peptide in question.…”
Section: Resultsmentioning
confidence: 99%
“…The results can be visualized in heatmap and tree formats to allow the user to gain an immediate understanding of who the Least Common Ancestor of the protein constituent is and their associated taxonomic lineages (Figure ). The OPP is currently using METATRYP Version 2 that has improved performance, can calculate the Least Common Ancestor, and has the capacity to separate its database into genomes, metagenomes, and metagenomic products that are described in a separate manuscript …”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The workflow also included additional, optional in-line characterization of these peptides by searching against NCBI-non redundant (nr) BLAST-P and Unipept [ 40 ] analysis. Further offline analysis was performed using NCBI BLAST-P analysis as well as the MetaTRYP [ 34 ] coronavirus database. The peptide validation workflow can be found at COVID Galaxy website ( https://COVID19.galaxyproject.org/proteomics ).…”
Section: Methodsmentioning
confidence: 99%
“…We then interrogated these peptides using the PepQuery search engine [ 31 ] to confirm the quality of these PSMs and determine whether the matched sequences were unique to SARS-CoV-2 or could be better ascribed to the human proteome or that of another closely related coronavirus. Peptides and their associated PSMS which survived this rigorous filtering were then manually validated using the Multi-omics Visualization Platform [ 32 ] and further analyzed for specificity to the SARS-CoV-2 virus via BLAST-P [ 33 ] and MetaTryp [ 34 ]. Taken together, our analyses enable the construction of a high-confidence target peptide list that would form the basis of a targeted clinical proteomics assay for SARS-CoV-2 infection.…”
Section: Introductionmentioning
confidence: 99%