2020
DOI: 10.1021/acs.jproteome.0c00407
|View full text |Cite
|
Sign up to set email alerts
|

Spritz: A Proteogenomic Database Engine

Abstract: Proteoforms are the workhorses of the cell, and subtle differences between their amino acid sequence or post-translational modifications (PTMs) can change their biological function. To most effectively identify and quantify proteoforms in genetically diverse samples by mass spectrometry (MS), it is advantageous to search the MS data against a sample-specific protein database that is tailored to the sample being analyzed, in that it contains the correct amino acid sequences and relevant PTMs for that sample. To… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
24
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 25 publications
(24 citation statements)
references
References 39 publications
0
24
0
Order By: Relevance
“…, Refs. ( 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 , 26 )). The immunopeptidogenomics workflow presented here needs only sample-specific RNA-Seq data to produce a single customized DB incorporating conventional peptides, neopeptides from SNVs/polymorphisms and noncomplex indels, and cryptic peptides derived from noncanonical translation, such as alternative reading frames and noncoding RNA regions/transcripts.…”
mentioning
confidence: 99%
“…, Refs. ( 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 , 26 )). The immunopeptidogenomics workflow presented here needs only sample-specific RNA-Seq data to produce a single customized DB incorporating conventional peptides, neopeptides from SNVs/polymorphisms and noncomplex indels, and cryptic peptides derived from noncanonical translation, such as alternative reading frames and noncoding RNA regions/transcripts.…”
mentioning
confidence: 99%
“…There are considerable challenges still to face in their detection, particularly in noise/signal differentiation. This is especially complicated as variants often co-occur with other PTMs such as phosphorylation. , Current detection methods including ionbot cannot handle the complexity of two modifications on one site. However, deep neural networks show great promise with difficult peptide identifications .…”
Section: Discussionmentioning
confidence: 99%
“…A proteogenomics approach where only those variant peptides predicted from genome or transcriptome information are added to the peptide search databases can improve their detection. Proteogenomics pipelines have streamlined this process of incorporating personal genome information into a proteomic search database. In addition, there is evidence that including correct sequence variant information, including often-overlooked sample-specific indels and frameshifts, improves variant peptide identification workflows . Yet, false discovery rate (FDR) correction is needed to compensate for the increase of database size and complexity. , When searching for evidence of specific peptides such as variant peptides, an additional subset-specific FDR correction should be made …”
Section: Introductionmentioning
confidence: 99%
“…Examples of free or open-source proteomic software that can run in a Linux environment include Crux 14 , EncyclopeDIA 15 , FragPipe 16,17 , ProteoWizard 18 , SearchGUI 19 , The OpenMS Proteomics Pipeline (referred to as OpenMS or TOPP) 20,21 , Trans-Proteomic Pipeline (TPP) 22,23 , and X!Tandem 24 . Other tools that run in Windows but can also be run via command line in a Linux environment include MaxQuant 25,26 , MetaMorpheus 27 , and Spritz 28 . In order to truly take advantage of the scalability of cloud environments, software that can work in a clustered environment is preferred.…”
Section: Proteomics In the Cloudmentioning
confidence: 99%