2013
DOI: 10.1007/978-3-642-37195-0_9
|View full text |Cite
|
Sign up to set email alerts
|

UniNovo : A Universal Tool for de Novo Peptide Sequencing

Abstract: Motivation: Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but de novo peptide sequencing algorithms to analyze tandem mass (MS/MS) spectra are lagging behind. Although existing de novo sequencing tools perform well on certain types of spectra [e.g. Collision Induced Dissociation (CID) spectra of tryptic peptides], their performance often deteriorates on other types of spectra, such as Electron Transfer Dissociation (ETD), Higher-energy Collisional Dissociation (HCD) spect… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(23 citation statements)
references
References 38 publications
0
23
0
Order By: Relevance
“…Given an unmodified peptide with perfect fragmentation (high coverage and few spurious peaks), the peptide sequence can be assembled immediately by connecting the product ion pairs whose mass difference matches the mass value of certain Comparison on spectra level SEQUEST (Eng et al 1994) • Compares experimental spectra with theoretical spectra through cross-correlation function Comet (Eng et al 2013) • Academic (open-source) version of SEQUEST • Optimized for high resolution tandem mass spectra • Support multiple fragmentation types (HCD, ETD, and CID) • Probability-based scoring (cumulative binomial distribution) • Included in SearchGUI and node in Proteome Discover Universal MS-GF+ (Kim and Pevzner, 2014) • Support for data from multiple fragmentation methods, multiple enzyme digestion, and phosphoproteomics • Bioconductor, SearchGUI de novo sequencing Naïve approach PEAKS (Zhang, Xin, et al 2012) • Widely used commercial tool for de novo sequencing • Generates consensus sequences based on comparison of experimental spectrum with a large pool of randomly sampled sequences • Now a full-featured software suite for proteomics study Spectrum graph (dynamic programming) pNovo+ (Chi et al 2012) • Integrates complementary HCD and ETD spectral information. • pDAG algorithm looking for the top k longest paths • 0.018s for sequencing / spectrum pepNovo+ (Frank 2009a) • Boosting-based PSM ranking algorithm integrating multiple PSM features UniNovo (Jeong et al 2013) • Independent of specific spectral type • Estimates the probability that a constructed sequence is correct Novor (Ma 2015) • Currently the fastest sequencing tool for de novo searches • Contains two large decision trees built from NIST • Open source windows client for targeted proteomics methods creation • Supports creation and use of tandem MS spectral libraries amino acid residue, which is actually independent of the size of the overall sequence space. As the consistent improvement of instrument accuracy and experimental design, researchers are gradually approaching the ideal data quality required for de novo sequencing.…”
Section: B De Novo Sequencing-based Methodsmentioning
confidence: 99%
“…Given an unmodified peptide with perfect fragmentation (high coverage and few spurious peaks), the peptide sequence can be assembled immediately by connecting the product ion pairs whose mass difference matches the mass value of certain Comparison on spectra level SEQUEST (Eng et al 1994) • Compares experimental spectra with theoretical spectra through cross-correlation function Comet (Eng et al 2013) • Academic (open-source) version of SEQUEST • Optimized for high resolution tandem mass spectra • Support multiple fragmentation types (HCD, ETD, and CID) • Probability-based scoring (cumulative binomial distribution) • Included in SearchGUI and node in Proteome Discover Universal MS-GF+ (Kim and Pevzner, 2014) • Support for data from multiple fragmentation methods, multiple enzyme digestion, and phosphoproteomics • Bioconductor, SearchGUI de novo sequencing Naïve approach PEAKS (Zhang, Xin, et al 2012) • Widely used commercial tool for de novo sequencing • Generates consensus sequences based on comparison of experimental spectrum with a large pool of randomly sampled sequences • Now a full-featured software suite for proteomics study Spectrum graph (dynamic programming) pNovo+ (Chi et al 2012) • Integrates complementary HCD and ETD spectral information. • pDAG algorithm looking for the top k longest paths • 0.018s for sequencing / spectrum pepNovo+ (Frank 2009a) • Boosting-based PSM ranking algorithm integrating multiple PSM features UniNovo (Jeong et al 2013) • Independent of specific spectral type • Estimates the probability that a constructed sequence is correct Novor (Ma 2015) • Currently the fastest sequencing tool for de novo searches • Contains two large decision trees built from NIST • Open source windows client for targeted proteomics methods creation • Supports creation and use of tandem MS spectral libraries amino acid residue, which is actually independent of the size of the overall sequence space. As the consistent improvement of instrument accuracy and experimental design, researchers are gradually approaching the ideal data quality required for de novo sequencing.…”
Section: B De Novo Sequencing-based Methodsmentioning
confidence: 99%
“…This spectrum is easily interpreted, but manual interpretation of fragmentation spectra is challenging and time-consuming in general since each spectral peak in the MS n spectrum can be any of the expected fragment ion types (eg, b-or y-ion). The processing of large tandem MS data sets is facilitated by de novo sequencing software programmes (Allmer, 2011), including PEAKS (Zhang et al, 2012), Lutefisk (Taylor & Johnson, 1997), PepNovo (Frank & Pevzner, 2005), NovoHMM (Pevtsov, Fedulova, Mirzaei, Buck, & Zhang, 2006), AUDENS (Grossmann et al, 2005), Novor (Ma, 2015), Sherenga (Dancik, Addona, Clauser, Vath, & Pevzner, 1999), UniNovo (Jeong, Kim, & Pevzner, 2013) and pNovo + (Chi, Chen, et al, 2013). In these programmes, high-throughput automated interpretation of MS 2 spectra is supported by different algorithms.…”
Section: Peptide-centric (Bottom-up) Proteomicsmentioning
confidence: 99%
“…These have been successfully applied in many proteogenomics studies, especially for non‐model species where a comprehensive genome annotation is still unavailable or incomplete. Multiple tools to derive these partial sequence stretches or complete de novo sequences are available:, e.g., PepNovo+ (Frank & Pevzner, ), DirecTag (Tabb et al, ) (both bundled in a graphical user interface called DenovoGui (Muth et al, )) and UniNovo (Jeong et al, ). Special about the latter is that it can cope with all types of fragmentation (CID, ETD and/or HCD).…”
Section: Tools Databases and Pipelines For Proteogenomics Studiesmentioning
confidence: 99%