2019
DOI: 10.7150/ijbs.32142

SWPepNovo: An Efficient De Novo Peptide Sequencing Tool for Large-scale MS/MS Spectra Analysis

Abstract: Tandem mass spectrometry (MS/MS)-based de novo peptide sequencing is a powerful method for high-throughput protein analysis. However, the explosively increasing size of MS/MS spectra datasets inevitably and exponentially raises the computational demand of existing de novo peptide sequencing methods, an issue that urgently needs to be solved in computational biology. This paper introduces an efficient tool based on the SW26010 many-core processor, namely SWPepNovo, to process the large-scale peptide MS/MS spectra usin…

Citations: cited by 11 publications (7 citation statements)
References: 37 publications
“…This new approach calibrates the matching results from a target-decoy database with sample-specific controls, and can significantly improve the accuracy of isoform identification in non-canonical proteomes. Parallel PSM processing algorithms for large scale proteomics dataset have also been implemented [33].…”
Section: Param-medic [47] (citation type: mentioning)
confidence: 99%
“…To confirm our lower-bounds that we have proved for the existing methods, and lowerbounds on communication that might be possible we did a thorough evaluation of the existing methods. These existing methods [18,19,20,7,13,11,21,16,12,22,15,14] included MPI-based memory-distributed implementations, Map-Reduce/Hadoop implementations, and GPU-based methods. Since we are assuming a memory-distributed architecture for our bounds; we have concentrated on those studies.…”
Section: Meta-analysis Of Results Of Current Hpc Methods (citation type: mentioning)
confidence: 99%
“…For evaluation, we downloaded all the results [18,19,20,7,13,11,21,16,12,22,15,14] that have been reported till date. This information included, the database size, the number of spectra, serial and parallel times, and the speedups.…”
Section: Meta-analysis Of Results Of Current Hpc Methods (citation type: mentioning)
confidence: 99%
“…Proteomics data analysis tools are generally used for protein identification (via bioinformatics) and quantification, and bioinformatics techniques tools used to process the proteomics data. A few examples of data analysis tools that are used for the identification of peptides and proteins include Mascot (Eng et al, 1994 ), Swiss-Prot (Bairoch and Boeckmann, 1994 ), Sequest (Perkins et al, 1999 ), Tandem (Craig and Beavis, 2004 ), Skyline (MacLean et al, 2010 ), Uni-Prot, 1 UniNovo (Jeong et al, 2013 ), and SWPepNovo (Li et al, 2019 ). Such algorithm-based software were developed to match the MS collected data from peptide/protein analysis to their base peptides/proteins and with in silico predicted intact masses and fragmentation patterns (Urgen Cox and Mann, 2011 ).…”
Section: A Comparison Of the Major “Omics” Technologies (citation type: mentioning)
confidence: 99%