2019
DOI: 10.1186/s12859-019-2980-5
|View full text |Cite
|
Sign up to set email alerts
|

MCtandem: an efficient tool for large-scale peptide identification on many integrated core (MIC) architecture

Abstract: Background Tandem mass spectrometry (MS/MS)-based database searching is a widely acknowledged and widely used method for peptide identification in shotgun proteomics. However, due to the rapid growth of spectra data produced by advanced mass spectrometry and the greatly increased number of modified and digested peptides identified in recent years, the current methods for peptide database searching cannot rapidly and thoroughly process large MS/MS spectra datasets. A breakthrough in efficient datab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 47 publications
(54 reference statements)
0
8
0
Order By: Relevance
“…These studies include Parallel Tandem [ 196 ] which spawns multiple instances of the original X!Tandem on distributed machines; X! !Tandem [ 207 ] achieves parallelism using owner-compute MPI processes; MR-Tandem [ 208 ] uses Map-Reduce instead of MPI for better speedup efficiency; MCtandem [ 209 ] employs Intel Many Integrated Core (MIC) architecture co-processor to speedup spectral dot products (SDP) for X!Tandem, and SW-Tandem [ 197 ] employs the Haswell AVX2 engine to speedup SDP computations on Sunway Taihulight supercomputer. SW-Tandem also spawns a manager process that distributes the experimental data to worker processes using a global queue for better load balancing.…”
Section: Current Hpc Methods For Analysis and Their Limitationsmentioning
confidence: 99%
“…These studies include Parallel Tandem [ 196 ] which spawns multiple instances of the original X!Tandem on distributed machines; X! !Tandem [ 207 ] achieves parallelism using owner-compute MPI processes; MR-Tandem [ 208 ] uses Map-Reduce instead of MPI for better speedup efficiency; MCtandem [ 209 ] employs Intel Many Integrated Core (MIC) architecture co-processor to speedup spectral dot products (SDP) for X!Tandem, and SW-Tandem [ 197 ] employs the Haswell AVX2 engine to speedup SDP computations on Sunway Taihulight supercomputer. SW-Tandem also spawns a manager process that distributes the experimental data to worker processes using a global queue for better load balancing.…”
Section: Current Hpc Methods For Analysis and Their Limitationsmentioning
confidence: 99%
“…In the particular context of mass spectrometry, LSH has been used for looking up peptide sequences in databases [18] [19], to cluster different spectra for MS1 spectra on LC-MS data [20], and for fast database lookup on MS2 spectra [21].…”
Section: Related Workmentioning
confidence: 99%
“…As demonstrated by other big data fields [23], such limitations can be reduced by developing parallel algorithms that combine the computational power of thousands of processing elements across distributed-memory clusters, and supercomputers. We, and others have developed high-performance computing (HPC) techniques for processing of MS data including for multicore [3], [2], [10], [9], and distributed-memory architectures [24], [25] [26], [27], [28], [29]. Similar to serial algorithms, the objective of these HPC methods has been to speed up the arithmetic scoring part of the search engines, by spawning multiple (managed) instances of the original code, replicating the theoretical database, and splitting the experimental data.…”
Section: Mainmentioning
confidence: 99%
“…!Tandem however by breaking computations into small Map and Reduce tasks (Map-Reduce model) exhibiting better parallel efficiency than the Parallel Tandem and X!!Tandem. MCtandem [27] and SW-Tandem [28] implement the same parallel design but offload the X!Tandem's expensive Spectral Dot Product (SDP) computations over Intel Many Integrated Core (MIC) co-processor and Haswell AVX2 vector instructions respectively. Both algorithms also implement optimization techniques including double buffering, pre-fetching, overlapped communication and computations and a task-distributor for better performance.…”
Section: Data Availabilitymentioning
confidence: 99%