Tandem mass spectrometry (MS/MS) can be used to identify peptides present in a biological sample containing unknown proteins. De novo peptide sequencing aims to determine the amino acid sequence of a portion of a peptide directly from MS/MS spectral data. Unlike spectral cross-correlation methods of peptide sequencing, the de novo approach does not require a complete database of all possible proteins that may be present in the sample. In this work, a de novo peptide sequencing algorithm (denovoGPU) was implemented using general-purpose computing techniques on a graphics processing unit (GPGPU), in order to reduce the runtime of the algorithm sufficiently to complete in real-time during MS/MS data collection. This is a step towards enabling true information-driven MS/MS, where incremental data analysis is used to guide data collection. Given data from an MS/MS spectrum, the algorithm filters the data, generates and scores candidate "sequence tags" (or short amino acid sequences), and ultimately outputs a ranked list of sequence tags. The denovoGPU algorithm was tested on over 380 experimentally obtained MS/MS spectra, whose peptide sequences were validated using the Mascot search engine for mass spectrometry data. The performance of the algorithm was compared to an existing de novo peptide sequencing algorithm (PepNovo) in terms of runtime and sequence tag accuracy. Constraints of the denovoGPU algorithm due to limited GPU memory were identified. By adjusting various parameters of the denovoGPU algorithm, the runtime was reduced to below one second, which is an essential requirement for real-time information-driven MS/MS.
The author has granted a non exclusive license allowing Library and Archives Canada to reproduce, publish, archive, preserve, conserve, communicate to the public by telecommunication or on the Internet, loan, distribute and sell theses worldwide, for commercial or non commercial purposes, in microform, paper, electronic and/or any other formats. AVIS:L'auteur a accorde une licence non exclusive permettant a la Bibliotheque et Archives Canada de reproduire, publier, archiver, sauvegarder, conserver, transmettre au public par telecommunication ou par Nntemet, preter, distribuer et vendre des theses partout dans le monde, a des fins commerciales ou autres, sur support microforme, papier, electronique et/ou autres formats.Bien que ces formulaires aient inclus dans la pagination, il n'y aura aucun contenu manquant.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.