2012
DOI: 10.1074/mcp.o111.014902
|View full text |Cite
|
Sign up to set email alerts
|

De Novo Sequencing and Homology Searching

Abstract: In proteomics, de novo sequencing is the process of deriving peptide sequences from tandem mass spectra without the assistance of a sequence database. Such analyses have traditionally been performed manually by human experts, and more recently by computer programs that have been developed because of the need for higher throughput. Although powerful, de novo sequencing often can only determine partially correct sequence tags because of imperfect tandem mass spectra. However, these sequence tags can then be sear… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
124
0
4

Year Published

2012
2012
2023
2023

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 144 publications
(128 citation statements)
references
References 108 publications
0
124
0
4
Order By: Relevance
“…As pointed out by Ma and Johnson (2011), de novo sequences not only are valuable for the analysis of the novel peptides that are not present in proteome databases but also can facilitate the homology-based database searches. Since the reconstructions reported by UniNovo contain mass gaps representing the total mass of multiple amino acids [termed gapped peptides (Jeong et al, 2011;Kim et al, 2009b)], MS-BPM algorithm (Ng et al, 2011) can be used for fast exact or homology searches (UniNovoÈMS-BPM).…”
Section: Resultsmentioning
confidence: 99%
“…As pointed out by Ma and Johnson (2011), de novo sequences not only are valuable for the analysis of the novel peptides that are not present in proteome databases but also can facilitate the homology-based database searches. Since the reconstructions reported by UniNovo contain mass gaps representing the total mass of multiple amino acids [termed gapped peptides (Jeong et al, 2011;Kim et al, 2009b)], MS-BPM algorithm (Ng et al, 2011) can be used for fast exact or homology searches (UniNovoÈMS-BPM).…”
Section: Resultsmentioning
confidence: 99%
“…Since the actual output of an MS experiment is a set of MS/MS spectra, the first step typically consists of identifying the origin of these spectra, i.e., match spectra to peptides. While various options exist for this goal (including de novo sequencing (Frank et al 2007;Ma and Johnson 2012) and tagbased approaches (Tabb et al 2008;Tabb et al 2003)), database searching is by far the most common spectrum identification method. Here, acquired spectra are matched to a set of known spectra to find the best match and thereby transitively assign identification.…”
Section: From Acquired Data To Processed Resultsmentioning
confidence: 99%
“…In the second approach, instead of searching acquired MS/MS spectra against theoretically predicted spectra, one can assign MS/MS spectra to peptides by matching against a spectral library (spectral library searching) [132,27,42,63]. In the third approach, peptide sequences are extracted directly from the spectra, i.e., without referring to a sequence database for help (de novo sequencing approach) [122,82,81]. The forth category is hybrid approaches, such as those based on the extraction of short sequence tags (a piece of information about a peptide that contains three to five amino acid residues) followed by "error-tolerant" database searching, where the error tolerant search of uninterpreted MS/MS data is a powerful way of finding additional peptide matches [85,121,41].…”
Section: Research Objective and Motivationmentioning
confidence: 99%
“…The relations between users and objects that can be represented by a bipartite graph is the basic information contained by every recommender system. 81 4.2 The illustration of a bipartite network (a), its X projection (b), and Y projection (c) [141]. The The vertical axis is the mass, and the maximum mass is approximate 9,000. .…”
mentioning
confidence: 99%