2009
DOI: 10.1073/pnas.0810767106
|View full text |Cite
|
Sign up to set email alerts
|

Sequence context-specific profiles for homology searching

Abstract: Sequence alignment and database searching are essential tools in biology because a protein's function can often be inferred from homologous proteins. Standard sequence comparison methods use substitution matrices to find the alignment with the best sum of similarity scores between aligned residues. These similarity scores do not take the local sequence context into account. Here, we present an approach that derives context-specific amino acid similarities from short windows centered on each query sequence resi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
138
0
1

Year Published

2009
2009
2017
2017

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 159 publications
(139 citation statements)
references
References 45 publications
0
138
0
1
Order By: Relevance
“…This script uses CS-BLAST, a sequence contextspecific extension of PSI-BLAST, for iterative sequence searching. 45 It also contains heuristics to reduce the inclusion of nonhomologous sequence segments at the ends of PSI-BLAST sequence matches, the leading cause of high-scoring false positive matches in PSI-BLAST. Profile HMMs were calculated from the alignments using hhmake and compared with HHsearch, both from the HHsearch 1.6.0 package.…”
Section: Methodsmentioning
confidence: 99%
“…This script uses CS-BLAST, a sequence contextspecific extension of PSI-BLAST, for iterative sequence searching. 45 It also contains heuristics to reduce the inclusion of nonhomologous sequence segments at the ends of PSI-BLAST sequence matches, the leading cause of high-scoring false positive matches in PSI-BLAST. Profile HMMs were calculated from the alignments using hhmake and compared with HHsearch, both from the HHsearch 1.6.0 package.…”
Section: Methodsmentioning
confidence: 99%
“…In contrast, HHblits calculates pseudocounts that depend on the local sequence context (that is, the 13 positions around each residue). This method had improved the sensitivity and alignment quality of the resulting profile considerably 8 . HHblits then searches the HMM database and adds the sequences from HMMs below a defined expected value (E value) threshold to the query MSA, from which the HMM for the next search iteration is built ( Fig.…”
mentioning
confidence: 99%
“…We thus obtain a 219-row extended sequence profile, which can be aligned to extended sequences representing the other profile using fast, standard dynamic programming techniques. We generated the 219-letter alphabet using the same method that was previously used for learning an optimal set of sequence context profiles 8 , but here we set the window size from 13 to 1 residue. We also set the window weights w j to 100 to obtain a hard clustering.…”
mentioning
confidence: 99%
“…We carried out a profiled search for BH3 domains using structural HsBcl-xL and sequence motifs from known BH3-only proteins as a template [18]. This yielded four Hydra gene models encoding novel proteins with conserved BH3 domains.…”
Section: Bh3-only Proteinsmentioning
confidence: 99%