1999
DOI: 10.1073/pnas.96.8.4285
|View full text |Cite
|
Sign up to set email alerts
|

Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles

Abstract: Determining protein functions from genomic sequences is a central goal of bioinformatics. We present a method based on the assumption that proteins that function together in a pathway or structural complex are likely to evolve in a correlated fashion. During evolution, all such functionally linked proteins tend to be either preserved or eliminated in a new species. We describe this property of correlated evolution by characterizing each protein by its phylogenetic profile, a string that encodes the presence or… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

10
1,307
1
8

Year Published

2001
2001
2005
2005

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 1,573 publications
(1,326 citation statements)
references
References 10 publications
10
1,307
1
8
Order By: Relevance
“…Proteins in gray boxes are unclassified (unknown function) while the others are classified proteins with the functions within the brackets and labeled according to the following criteria: 1-cell growth; 2-budding, cell polarity and filament formation; 3-pheromone response, mating-type determination, sex-specific proteins; 4-cell cycle check point proteins; 5-cytokinesis; 6-rRNA synthesis; 7-tRNA synthesis; 8-transcriptional control; 9-other transcription activities; 10-other pheromone response activities; 11-stress response; and 12-nuclear organization. Given one of these proteins of unknown function if we take as a prediction the function that appears more in the neighbor proteins of known function then we obtain the following classification (from top to bottom) YNL127W (2), YDR200C (3,4,10) and YLR238W (12). Our method, however, considers also the interactions among unclassified proteins.…”
Section: Figmentioning
confidence: 99%
See 2 more Smart Citations
“…Proteins in gray boxes are unclassified (unknown function) while the others are classified proteins with the functions within the brackets and labeled according to the following criteria: 1-cell growth; 2-budding, cell polarity and filament formation; 3-pheromone response, mating-type determination, sex-specific proteins; 4-cell cycle check point proteins; 5-cytokinesis; 6-rRNA synthesis; 7-tRNA synthesis; 8-transcriptional control; 9-other transcription activities; 10-other pheromone response activities; 11-stress response; and 12-nuclear organization. Given one of these proteins of unknown function if we take as a prediction the function that appears more in the neighbor proteins of known function then we obtain the following classification (from top to bottom) YNL127W (2), YDR200C (3,4,10) and YLR238W (12). Our method, however, considers also the interactions among unclassified proteins.…”
Section: Figmentioning
confidence: 99%
“…Our method, however, considers also the interactions among unclassified proteins. If we iterate once more the majority rule by taking into account the interactions between the three unclassified proteins, we obtain the following classification YNL127W (2,4), YDR200C (3,4,10) and YLR238W (12). In this way we find another possible function for YNL127W.…”
Section: Figmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition to the standard homology searching, several computational methods for functional prediction of proteins have been proposed. They include the domain fusion method (Marcotte et al, 1999a) and the phylogenetic profile method (Pellegrini et al, 1999), both of which are based on the comparative genomics. Although such methods will give us interesting clues, their reliability is not satisfactory high.…”
Section: Introductionmentioning
confidence: 99%
“…Two distinct examples include the correlation of mRNA expression profiles to function, as in the characterization of pharmacological perturbations by identifying a novel target of dyclonine, 78 and through in silico predictions of structural or pathway interactions from whole genome comparative analyses. 79 von Mering et al 80 evaluated the roughly 80 000 interactions among yeast proteins currently available in various databases by comparing data from yeast two-hybrid systems, mass spectrometric protein complex purification techniques, mRNA expression profiles, genetic interaction data, as well as in silico interaction predictions from gene fusion, neighborhood, and co-occurrence analysis. Surprisingly, only B2400 of these interactions are supported by more than one method.…”
Section: Protein-protein Interactions: Isolation Of Protein Complexesmentioning
confidence: 99%