2005
DOI: 10.1093/nar/gki474

PRODOC: a resource for the comparison of tethered protein domain architectures with in-built information on remotely related domain families

Abstract: PROtein Domain Organization and Comparison (PRODOC) comprises several programs that enable convenient comparison of proteins as a sequence of domains. The in-built dataset currently consists of ∼698 000 proteins from 192 organisms with complete genomic data, and all the SWISSPROT proteins obtained from the Pfam database. All the entries in PRODOC are represented as a sequence of functional domains, assigned using hidden Markov models, instead of as a sequence of amino acids. On average 69% of the proteins in t…

Cited by 10 publications (5 citation statements)
References: 39 publications
“…Domain architectures have been also used to detect homology between multidomain protein families and were shown to achieve a very good performance. 18,19 For example, Krishnamurthy et al 20 clustered sequences sharing similar domain architectures to detect homologous proteins while the method introduced by Krishnadev et al 21 allowed identification of circular permutations in the evolution of multidomain protein families.…”
Section: Introduction (mentioning)
confidence: 99%
“…Two parameters were employed to estimate sequence divergence: variability in composition and order of sequence domains and percent sequence identity. Pfam [ 10 ] domain assignments of the different subunits of RNA polymerase complex of bacterial organisms with known genome sequences were extracted from the PRODOC [ 11 ] database. Greater than 100 sequences were obtained for alpha (121), beta (138) and betaprime (127) subunits each.…”
Section: Methods (mentioning)
confidence: 99%
“…Detecting a fusion event of a protein of unknown function with a sequence involved in a known metabolic pathway or regulatory network may accordingly help to narrow the possibilities of its putative functions and thus facilitate hypothesis building. Many computational approaches to detect such gene fusion events were undertaken in the past [80][81][82]. FusionDB is a public database that allows detecting such gene fusion events [83,84].…”
Section: FusionDB (mentioning)
confidence: 99%