2000
DOI: 10.1002/1097-0134(20010101)42:1<38::aid-prot50>3.0.co;2-3
|View full text |Cite
|
Sign up to set email alerts
|

Sequence complexity of disordered protein

Abstract: Intrinsic disorder refers to segments or to whole proteins that fail to self-fold into fixed 3D structure, with such disorder sometimes existing in the native state. Here we report data on the relationships among intrinsic disorder, sequence complexity as measured by Shannon's entropy, and amino acid composition. Intrinsic disorder identified in protein crystal structures, and by nuclear magnetic resonance, circular dichroism, and prediction from amino acid sequence, all exhibit similar complexity distribution… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

50
1,721
1
9

Year Published

2003
2003
2022
2022

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 1,645 publications
(1,822 citation statements)
references
References 69 publications
50
1,721
1
9
Order By: Relevance
“…Individual conformations can contain elements of secondary structure, but the population-weighted average conformation has only fractional secondary structure. These protein sequences have lower complexity than folded proteins and are enriched in arginine, lysine, glutamate, proline and serine, with fewer cysteine, tyrosine, tryptophan, isoleucine and valine residues 17 , as observed for the R region, with 30% charged residues. Prediction of intrinsically disordered proteins in complete genomes shows an increasing proportion of disordered proteins with increasing organism complexity: up to 14% of archaeal, 21% of bacterial and 41% of eukaryotic proteins are predicted to contain stretches of more than 50 disordered residues 18 .…”
mentioning
confidence: 90%
“…Individual conformations can contain elements of secondary structure, but the population-weighted average conformation has only fractional secondary structure. These protein sequences have lower complexity than folded proteins and are enriched in arginine, lysine, glutamate, proline and serine, with fewer cysteine, tyrosine, tryptophan, isoleucine and valine residues 17 , as observed for the R region, with 30% charged residues. Prediction of intrinsically disordered proteins in complete genomes shows an increasing proportion of disordered proteins with increasing organism complexity: up to 14% of archaeal, 21% of bacterial and 41% of eukaryotic proteins are predicted to contain stretches of more than 50 disordered residues 18 .…”
mentioning
confidence: 90%
“…24 Briefly, reports on disordered regions identified by NMR, circular dichroism, or protease digestion were located by keyword searches of PubMed (http://www.ncbi.nlm.nih.gov/entrez). Additionally, starting from PDB_Select_25, 25 a nonredundant subset of the Protein Data Bank (PDB), 26 whose members have Ͻ25% sequence identity, we identified disordered regions in X-ray crystal structures by searching for residues whose backbone atoms are absent from the ATOMS list of their PDB files.…”
Section: Data Setsmentioning
confidence: 99%
“…Because the amount of information about disordered proteins is rather limited, in accordance with previous work, 24 only first-order statistics of the 20 amino acids within a given window were used as attributes to prevent the "curse of dimensionality." 28 For example, attribute X A at a given position is calculated as the fraction of amino acid A (ALA or Alanine) within a window.…”
Section: Data Representation and Attribute Selectionmentioning
confidence: 99%
See 1 more Smart Citation
“…22,28 The circular dichroism spectra are similar to those observed for coil-like natively unfolded polypeptides; 28 changes in circular dichroism as a function of temperature also resemble the response of intrinsically disordered proteins. 27 Analysis of the CTCF sequence with disorder prediction algorithms 29,30 even identified regions in the terminal domains as likely to be unstructured (data not shown). We cannot rule out the possible existence of isolated helices or strands, but these elements are neither abundant nor assemble into an ordered fold.…”
Section: Functional Implications Of Ctcf Molecular Architecturementioning
confidence: 99%