An analysis of single amino acid repeats as use case for application specific background models

Łabaj, Paweł P.; Sykacek, Peter; Kreil, David P.

doi:10.1186/1471-2105-12-173

Cited by 3 publications

(2 citation statements)

References 50 publications


“…Over billions of years, organisms have leveraged fundamental physical principles, exemplified by signal peptides like nuclear localization signals with significant compositional biases [89]. Such peptides, despite low information content, hold substantial abstract signal value due to their unique enrichment in specific functional groups, as seen in the PKKKRKV segment found in the SV40 Large T-antigen.…”

Section: Results (mentioning)

confidence: 99%

Deciphering Peptide-Protein Interactions via Composition-Based Prediction: A Case Study with Survivin/BIRC5

Anindya, Olsson, Jensen et al. 2024

Preprint


In the realm of atomic physics and chemistry, composition emerges as the most powerful means of describing matter. Mendeleev’s periodic table and chemical formulas, while not entirely free from ambiguities, provide robust approximations for comprehending the properties of atoms, chemicals, and their collective behaviours, which stem from the dynamic interplay of their constituents. Our study illustrates that protein-protein interactions follow a similar paradigm, wherein the composition of peptides plays a pivotal role in predicting their interactions with the protein survivin, using an elegantly simple model. An analysis of these predictions within the context of the human proteome not only illuminates the known cellular locations of survivin and its interaction partners, but also introduces novel insights into biological functionality. It becomes evident that an electrostatic- and primary structure-based description falls short in predictive power, leading us to speculate that protein interactions are orchestrated by the collective dynamics of the functional groups of the peptides involved.

“…Over billions of years, organisms have leveraged fundamental physical principles, exemplified by signal peptides like nuclear localization signals with significant compositional biases [99]. Despite low information content, such peptides hold substantial abstract signal value due to their unique enrichment in specific functional groups, as seen in the PKKKRKV segment found in the SV40 Large T-antigen.…”

Section: Exploring the Effectiveness Of Composition-based Prediction (mentioning)

confidence: 99%

Deciphering peptide-protein interactions via composition-based prediction: a case study with survivin/BIRC5

Anindya, Olsson, Jensen et al. 2024

Mach. Learn.: Sci. Technol.


In the realm of atomic physics and chemistry, composition emerges as the most powerful means of describing matter. Mendeleev’s periodic table and chemical formulas, while not entirely free from ambiguities, provide robust approximations for comprehending the properties of atoms, chemicals, and their collective behaviours, which stem from the dynamic interplay of their constituents. Our study illustrates that protein-protein interactions follow a similar paradigm, wherein the composition of peptides plays a pivotal role in predicting their interactions with the protein survivin, using an elegantly simple model. An analysis of these predictions within the context of the human proteome not only confirms the known cellular locations of survivin and its interaction partners, but also introduces novel insights into biological functionality. It becomes evident that electrostatic- and primary structure-based descriptions fall short in predictive power, leading us to speculate that protein interactions are orchestrated by the collective dynamics of functional groups.


Disentangling the complexity of low complexity proteins

Mier, Paladin, Tamana et al. 2019

Briefings in Bioinformatics


There are multiple definitions for low complexity regions (LCRs) in protein sequences, with all of them broadly considering LCRs as regions with fewer amino acid types compared to an average composition. Following this view, LCRs can also be defined as regions showing composition bias. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, and more generally the overlaps between different properties related to LCRs, using examples. We argue that statistical measures alone cannot capture all structural aspects of LCRs and recommend the combined usage of a variety of predictive tools and measurements. While the methodologies available to study LCRs are already very advanced, we foresee that a more comprehensive annotation of sequences in the databases will enable the improvement of predictions and a better understanding of the evolution and the connection between structure and function of LCRs. This will require the use of standards for the generation and exchange of data describing all aspects of LCRs.
