Machine learning overcomes human bias in the discovery of self-assembling peptides

Batra, Rohit; Loeffler, Troy D.; Chan, Henry; Srinivasan, Srilok; Cui, Honggang; Korendovych, Ivan V.; Nanda, Vikas; Palmer, Liam C.; Solomon, Lee A.; Fry, H. Christopher; Sankaranarayanan, Subramanian K. R. S.

doi:10.1038/s41557-022-01055-3

Cited by 70 publications

(72 citation statements)

References 65 publications

“…The sequence space increases dramatically with the length of the peptides as 20^n, with n being the length of the sequence, an intractable problem to identify the best sequences for self-assembly. Thus, design strategies typically use heuristics on known substitution patterns (e.g., among hydrophobic residues) or by using charge complementarity to design new sequences, reducing the number of sequences to study into a manageable number. In recent years, advancements in machine learning for both protein structure prediction, peptide binding, and sequence optimization have transformed computational biology.…”

Section: Results and Discussion (mentioning)

confidence: 99%

Molecular Modeling of Self-Assembling Peptides

Jones

Pérez

2023

ACS Appl. Bio Mater.

Peptide epitopes mediate as many as 40% of protein-protein interactions and fulfill signaling, inhibition, and activation roles within the cell. Beyond protein recognition, some peptides can self- or coassemble into stable hydrogels, making them a readily available source of biomaterials. While these 3D assemblies are routinely characterized at the fiber level, there are missing atomistic details about the assembly scaffold. Such atomistic detail can be useful in the rational design of more stable scaffold structures and with improved accessibility to functional motifs. Computational approaches can in principle reduce the experimental cost of such an endeavor by predicting the assembly scaffold and identifying novel sequences that adopt said structure. Yet, inaccuracies in physical models and inefficient sampling have limited atomistic studies to short (two or three amino acid) peptides. Given recent developments in machine learning and advances in sampling strategies, we revisit the suitability of physical models for this task. We use the MELD (Modeling Employing Limited Data) approach to drive self-assembly in combination with generic data in cases where conventional MD is unsuccessful. Finally, despite recent developments in machine learning algorithms for protein structure and sequence predictions, we find the algorithms are not yet suited for studying the assembly of short peptides.
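The citation statement from this paper describes the peptide sequence space growing as 20 to the power n. A minimal sketch of that arithmetic follows; the function name `sequence_space_size` and the chosen lengths are illustrative assumptions, not taken from either paper.

```python
# Illustrative only: counts sequences over the 20 canonical amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def sequence_space_size(n: int, alphabet_size: int = len(AMINO_ACIDS)) -> int:
    """Number of distinct linear peptides of length n (alphabet_size ** n)."""
    if n < 0:
        raise ValueError("peptide length must be non-negative")
    return alphabet_size ** n


if __name__ == "__main__":
    # Lengths picked arbitrarily to show how quickly the count grows.
    for n in (2, 3, 5, 6, 10):
        print(f"n={n:>2}: {sequence_space_size(n):,} sequences")
```

Even at n = 6 the count is already 64,000,000, which is why exhaustive experimental screening is described as intractable.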

“…Machine learning-based techniques can hopefully be used to predict peptide sequences that can be assembled to form controlled morphologies and specific functions, thereby controlling their mineralization properties. [325][326][327] This will help in shortening the time of peptide design, reducing the work of experimental screening and optimization, improving the design success rate, and making it possible to design sequences with a better structure and function.…”

Section: Discussion (mentioning)

confidence: 99%

“…324 In addition, machine learning methods can be used to design peptide sequences with high self-assembly propensity, which can self-assemble into hydrogels. 325 Based on the above research studies, we believe that machine learning strategies for designing peptide sequences during mineralization should be developed, which is promising, although not yet directly relevant. Machine learning-based techniques can hopefully be used to predict peptide sequences that can be assembled to form controlled morphologies and specific functions, thereby controlling their mineralization properties.…”

Section: Discussion (mentioning)

confidence: 99%

Biomimetic mineralization based on self-assembling peptides

Wang

Zhang

et al. 2023

Chem. Soc. Rev.

“…Recently, an ML approach, in combination with Monte Carlo tree search and molecular dynamics simulations, was reported for predicting unexpected de novo β-sheet rich self-assembling peptides. 22 While all these reports successfully identified either bioactive or self-assembling peptides separately, they did not predict bioactive self-assembling peptides.…”

Section: Navigating Through Chemical Space Via Machine Learning (mentioning)

confidence: 98%

Inverse design of viral infectivity-enhancing peptide fibrils from continuous protein-vector embeddings

Kaygisiz

Dutta

Rauch-Wirth

et al. 2023

Preprint

Amyloid-like nanofibers from self-assembling peptides can promote viral gene transfer for therapeutic applications. Traditionally, new sequences are discovered either from screening large libraries or by creating derivatives of known active peptides. However, the discovery of de novo peptides, which are sequence-wise not related to any known active peptides, is limited by the difficulty to rationally predict structure-activity relationships because their activities typically have multi-scale and multi-parameter dependencies. Here, we used a small library of 163 peptides to predict de novo sequences for viral infectivity enhancement using a machine learning (ML) approach based on natural language processing. Specifically, we trained an ML model using continuous vector representations of the peptides, which were previously shown to retain relevant information embedded in the sequences. We used the trained ML model to sample the sequence space of peptides with 6 amino acids to identify promising candidates. These 6-mers were then further screened for charge and aggregation propensity. The resulting 16 new 6-mers were tested and found to be active with a 25% hit rate. Strikingly, these de novo sequences are the shortest active peptides for infectivity enhancement reported so far and show no sequence relation to the training set. Moreover, by screening the chemical space, we discovered the first hydrophobic peptide fibrils with a moderately negative surface charge that can enhance infectivity. Hence, this ML strategy is a time- and cost-efficient way for expanding the chemical space of short functional self-assembling peptides exemplified for therapeutic viral gene delivery.
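The sample-then-screen step described in this abstract can be sketched roughly as follows. This is not the authors' pipeline: the random sampler stands in for their trained ML model, the charge table (K and R as +1, D and E as -1, everything else neutral) is a common simplification, and the threshold window and all function names are hypothetical.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Simplified side-chain charges near neutral pH (assumption; His and termini ignored).
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def net_charge(seq: str) -> int:
    """Sum of the simplified side-chain charges of a peptide."""
    return sum(CHARGE.get(residue, 0) for residue in seq)


def sample_peptides(k: int, length: int = 6, seed: int = 0) -> list[str]:
    """Draw k random peptides; a placeholder for model-guided sampling."""
    rng = random.Random(seed)
    return ["".join(rng.choice(AMINO_ACIDS) for _ in range(length)) for _ in range(k)]


def charge_filter(peptides: list[str], low: int, high: int) -> list[str]:
    """Keep peptides whose net charge lies in [low, high] (hypothetical window)."""
    return [p for p in peptides if low <= net_charge(p) <= high]


if __name__ == "__main__":
    candidates = sample_peptides(1000)
    kept = charge_filter(candidates, low=-1, high=1)
    print(f"{len(kept)} of {len(candidates)} sampled 6-mers pass the charge window")
    # The abstract reports 16 tested 6-mers at a 25% hit rate.
    print("active peptides implied:", round(16 * 0.25))
```

The reported figures imply four active peptides among the 16 tested. An aggregation-propensity screen, which the abstract also mentions, is omitted here because the source gives no details on how it was computed.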
