2023
DOI: 10.1039/d3dd00045a
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking protein structure predictors to assist machine learning-guided peptide discovery

Abstract: Machine learning models provide an informed and efficient strategy to create novel peptide and protein sequences with the desired profiles. Nevertheless, they are primarily trained on sequences where the tridimensional...

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 67 publications
(127 reference statements)
0
4
0
Order By: Relevance
“…Aware of the possible over-representation of α-helical peptides in our models, we evaluated the structural diversity of the peptide datasets. We recently developed a fast and reliable approach to estimate the structural landscape of any sizable peptide dataset, using protein structure predictors, PEP2D and AlphaFold2 (AF2) 29 . Many of the sequences in our datasets contained 50 or more residues, guiding our preferences for AF2.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Aware of the possible over-representation of α-helical peptides in our models, we evaluated the structural diversity of the peptide datasets. We recently developed a fast and reliable approach to estimate the structural landscape of any sizable peptide dataset, using protein structure predictors, PEP2D and AlphaFold2 (AF2) 29 . Many of the sequences in our datasets contained 50 or more residues, guiding our preferences for AF2.…”
Section: Resultsmentioning
confidence: 99%
“…We recently recommended a fast and robust estimate of the structural landscape(s) of medium-large datasets for fold discovery and prior ML modeling. Our predictions identified loose α-helices as the main structure class (65.1%), followed by random coils (17.8%), and β-stranded and mixed structures accounted for the rest of the large AMP dataset 29 . Consequently, current AMP models (predictors and generators) might favor these dominant structure classes.…”
Section: Introductionmentioning
confidence: 84%
See 2 more Smart Citations