2020
DOI: 10.1101/2020.01.06.895466
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Protein Sequence Design with a Learned Potential

Abstract: The primary challenge of fixed-backbone protein sequence design is to find a distribution of sequences that fold to the backbone of interest. In practice, state-of-the-art protocols often find viable but highly convergent solutions. In this study, we propose a novel method for fixed-backbone protein sequence design using a learned deep neural network potential. We train a convolutional neural network (CNN) to predict a distribution over amino acids at each residue position conditioned on the local structural e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
47
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 32 publications
(49 citation statements)
references
References 100 publications
2
47
0
Order By: Relevance
“…With recent advances in deep learning technology, machine learning tools have seen increasing applications in protein science, with deep neural networks being applied to tasks such as sequence design [11], fold recognition [12], binding site prediction [13], and structure prediction [14,15]. Generative models, which approximate the distributions of the data they are trained on, have garnered interest as a data-driven way to create novel proteins.…”
Section: Introductionmentioning
confidence: 99%
“…With recent advances in deep learning technology, machine learning tools have seen increasing applications in protein science, with deep neural networks being applied to tasks such as sequence design [11], fold recognition [12], binding site prediction [13], and structure prediction [14,15]. Generative models, which approximate the distributions of the data they are trained on, have garnered interest as a data-driven way to create novel proteins.…”
Section: Introductionmentioning
confidence: 99%
“…There has been growing interest in using neural network-based approaches for protein representation learning and design (19,(37)(38)(39)(40), with several new methods reported during the preparation of this manuscript (41,42). While most methods are accompanied by a variety of metrics which attempt to illustrate the accuracy of the predictions, it is inherently difficult to evaluate the quality of generative models, as their ultimate goal is to generate entirely novel sequences with no existing counterparts.…”
Section: Resultsmentioning
confidence: 99%
“…The generative approaches to de novo protein structure design so far described in the literature, rule- or model-based, either focus exclusively on helical structures ( 31 33 ), are not geared toward atomic-detail modeling and design ( 34 ), or sacrifice fine-grained structural control for structural diversity ( 35 ). Machine-learning–based generative models show considerable promise ( 35 , 36 ), but have not yet been applied to the direct generation of full atomic structures with specific features of interest, as we do here for scaffolds containing a varied geometry of binding pockets. We hope the experimental data generated in this work will aid the development of models that more efficiently produce protein structures with finer control over atomic detail and greater diversity.…”
Section: Discussionmentioning
confidence: 99%