2022
DOI: 10.1101/2022.12.21.521443
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep learning-enabled design of synthetic orthologs of a signaling protein

Abstract: Evolution-based deep generative models represent an exciting direction in understanding and designing proteins. An open question is whether such models can represent the constraints underlying specialized functions that are necessary for organismal fitness in specific biological contexts. Here, we examine the ability of three different models to produce synthetic versions of SH3 domains that can support function in a yeast stress signaling pathway. Using a select-seq assay, we show that one form of a variation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
37
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 9 publications
(37 citation statements)
references
References 62 publications
0
37
0
Order By: Relevance
“…SH3 domains have evolved to perform a variety of functions within various organisms by evolving differential binding specificities, resulting in a number of distinct paralogs (i.e., homologous proteins performing different functions within the same species) within the SH3 family. Recent work by Lian et al trained VAEs over an MSA of ~--5,300 SH3 homologs to develop a deep generative model for synthetic SH3 design [20]. The VAE learned an unsupervised three-dimensional latent space embedding in which the natural sequences demonstrated an emergent hierarchical clustering by phylogeny and function.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…SH3 domains have evolved to perform a variety of functions within various organisms by evolving differential binding specificities, resulting in a number of distinct paralogs (i.e., homologous proteins performing different functions within the same species) within the SH3 family. Recent work by Lian et al trained VAEs over an MSA of ~--5,300 SH3 homologs to develop a deep generative model for synthetic SH3 design [20]. The VAE learned an unsupervised three-dimensional latent space embedding in which the natural sequences demonstrated an emergent hierarchical clustering by phylogeny and function.…”
Section: Resultsmentioning
confidence: 99%
“…Based on these desired criteria, a number of deep learning architectures have been employed for data-driven protein design. Two approaches in particular have received substantial attention: variational autoencoders (VAEs) [15, 16, 20, 21] and transformer-based models [2431, 35]. VAEs are deep generative models comprising two consecutive neural networks [36, 37]: an encoder compresses the high-dimensional sequence data into a low-dimensional latent space that is then passed to a decoder whose task is to reconstruct the input sequences as the output of the network.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations