2012
DOI: 10.1073/pnas.1215251110
|View full text |Cite
|
Sign up to set email alerts
|

Navigating the protein fitness landscape with Gaussian processes

Abstract: Knowing how protein sequence maps to function (the "fitness landscape") is critical for understanding protein evolution as well as for engineering proteins with new and useful properties. We demonstrate that the protein fitness landscape can be inferred from experimental data, using Gaussian processes, a Bayesian learning technique. Gaussian process landscapes can model various protein sequence properties, including functional status, thermostability, enzyme activity, and ligand binding affinity. Trained on ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
323
1
2

Year Published

2013
2013
2024
2024

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 302 publications
(330 citation statements)
references
References 39 publications
4
323
1
2
Order By: Relevance
“…The gatekeeper hypothesis thus points to Ala-82 as a crucial target residue in research to enable further engineering of BM3 for diverse functions. Our findings show that, contrary to previous approaches focusing on increasing enzyme stability as a path to biotechnologically relevant enzymes (55,56), enzyme conformational destabilization is key to reducing the thermodynamic barrier to substrate binding and therefore to altered enzymatic activities, which could enable rapid identification of P450 (and other enzyme) variants with biotechnologically important activities.…”
Section: Discussioncontrasting
confidence: 44%
“…The gatekeeper hypothesis thus points to Ala-82 as a crucial target residue in research to enable further engineering of BM3 for diverse functions. Our findings show that, contrary to previous approaches focusing on increasing enzyme stability as a path to biotechnologically relevant enzymes (55,56), enzyme conformational destabilization is key to reducing the thermodynamic barrier to substrate binding and therefore to altered enzymatic activities, which could enable rapid identification of P450 (and other enzyme) variants with biotechnologically important activities.…”
Section: Discussioncontrasting
confidence: 44%
“…Large sequence-function datasets will provide an increasingly detailed view of the determinants of enzyme function. When combined with methods from statistics and machine learning, protein design rules can be extracted and applied in an automated manner (39). Given the rapid pace of advances in high-throughput experimentation, data-driven protein engineering may be able to outpace more traditional physics-based methods.…”
Section: Discussionmentioning
confidence: 99%
“…This can therefore increase dramatically the rate of knowledge-based navigation of the relevant search space (Fox and Huisman, 2008; Kell, 2012; Romero et al , 2013) for functional screening, and coupled with efficient synthesis and expression in E. coli or any other preferred host can provide a platform for the potential screening of millions of sequence variants in a single generation.…”
Section: Discussionmentioning
confidence: 99%