2021
DOI: 10.1038/s43588-021-00168-y
|View full text |Cite
|
Sign up to set email alerts
|

Cluster learning-assisted directed evolution

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
51
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 46 publications
(51 citation statements)
references
References 51 publications
0
51
0
Order By: Relevance
“…Several adaptive experimental design approaches, such as MLDE (17), cluster learning-assisted directed evolution (CLADE) (61), and ODBO (62), have utilized ML models for sampling-efficient protein engineering. There are two key differences between these approaches and our method.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Several adaptive experimental design approaches, such as MLDE (17), cluster learning-assisted directed evolution (CLADE) (61), and ODBO (62), have utilized ML models for sampling-efficient protein engineering. There are two key differences between these approaches and our method.…”
Section: Discussionmentioning
confidence: 99%
“…First, BO-EVO does not require prior knowledge of the structure or homolog sequences of a target protein, which may be unavailable or scarce. On the contrary, information derived from protein structures (e.g., ΔΔG) and homolog sequences were utilized by MLDE (17) and CLADE (61), respectively, as zero-shot predictors to exclude possible zero- or low-fitness mutants from experimental sets. Second, BO-EVO does not need to evaluate the whole design space in silico by restricting sequence design via evolutionary algorithms.…”
Section: Discussionmentioning
confidence: 99%
“…By normalizing the fitness to its global maximum, 92% of sequences have fitness lower than 0.01 and 99.3% sequences have fitness lower than 0.3 for GB1. Similarly, there are 92% of sequences having fitness lower than 0.01 and 99.96% of sequences having fitness lower than 0.3 for PhoQ (Supporting Information Figure S1) …”
Section: Methodsmentioning
confidence: 96%
“…In this work, we use two combinatorial libraries, GB1 and PhoQ, that have almost complete coverage for mutations at four mutational sites. GB1 is a very popular benchmark library, while the PhoQ library has also been used in early MLDE studies. , PhoQ is considered as an alternative data set. For both data sets, their fitness values were normalized into the range [0, 1] when being applied to CLADE.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation