2022
DOI: 10.48550/arxiv.2203.12742
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

Abstract: Bayesian optimization is a gold standard for query-efficient continuous optimization. However, its adoption for drug and antibody sequence design has been hindered by the discrete, highdimensional nature of the decision variables. We develop a new approach (LaMBO) which jointly trains a denoising autoencoder with a discriminative multi-task Gaussian process head, enabling gradient-based optimization of multi-objective acquisition functions in the latent space of the autoencoder. These acquisition functions all… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 46 publications
(71 reference statements)
0
7
0
Order By: Relevance
“…Another potential direction is to incorporate cost-aware Bayesian optimization techniques into the EHIG framework [29,50,3]. We also wish to study how the proposed EHIG framework could be applied in practice to solve various problems in the sciences, including experimental physics [14,32,8], drug discovery [43,27,20], and materials design [31,46]. Finally, we wish to study in further detail how the EHIG acquisition function could be implemented for Bayesian decision making with other probabilistic models beyond Gaussian processes [42,10,9,34].…”
Section: Discussionmentioning
confidence: 99%
“…Another potential direction is to incorporate cost-aware Bayesian optimization techniques into the EHIG framework [29,50,3]. We also wish to study how the proposed EHIG framework could be applied in practice to solve various problems in the sciences, including experimental physics [14,32,8], drug discovery [43,27,20], and materials design [31,46]. Finally, we wish to study in further detail how the EHIG acquisition function could be implemented for Bayesian decision making with other probabilistic models beyond Gaussian processes [42,10,9,34].…”
Section: Discussionmentioning
confidence: 99%
“…Bayesian optimization is highly effective in striking this balance while being able to also incorporate prior knowledge about the problem, constraints to the search space, and the ability to optimize multiple objectives simultaneously. This has successfully been used in chemical synthesis engineering to reduce the number of experiments required to find the optimal conditions for a reaction and can similarly be applied to protein engineering. ,,,, …”
Section: Machine-learning Methods For Protein Engineeringmentioning
confidence: 99%
“…Stanton et al 79 combined a denoising autoencoder (a model similar to a VAE) with a Gaussian process regressor that predicts protein properties. Using a multiobjective Bayesian optimization approach, they generated novel red-spectrum fluorescent proteins (RFPs) with improved predicted stability and solvent-accessible surface areas (SASAs).…”
Section: Machine-learning Methods For Protein Engineeringmentioning
confidence: 99%
See 1 more Smart Citation
“…Figure S1 shows optimization of latent space has continuous improvement, but after decoding to an actual sequence and evaluating g( u) there is a plateau. This may be unique to UniRep or because or be task specific because other recent work has successfully used latent spaces for BayesOpt 68,69 and work on improving extrapolation from latent spaces. 70 Nevertheless, avoiding latent space negates this potential agreement problem between u and x.…”
Section: B Bayesian Optimizationmentioning
confidence: 99%