Mol-CycleGAN: a generative model for molecular optimization

Maziarka, Łukasz; Pocha, Agnieszka; Kaczmarczyk, Jan; Rataj, Krzysztof; Danel, Tomasz; Warchoł, Michał

doi:10.1186/s13321-019-0404-1

Cited by 227 publications

(206 citation statements)

References 40 publications

Supporting

Mentioning

203

Contrasting

Unclassified

Order By: Relevance

“…The literature concerning generative models of molecules has exploded since the first work on the topic Gómez-Bombarelli et al [2018]. Current methods feature molecular representations such as SMILES [Janz et al, 2018, Segler et al, 2017, Skalic et al, 2019, Ertl et al, 2017, Lim et al, 2018, Kang and Cho, 2018, Sattarov et al, 2019, Gupta et al, 2018, Harel and Radinsky, 2018, Yoshikawa et al, 2018, Bjerrum and Sattarov, 2018, Mohammadi et al, 2019 and graphs [Simonovsky and Komodakis, 2018, Li et al, 2018a, De Cao and Kipf, 2018, Kusner et al, 2017, Dai et al, 2018, Samanta et al, 2019, Li et al, 2018b, Kajino, 2019, Jin et al, 2019, Bresson and Laurent, 2019, Lim et al, 2019, Pölsterl and Wachinger, 2019, Krenn et al, 2019, Maziarka et al, 2019, Madhawa et al, 2019, Shen, 2018, Korovina et al, 2019 In this section we conduct an empirical test of the hypothesis from [Gómez-Bombarelli et al, 2018] that the decoder's lack of efficiency is due to data point collection in "dead regions" of the latent space far from the data on which the VAE was trained. We use this information to construct a binary classification Bayesian Neural Network (BNN) to serve as a constraint function that outputs the probability of a latent point being valid, the details of which will be discussed in the section on labelling criteria.…”

Section: Related Workmentioning

confidence: 99%

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

2020

View full text Add to dashboard Cite

Automatic Chemical Design is a framework for generating novel molecules with optimized properties. The original scheme, featuring Bayesian optimization over the latent space of a variational autoencoder, suffers from the pathology that it tends to produce invalid molecular structures. First, we demonstrate empirically that this pathology arises when the Bayesian optimization scheme queries latent points far away from the data on which the variational autoencoder has been trained. Secondly, by reformulating the search procedure as a constrained Bayesian optimization problem, we show that the effects of this pathology can be mitigated, yielding marked improvements in the validity of the generated molecules. We posit that constrained Bayesian optimization is a good approach for solving this class of training set mismatch in many generative tasks involving Bayesian optimization over the latent space of a variational autoencoder.

show abstract

Section: Related Workmentioning

confidence: 99%

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

2020

View full text Add to dashboard Cite

show abstract

“…Likewise, Maziarka et al [44] implemented a deep learning GAN architecture called the Mol-CycleGAN structure to produce optimized molecular compounds where their molecular structures were highly similar to the original ones. It should be emphasized that both the generative and discriminative network modules in the Mol-CycleGAN structure directly performed with latent vectors, and then the latent vectors were translated back to chemical structures (represented as molecular graphs).…”

Section: Molecular De Novo Designmentioning

confidence: 99%

Relevant Applications of Generative Adversarial Networks in Drug Design and Discovery: Molecular De Novo Design, Dimensionality Reduction, and De Novo Peptide and Protein Design

Lin

Lane

2020

Molecules

View full text Add to dashboard Cite

A growing body of evidence now suggests that artificial intelligence and machine learning techniques can serve as an indispensable foundation for the process of drug design and discovery. In light of latest advancements in computing technologies, deep learning algorithms are being created during the development of clinically useful drugs for treatment of a number of diseases. In this review, we focus on the latest developments for three particular arenas in drug design and discovery research using deep learning approaches, such as generative adversarial network (GAN) frameworks. Firstly, we review drug design and discovery studies that leverage various GAN techniques to assess one main application such as molecular de novo design in drug design and discovery. In addition, we describe various GAN models to fulfill the dimension reduction task of single-cell data in the preclinical stage of the drug development pipeline. Furthermore, we depict several studies in de novo peptide and protein design using GAN frameworks. Moreover, we outline the limitations in regard to the previous drug design and discovery studies using GAN models. Finally, we present a discussion of directions and challenges for future research.

show abstract

“…CycleGAN provides unpaired image-to-image translation using Cycle-Consistent Adversarial Networks (Zhu et al, 2017 ). MolCycleGAN, which extended the CycleGAN framework with an added loss and extra encoding network, maps from distribution to distribution on unpaired samples, so it can amplify the size of our dataset in the process by taking all of the pairing combinations rather than relying on a training dataset of predefined molecule-inhibitor pairs (Maziarka et al, 2020 ). The advantage of MolCycleGAN is the ability to learn transformation rules from the sets of compounds with desired and undesired values of the considered property.…”

Section: The Rise Of the Machines: Allosteric Mechanisms Through The mentioning

confidence: 99%

Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning

Verkhivker

Agajanian

et al. 2020

Front. Mol. Biosci.

View full text Add to dashboard Cite

Mol-CycleGAN: a generative model for molecular optimization

Cited by 227 publications

References 40 publications

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

Relevant Applications of Generative Adversarial Networks in Drug Design and Discovery: Molecular De Novo Design, Dimensionality Reduction, and De Novo Peptide and Protein Design

Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning

Contact Info

Product

Resources

About