Curiosity in exploring chemical spaces: intrinsic rewards for molecular reinforcement learning

Thiede, Luca Anthony; Krenn, Mario; Nigam, AkshatKumar; Aspuru‐Guzik, Alán

doi:10.1088/2632-2153/ac7ddc

Cited by 24 publications

(34 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In drug discovery, experimental verification of molecular properties is very time-consuming and labor-intensive. Most DL generative models have not yet incorporated this challenge into the design process. − ,,− ,,− ,− To make ChemistGA more aware of this constraint by design, we propose an augmented ChemistGA, called R-ChemistGA, which simulates the real environment as further elucidated below.…”

Section: Results and Discussionmentioning

confidence: 99%

ChemistGA: A Chemical Synthesizable Accessible Molecular Generation Algorithm for Real-World Drug Discovery

Wang

Sun

et al. 2022

J. Med. Chem.

View full text Add to dashboard Cite

Many deep learning (DL)-based molecular generative models have been proposed to design novel molecules. These models may perform well on benchmarks, but they usually do not take real-world constraints into account, such as available training data set, synthetic accessibility, and scaffold diversity in drug discovery. In this study, a new algorithm, ChemistGA, was proposed by combining the traditional heuristic algorithm with DL, in which the crossover of the traditional genetic algorithm (GA) was redefined by DL in conjunction with GA, and an innovative backcrossing operation was implemented to generate desired molecules. Our results clearly show that ChemistGA not only retains the strength of the traditional GA but also greatly enhances the synthetic accessibility and success rate of the generated molecules with desired properties. Calculations on the two benchmarks illustrate that ChemistGA achieves impressive performance among the state-of-the-art baselines, and it opens a new avenue for the application of generative models to real-world drug discovery scenarios.

show abstract

Section: Results and Discussionmentioning

confidence: 99%

ChemistGA: A Chemical Synthesizable Accessible Molecular Generation Algorithm for Real-World Drug Discovery

Wang

Sun

et al. 2022

J. Med. Chem.

View full text Add to dashboard Cite

show abstract

“…It then chooses actions that lead to situations it cannot predict well, thus maximizing its own understanding of the environment. It has been shown using curious agents in simulated virtual universes 124 and robot agents in real laboratories 84 that curiosity is an efficient exploration strategy. Alternative intrinsic rewards for artificial agents are ‘computational creativity’ 125 , 126 and ‘surprise’ 127 .…”

Section: Three Dimensions Of Computer-assisted Understandingmentioning

confidence: 99%

On scientific understanding with artificial intelligence

et al. 2022

Self Cite

View full text Add to dashboard Cite

An oracle that correctly predicts the outcome of every particle physics experiment, the products of every possible chemical reaction or the function of every protein would revolutionize science and technology. However, scientists would not be entirely satisfied because they would want to comprehend how the oracle made these predictions. This is scientific understanding, one of the main aims of science. With the increase in the available computational power and advances in artificial intelligence, a natural question arises: how can advanced computational systems, and specifically artificial intelligence, contribute to new scientific understanding or gain it autonomously? Trying to answer this question, we adopted a definition of ‘scientific understanding’ from the philosophy of science that enabled us to overview the scattered literature on the topic and, combined with dozens of anecdotes from scientists, map out three dimensions of computer-assisted scientific understanding. For each dimension, we review the existing state of the art and discuss future developments. We hope that this Perspective will inspire and focus research directions in this multidisciplinary emerging field.

show abstract

“…As previously mentioned, the challenging syntax of SMILES strings makes their construction difficult. For this reason, Thiede et al 91 worked with SELFIES strings, in which every combination of characters is valid and the substrings generated during the construction process can be directly interpreted. Optimisation is handled via PPO, while the reward function is defined by a combination of an extrinsic reward (based on the predicted properties of the molecule) and an intrinsic reward named curiosity to encourage increased exploration of the state space.…”

Section: Applications Of Reinforcement Learning In Chemistrymentioning

confidence: 99%