“…[ 3 , 4 , 5 ] Deep learning‐based molecular design algorithms can extract high‐level molecular features from “raw” molecular representations,[ 6 , 7 , 8 , 9 , 10 ] such as molecular graphs and the Simplified Molecular Input Line Entry System (SMILES, Figure 1 a ), [11] potentially allowing them to access unexplored regions of the chemical space. [12] Previous studies showed that chemical language models (CLMs),[ 13 , 14 ] in particular generative deep learning models trained on SMILES strings, can generate novel molecules with experimentally validated bioactivity. [ 9 , 15 , 16 ] CLMs have shown the ability to learn focused chemical features from small collections of template molecules by means of transfer learning, that is, a method to reuse previously learned knowledge on a new task for which the available data is scarce.…”