“…Recently, advances in NLP have found surprising, strong results in the chemistry domain by training LLMs (Fabian et al, 2020;Chithrananda et al, 2020;NVIDIA Corporation, 2022;Tysinger et al, 2023) on string representations of molecules (Weininger, 1988;Weininger et al, 1989;Krenn et al, 2020;Cheng et al, 2023). To enable higher-level control over molecular design, multi-modal models (Edwards et al, 2021;Vall et al, 2021;Zeng et al, 2022;Xu and Wang, 2022;Su et al, 2022;Seidl et al, 2023;Xu et al, 2023;Zhao et al, 2023;Liu et al, 2023b) have been proposed.…”