“…36,98 These studies showed that similarly to the impacts of data quality, 28 molecular representation also has a great impact on models' performance. Despite Tayyebi et al 36 being able to achieve an MAE of 0.64 on solubility challenge 1 when using Morgan fingerprints (MF), Zagidullin et al 98 reported poor performance when using MF. Our approach, on the other hand, is based on extracting information from simple string representations, a more straightforward raw data.…”