“…Although QM8 and QM9 are of unprecedented size compared to earlier common benchmark sets in quantum chemistry, which comprise several hundred to thousands of molecules, they still contain only small molecules with restricted elemental diversity (H, C, N, O and F) and simple bonding patterns [67]. They lack larger, more complex molecules with, e.g., extended heteroaromatic backbones and attached functional groups, as commonly targeted in organic synthesis [6,38] and applied in (opto-)electronic [24,32,33,73] or pharmaceutical research [38,53,69]. We have based the spectroscopic dataset presented in this article on a diverse collection of 64,725 organic crystals that were extracted from the Cambridge Structural Database (CSD) [2] by Schober et al. [47,48].…”