“…The data for identification of structural alerts were collected from 1) the databases such as ChEMBL ( Gaulton et al, 2011 ), ChemIDplus ( Tomasulo, 2002 ), Comparative Toxicogenomics Database (CTD) ( Davis et al, 2018 ), Carcinogenic Potency Database (CPDB) ( Gold et al, 1984 ) and DrugBank ( Wishart et al, 2017 ) and 2) peer-reviewed publications through manually filtering and processing. We focused on 22 toxicity endpoints which are of most concern in environmental toxicology and drug discovery, including acute oral toxicity ( Li et al, 2014 ), chemical aquatic toxicity [ Tetrahymena pyriformis ( Cheng et al, 2011 ), Daphnia magna ( Gajewicz-Skretna et al, 2021 ), and fathead minnow ( Sun et al, 2015 )], chemical-induced hematotoxicity ( Hua et al, 2021 ), drug-induced neurotoxicity ( Jiang et al, 2020 ), drug-induced autoimmune diseases ( Wu et al, 2021 ), drug-induced ototoxicity ( Huang et al, 2021 ), drug-induced rhabdomyolysis ( Cui et al, 2019 ), endocrine disruption ( Chen et al, 2014 ), eye irritation ( Wang et al, 2017 ), hepatotoxicity ( Li et al, 2018 ), hERG inhibition ( Li et al, 2017c ), honey bee toxicity ( Li et al, 2017b ), inhalation toxicity ( Cui et al, 2021 ), mitochondrial toxicity ( Nelms et al, 2015 ), mutagenicity ( Yang et al, 2017 ), nephrotoxicity ( Shi et al, 2022 ), non-genotoxic carcinogenicity ( Benigni et al, 2013 ), reproductive and development toxicity ( Fan et al, 2018 ; Jiang et al, 2019 ), skin sensitization ( Di et al, 2019 ), and toxicity on avian species ( Zhang et al, 2015 ). For each toxicity endpoint, we searched the literature separately and included the publications with the same definition of the toxicity endpoint and consistent toxic/non-toxic classification criteria.…”