Abstract:The Density Functional Theory (DFT) method and the 6-31G** basis set were employed to calculate the molecular properties of artemisinin and 20 derivatives with different degrees of cytotoxicity against the human hepatocellular carcinoma HepG2 line. Principal component analysis (PCA) and hierarchical cluster analysis (HCA) were employed to select the most important descriptors related to anticancer activity. The significant molecular descriptors related to the compounds with anticancer activity were the ALOGPS_log, Mor29m, IC5 and GAP energy. The Pearson correlation between
OPEN ACCESSMolecules 2014, 19 10671 activity and most important descriptors were used for the regression partial least squares (PLS) and principal component regression (PCR) models built. The regression PLS and PCR were very close, with variation between PLS and PCR of R 2 = ±0.0106, R 2 ajust = ±0.0125, s = ±0.0234, F (4,11) = ±12.7802, Q 2 = ±0.0088, SEV = ±0.0132, PRESS = ±0.4808 and S PRESS = ±0.0057. These models were used to predict the anticancer activity of eight new artemisinin compounds (test set) with unknown activity, and for these new compounds were predicted pharmacokinetic properties: human intestinal absorption (HIA), cellular permeability (P CaCO2 ), cell permeability Maden Darby Canine Kidney (P MDCK ), skin permeability (P Skin ), plasma protein binding (PPB) and penetration of the blood-brain barrier (C Brain/Blood ), and toxicological: mutagenicity and carcinogenicity. The test set showed for two new artemisinin compounds satisfactory results for anticancer activity and pharmacokinetic and toxicological properties. Consequently, further studies need be done to evaluate the different proposals as well as their actions, toxicity, and potential use for treatment of cancers.