Near-infrared spectroscopy has been widely used to characterize the chemical composition of tobacco because it is fast, economical, and nondestructive. However, few predictive models perform ideally when applied to large spectral libraries of tobacco and its various chemical indicators. In this study, the just-in-time learning-integrated partial least-squares (JIT-PLS) modeling strategy was applied for the first time to quantitatively analyze 71 chemical components in Chinese tobacco. Approximately 18000 tobacco samples from China were analyzed to find appropriately similar measurements and propose suitable and flexible similar subsets from the calibration for each test sample. In total, 879 representative aged tobacco leaf samples and 816 cigarette samples were used as external instances to evaluate the practical predicting ability of the proposed method. The most suitable similar subsets for each test sample could be selected by limiting the Euclidean distance and number of similar subsets to 0−3.0 × 10 −9 and 10−300, respectively. The majority of the JIT-PLS models performed significantly better than traditional PLS models. Specifically, using JIT-PLS instead of traditional PLS models increased the R 2 values from 0.347−0.984 to 0.763−0.996, and from 0.179−0.981 to 0.506−0.989 for the prediction of 67 and 71 components in aged tobacco leaf and cigarette samples, respectively. Good prediction ability was demonstrated for routine chemical components, polyphenolic compounds, organic acids, and other compounds, with the mean ratios of prediction to deviation (RPD mean ) being 7. 74, 4.39, 4.05, and 5.48, respectively). The proposed methodology could simultaneously determine 67 major components in large and complicated tobacco spectral libraries with high precision and accuracy, which will assist tobacco and cigarette quality control in collecting as well as processing stages.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.