In this study, near infrared spectroscopy (NIRS) technique was used for quantitative detection of quaternary blended oil. After a series of preprocessing, the prediction effects of the three models and their preprocessing combinations were compared. Taking soybean oil content prediction as an example, random forest (RF) model had better performance after second derivative (D2) optimization. In feature selection, a two‐step feature selection method was adopted to extract the feature wavelength. First, the elastic net (EN) was used for the initial screening of feature wavelengths, and most irrelevant features were eliminated. The number of feature wavelengths was reduced from 1048 to 134. After that, the competitive adaptive re‐weighted sampling (CARS) method was used to screen the remaining characteristic wavelengths more carefully, and 20 effective characteristic wavelengths were selected. Finally, a quantitative detection model was established based on 20 effective characteristic wavelengths selected by EN + CARS. Evaluated by the test set, The correlation coefficient of determination (R2), root‐mean‐square error of prediction (RMSEP) and Relative Percent Difference (RPD) values of 2D + EN + CARS + RF model were 0.97953, 1.34306 and 7.08875, respectively. The results showed that the two‐step feature selection method can effectively extract the feature wavelength, and the NIRS technology can realize the intelligent detection of blended oil components.
In this study, the near infrared spectroscopy (NIRS) technology was used to quantitatively detect talcum powder in 82 wheat flour samples. Based on anomaly detection, sample division, and spectral preprocessing, four models were established to predict the content of talcum powder in wheat flour. Among them, the performance of Bayesian ridge regression (BRR) combined with second derivative (2D) was proved to be the best. In addition, 46 effective features were selected using a multilevel feature method combining improved particle swarm optimization (PSO) and genetic algorithm (GA). At this time, the coefficient of determination (R2_PRE), root mean square error of prediction (RMSEP), and relative percent difference (RPD) values of the BRR model on the prediction set reached 0.9802, 0.8914, and 6.9263. The results showed that NIRS technology was feasible in detecting the content of talcum powder in wheat flour. At the same time, the effectiveness of multilevel method was better than that of single‐level method, and the performance of improved POS was better than that of PSO.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.