A new method for the elimination of uninformative variables in multivariate data sets is proposed. To achieve this, artificial (noise) variables are added and a closed form of the PLS or PCR model is obtained for the data set containing the experimental and the artificial variables. The experimental variables that do not have more importance than the artificial variables, as judged from a criterion based on the b coefficients, are eliminated. The performance of the method is evaluated on simulated data. Practical aspects are discussed on experimentally obtained near-IR data sets. It is concluded that the elimination of uninformative variables can improve predictive ability.
A comparison of multiple linear regression (MLR) with partial least-squares (PLS) regression is presented, for the multivariate modeling of hydroxyl number in a certain polymer of a heterogeneous near-IR spectroscopic data set The MLR model was performed by selecting the variables with a genetic algorithm. A good model could be obtained with both methods. It was shown that the MLR and PLS solutions are very similar. The two models
In process analytical applications it is not always possible to keep the measurement conditions constant. However, fluctuations in external variables such as temperature can have a strong influence on measurement results. For example, nonlinear temperature effects on near-infrared (NIR) spectra may lead to a strongly biased prediction result from multivariate calibration models such as PLS. A new method, called Continuous Piecewise Direct Standardization (CPDS) has been developed for the correction of such external influences. It represents a generalization of the discrete PDS calibration transfer method and is able to adjust for continuous nonlinear influences such as the temperature effects on spectra. It was applied to shortwave NIR spectra of ethanol/water/2-propanol mixtures measured at different temperatures in the range 30-70 degrees C. The method was able to remove, almost completely, the temperature effects on the spectra, and prediction of the mole fractions of the chemical components was close to the results obtained at constant temperature.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.