2020
DOI: 10.1016/j.chemolab.2020.104105
|View full text |Cite
|
Sign up to set email alerts
|

Investigating the need for preprocessing of near-infrared spectroscopic data as a function of sample size

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
23
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(31 citation statements)
references
References 17 publications
1
23
0
Order By: Relevance
“…Given that the sample size steps were taken larger, those results still confirm the theory based on the n/d given by Vapnik [20]. Complementary, in the work by Shoot [15], an absolute threshold of 100 samples was detected to be optimal for calibration models involving some food products. The optimal complexity of the models in the mentioned study was chosen as the location of the minimum RMSECV, which can correspond to RMSECV curves of long tails as it was the case of the milk data for the prediction of lactose content.…”
Section: About the Sample Sizesupporting
confidence: 74%
See 2 more Smart Citations
“…Given that the sample size steps were taken larger, those results still confirm the theory based on the n/d given by Vapnik [20]. Complementary, in the work by Shoot [15], an absolute threshold of 100 samples was detected to be optimal for calibration models involving some food products. The optimal complexity of the models in the mentioned study was chosen as the location of the minimum RMSECV, which can correspond to RMSECV curves of long tails as it was the case of the milk data for the prediction of lactose content.…”
Section: About the Sample Sizesupporting
confidence: 74%
“…This means that a different sample size n will be required for an easy-to-predict dominant chemical constituent than for a minor constituent that is harder to predict. The same idea was commented by Schoot about the sample size for calibration models of different products [15], although no further analysis was provided to explain it. This insight was revealed by the analysis of the ratio n/d and the optimal sample sizes obtained in each case study.…”
Section: About the Sample Sizementioning
confidence: 97%
See 1 more Smart Citation
“…Prior to further modeling, several methods were selected to improve the original spectra. The spectra preprocessing stage was an essential aspect of multivariate calibration, majorly aimed at eliminating irrelevant background signal or distortion in the original spectra, in a bid to enhance predictive accuracy or data interpretation, thus, maximizing the correlation to the desired quality parameters ( Schoot et al., 2020 ; Singpoonga et al., 2020 ) Meanwhile, PLSR was used to correlate the independent variables (absorbance spectra) with the dependent variables, the SSC and water content of intact zucchini, bitter gourd, ridge gourd, melon, chayote, cucumber (desired quality parameters). A single model was adopted for to evaluate the SSC of all samples, while another calibration model was employed to estimate the water content.…”
Section: Resultsmentioning
confidence: 99%
“…[23,26]), and smoothing (Savitzky-Golay (SG) smoothing, etc. [27,28]). The above pre-processing methods effectively remove the effects of the instrument background or drift on the signal, eliminate the effects of scattering due to the inhomogeneous distribution of particles and different particle sizes on the spectrum, remove or reduce noise, and improve the signal-tonoise ratio.…”
Section: Introductionmentioning
confidence: 99%