2018
DOI: 10.1002/cem.3004
|View full text |Cite
|
Sign up to set email alerts
|

Data augmentation in food science: Synthesising spectroscopic data of vegetable oils for performance enhancement

Abstract: Generating more accurate, efficient, and robust classification models in chemometrics, able to address real‐world problems in food analysis, is intrinsically related with the amount of available calibration samples. In this paper, we propose a data augmentation solution to increase the performance of a classification model by generating realistic data augmented samples. The feasibility of this solution has been evaluated on 3 main different experiments where Fourier transform mid infrared (FT‐IR) spectroscopic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
14
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 53 publications
1
14
0
Order By: Relevance
“…Recent work demonstrated the utility augmenting simulated spectra to represent possible variations in real spectra. 72 Such an approach is ongoing in the laboratory. An alternative approach also undergoing studies in the laboratory is classification updating (also referred to as transfer learning).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Recent work demonstrated the utility augmenting simulated spectra to represent possible variations in real spectra. 72 Such an approach is ongoing in the laboratory. An alternative approach also undergoing studies in the laboratory is classification updating (also referred to as transfer learning).…”
Section: Resultsmentioning
confidence: 99%
“…A supplementary approach to improve microplastic identification by classification is to increase the number of reference spectra using additional simulated samples expressed as various linear combination of current spectra, 72 simulated weathered samples under additional simulated conditions, and environmental samples in order to form base reference classes spanning a larger variety spectral distortions. Classification updating is another strategy that has proven to be successful in other classification situations.…”
Section: Resultsmentioning
confidence: 99%
“…[28] Simulated spectra based on the measured datasets were used to confirm the results of PLSR models based on original datasets. [17,18] The five independent G1-G5 datasets were also applied to build PLSR models and these independent models were tested with the original spectra. Mean plots of RMSE and RPD of PLSR models with confidence intervals (±0.95) are shown in Fig.…”
Section: Results Of Partial Least Squares Regressionsmentioning
confidence: 99%
“…These kinds of data augmentation can facilitate the optimization of modelbuilding techniques when the low availability and high costs of samples may limit experimental sample sizes. [17,18] Notwithstanding, it should be taken into account that the models thus constructed only represent the variability of the samples being tested. For later use, it is recommended to increase the number of samples and rebuild calibrations.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, a differentiation of the carrageenan+gum+excipient samples between carrageenan+gum+KCl and carrageenan+gum+MD samples is also noticeable. Regarding the testing phase, before the same data pre-treatment, linear interpolation was applied to the test spectra in order to get the desirable number of variables (n=1050) since they were acquired with a spectrometer with different spectral resolution [21,22] to reduce overfitting to the NIR calibration set.…”
Section: Data Pre-treatmentmentioning
confidence: 99%