2020
DOI: 10.5194/soil-6-565-2020
|View full text |Cite
|
Sign up to set email alerts
|

The influence of training sample size on the accuracy of deep learning models for the prediction of soil properties with near-infrared spectroscopy data

Abstract: Abstract. The number of samples used in the calibration data set affects the quality of the generated predictive models using visible, near and shortwave infrared (VIS–NIR–SWIR) spectroscopy for soil attributes. Recently, the convolutional neural network (CNN) has been regarded as a highly accurate model for predicting soil properties on a large database. However, it has not yet been ascertained how large the sample size should be for CNN model to be effective. This paper investigates the effect of the trainin… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
42
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 115 publications
(45 citation statements)
references
References 39 publications
3
42
0
Order By: Relevance
“…Cui & Fearn, 2018). For convolutional neural networks, this break‐even point was described based on a empirical study of predicting soil properties with near‐infrared spectra with about 1500 samples (Ng et al, 2020). This suggests that for studies with a higher number of samples in the calibration set compared with ours, ANN and GBM‐based regression should be reconsidered.…”
Section: Discussionmentioning
confidence: 99%
“…Cui & Fearn, 2018). For convolutional neural networks, this break‐even point was described based on a empirical study of predicting soil properties with near‐infrared spectra with about 1500 samples (Ng et al, 2020). This suggests that for studies with a higher number of samples in the calibration set compared with ours, ANN and GBM‐based regression should be reconsidered.…”
Section: Discussionmentioning
confidence: 99%
“…A critical component that must be taken into consideration is the number of observation available for training the deep learning model. With a smaller data set, the conventional model performed better than the deep learning model (Ng et al, 2020). However, a simple technique of adding random noise to the observation to augment the data set would be sufficient to get around this complication and outperform conventional models.…”
Section: Discussionmentioning
confidence: 99%
“…Human-computer interaction (HCI) refers to the information exchange process between people and computers who use a certain dialogue language to complete certain tasks in a certain way. With the continuous development of science and technology, mobile networks are playing an increasingly important role in people's daily lives [16]. Various terminals such as smartphones and laptops are emerging in an endless stream, and various applications that follow have also begun to appear on the mobile network [17].…”
Section: Study Of Based On the Role Of Intelligentmentioning
confidence: 99%