2022
DOI: 10.3390/land11112098
|View full text |Cite
|
Sign up to set email alerts
|

Simple Optimal Sampling Algorithm to Strengthen Digital Soil Mapping Using the Spatial Distribution of Machine Learning Predictive Uncertainty: A Case Study for Field Capacity Prediction

Abstract: Machine learning models are now capable of delivering coveted digital soil mapping (DSM) benefits (e.g., field capacity (FC) prediction); therefore, determining the optimal sample sites and sample size is essential to maximize the training efficacy. We solve this with a novel optimal sampling algorithm that allows the authentic augmentation of insufficient soil features using machine learning predictive uncertainty. Nine hundred and fifty-three forest soil samples and geographically referenced forest informati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 42 publications
0
4
0
Order By: Relevance
“…S0. When multiple "maps of points" are taken at randomly chosen locations of heterogeneous surfaces, with each "map of points" comprising a representative number of the feature under analysis (in our case, grains/grain boundaries), then the sampling method is considered random [60][61][62]. Moreover, as the SECCM maps were described by sparse matrices (either regular or irregular), the randomness of our sampling method was further increased (the variability of the measurements increases when not every point on the grid is considered).…”
Section: Seccm Experimentsmentioning
confidence: 99%
“…S0. When multiple "maps of points" are taken at randomly chosen locations of heterogeneous surfaces, with each "map of points" comprising a representative number of the feature under analysis (in our case, grains/grain boundaries), then the sampling method is considered random [60][61][62]. Moreover, as the SECCM maps were described by sparse matrices (either regular or irregular), the randomness of our sampling method was further increased (the variability of the measurements increases when not every point on the grid is considered).…”
Section: Seccm Experimentsmentioning
confidence: 99%
“…Analysis was conducted to determine the minimum number of flood events required for effective flood peak estimation. As the machine learning model is a data-driven model and is directly affected by the quality and quantity of data [28], we compared the relationship between data quantity and prediction capacity. The number of flood events contained in the data of the training dataset was increased gradually to observe any changes in prediction accuracy (Figure 4).…”
Section: Predictive Performance Changes With Data Accumulationmentioning
confidence: 99%
“…Deep learning approaches are data-driven models; therefore, the predictive accuracy is determined by the quality and quantity of the input data [28]. Thus, to develop a reasonable predictive model, an adequate amount of data must be collected, and the model should be trained based on these data.…”
Section: Introductionmentioning
confidence: 99%
“…Folorunso et al reviewed research predicting soil quality based on machine learning and analyzed the composition and quality of the soil, the prediction of soil parameters, existing soil datasets, soil maps, the influence of soil nutrients on crop growth, and the status of soil information system research 19 . Yang et al proposed a new optimal sampling algorithm capable of realistically enhancing insufficient soil properties using machine learning uncertainty prediction 20 . Sun et al developed a coupled retrieval approach to quantify nickel (Ni) concentration in agricultural soil using spaceborne hyperspectral imagery.…”
Section: Introductionmentioning
confidence: 99%