2021
DOI: 10.3390/rs13081433
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the Effect of Training Sampling Design on the Performance of Machine Learning Classifiers for Land Cover Mapping Using Multi-Temporal Remote Sensing Data and Google Earth Engine

Abstract: Machine learning classifiers are being increasingly used nowadays for Land Use and Land Cover (LULC) mapping from remote sensing images. However, arriving at the right choice of classifier requires understanding the main factors influencing their performance. The present study investigated firstly the effect of training sampling design on the classification results obtained by Random Forest (RF) classifier and, secondly, it compared its performance with other machine learning classifiers for LULC mapping using… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
59
0
10

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 88 publications
(72 citation statements)
references
References 53 publications
3
59
0
10
Order By: Relevance
“…In most methods, it is not possible to obtain accuracy on the training set, but only on the validation and test set. In many publications the accuracy of validation (OA) is reported, which in almost all cases is above 80% (e.g., SV M = 97.7% [7], SV M = 98.96% [20], SV Mmodi f ied = 98.07% [13], RF = 93% [22], RF = 86.98% [23], RF = 83.96% [39], Dynamic Time Warping algorithm, NDVI time series classification = 72-89%, multi-band classification = 76-88% [40]). In some cases, the accuracy for test data is also delivered: 84.2% [7], 88.94% [20], which means in the case of 13.5% [7] less value than the accuracy of the validation and in the case of 10.02% [20] lower.…”
Section: Discussionmentioning
confidence: 99%
“…In most methods, it is not possible to obtain accuracy on the training set, but only on the validation and test set. In many publications the accuracy of validation (OA) is reported, which in almost all cases is above 80% (e.g., SV M = 97.7% [7], SV M = 98.96% [20], SV Mmodi f ied = 98.07% [13], RF = 93% [22], RF = 86.98% [23], RF = 83.96% [39], Dynamic Time Warping algorithm, NDVI time series classification = 72-89%, multi-band classification = 76-88% [40]). In some cases, the accuracy for test data is also delivered: 84.2% [7], 88.94% [20], which means in the case of 13.5% [7] less value than the accuracy of the validation and in the case of 10.02% [20] lower.…”
Section: Discussionmentioning
confidence: 99%
“…Not as much research is done into determining how data sampling strategies affect ML classifiers. The authors in [101] compared different data sampling strategies and their effects on how different ML classifiers performed on LULC tasks. A multi-seasonal sample set was collected in [88] for global land cover mapping in 2015 from Landsat 8 images.…”
Section: Land Cover Classificationmentioning
confidence: 99%
“…The random forest is an ensemble method specially designed for a decision tree classifier, and the selection of random attributes is further added to its training process. Using similar parameters to those used for the decision tree, the random forest model is easy to implement and shows good effects [32,33]. In this research, parameters are determined by using cross-validation and grid search methods.…”
Section: Random Forestmentioning
confidence: 99%
“…(2) Machine learning methods: These methods feature pixel-based pattern recognition analysis, mainly including supervised and unsupervised classification techniques. The supervised methods mainly include neural network [21][22][23][24][25], support vector machine (SVM) [26][27][28], logistic regression [29,30], and random forest [31][32][33], and the unsupervised classification methods mainly include K-means clustering [34] and ISODATA clustering [35,36] methods. The machine learning algorithm has been widely used in remote sensing water extraction due to its high accuracy.…”
Section: Introductionmentioning
confidence: 99%