2020
DOI: 10.1007/s42452-020-03307-8
|View full text |Cite
|
Sign up to set email alerts
|

Three oversampling methods applied in a comparative landslide spatial research in Penang Island, Malaysia

Abstract: Two main problems in landslide spatial prediction research are the lack of landslide samples (minority) to train the models and the misunderstanding of assigning equal costs to different misclassifications. In order to handle the problems properly, the research is conducted based on two main objectives, which are to augment the landslide sample data in an efficient way and to assign proper unequal costs to the two types of error when training and evaluating models. Resampling techniques, including random overs… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(4 citation statements)
references
References 49 publications
(60 reference statements)
0
4
0
Order By: Relevance
“…Multiple covariance analysis is often used to evaluate the correlation between factors to ensure that there is no linear correlation between factors 37 . In LSM, if the VIF value of a factor is greater than 10 or the TOL value is less than 0.1, it means that the factor has serious multicollinearity problems, and the factor should be removed 38 .…”
Section: Methodsmentioning
confidence: 99%
“…Multiple covariance analysis is often used to evaluate the correlation between factors to ensure that there is no linear correlation between factors 37 . In LSM, if the VIF value of a factor is greater than 10 or the TOL value is less than 0.1, it means that the factor has serious multicollinearity problems, and the factor should be removed 38 .…”
Section: Methodsmentioning
confidence: 99%
“…Oversampling addresses class imbalance between landslide occurrences/historical records (positive instances) and non-landslide areas (negative instances) in the dataset. The imbalance can lead to biased models that favor the majority class [54]. This allows logistic regression, a binary classification algorithm commonly used in landslide susceptibility assessment, to learn from a broader range of examples and improve its ability to predict landslide occurrences accurately.…”
Section: Studied Case Locationmentioning
confidence: 99%
“…Research by Gao [55] had use three oversampling methods to relate the spatial research with landslide in Penang Island. With the purpose to augment the landslide sample data in an effective way and to assign proper unequal cost, this research had come out with the results of landslide susceptibility mapping to conduct landslide spatial assessment.…”
Section: Landslidementioning
confidence: 99%