Gully erosion triggers land degradation and restricts the use of land. This study assesses the spatial relationship between gully erosion (GE) and geo-environmental variables (GEVs) using Weights-of-Evidence (WoE) Bayes theory, and then applies three data mining methods—Random Forest (RF), boosted regression tree (BRT), and multivariate adaptive regression spline (MARS)—for gully erosion susceptibility mapping (GESM) in the Shahroud watershed, Iran. Gully locations were identified by extensive field surveys, and a total of 172 GE locations were mapped. Twelve gully-related GEVs: Elevation, slope degree, slope aspect, plan curvature, convergence index, topographic wetness index (TWI), lithology, land use/land cover (LU/LC), distance from rivers, distance from roads, drainage density, and NDVI were selected to model GE. The results of variables importance by RF and BRT models indicated that distance from road, elevation, and lithology had the highest effect on GE occurrence. The area under the curve (AUC) and seed cell area index (SCAI) methods were used to validate the three GE maps. The results showed that AUC for the three models varies from 0.911 to 0.927, whereas the RF model had a prediction accuracy of 0.927 as per SCAI values, when compared to the other models. The findings will be of help for planning and developing the studied region.
Every year, gully erosion causes substantial damage to agricultural land, residential areas and infrastructure, such as roads. Gully erosion assessment and mapping can facilitate decision making in environmental management and soil conservation. Thus, this research aims to propose a new model by combining the geographically weighted regression (GWR) technique with the certainty factor (CF) and random forest (RF) models to produce gully erosion zonation mapping. The proposed model was implemented in the Mahabia watershed of Iran, which is highly sensitive to gully erosion. Firstly, dependent and independent variables, including a gully erosion inventory map (GEIM) and gullyrelated causal factors (GRCFs), were prepared using several data sources. Secondly, the GEIM was randomly divided into two groups: training (70%) and validation (30%) datasets. Thirdly, tolerance and variance inflation factor indicators were used for multicollinearity analysis. The results of the analysis corroborated that no collinearity exists amongst GRCFs. A total of 12 topographic, hydrologic, geologic, climatologic, environmental and soil-related GRCFs and 150 gully locations were used for modelling. The watershed was divided into eight homogeneous units because the importance level of the parameters in different parts of the watershed is not the same. For this purpose, coefficients of elevation, distance to stream and distance to road parameters were used. These coefficients were obtained by extracting bi-square kernel and AIC via the GWR method. Subsequently, the RF-CF integrated model was applied in each unit. Finally, with the units combined, the final gully erosion susceptibility map was obtained. On the basis of the RF model, distance to stream, distance to road and land use/land cover exhibited a high influence on gully formation. Validation results using area under curve indicated that new GWR-CF-RF approach has a higher predictive accuracy 0.967 (96.7%) than the individual models of CF 0.763 (76.3%) and RF 0.776 (77.6%) and the CF-RF integrated model 0.897 (89.7%). Thus, the results of this research can be used by local managers and planners for environmental management.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.