2018
DOI: 10.1111/ecog.03188
|View full text |Cite
|
Sign up to set email alerts
|

The effect of sample size on the accuracy of species distribution models: considering both presences and pseudo‐absences or background sites

Abstract: Most high-performing species distribution modelling techniques require both presences, and either absences or pseudo-absences or background points. In this paper, we explore the effect of sample size, towards developing improved strategies for modelling. We generated 1800 virtual species with three levels of prevalence using ten modelling techniques, while varying the number of training presences (NTP) and the number of random points (NRP representing pseudo-absences or background sites). For five of the ten m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
92
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 118 publications
(96 citation statements)
references
References 70 publications
(144 reference statements)
3
92
0
1
Order By: Relevance
“…Liu et al (2019) argued that, as VS represent simulations, standard statistical analyses cannot be used to analyze the results. Liu et al (2019) argued that, as VS represent simulations, standard statistical analyses cannot be used to analyze the results.…”
Section: General Recommendations and Guidelinesmentioning
confidence: 99%
See 1 more Smart Citation
“…Liu et al (2019) argued that, as VS represent simulations, standard statistical analyses cannot be used to analyze the results. Liu et al (2019) argued that, as VS represent simulations, standard statistical analyses cannot be used to analyze the results.…”
Section: General Recommendations and Guidelinesmentioning
confidence: 99%
“…A fourth question that we see occasionally addressed is what kind of statistical analyses should be applied to VS studies. Liu et al (2019) argued that, as VS represent simulations, standard statistical analyses cannot be used to analyze the results. This was based on an argument advanced by White et al (2014) and dealing with simulated data at large.…”
Section: General Recommendations and Guidelinesmentioning
confidence: 99%
“…Thinned occurrence records were projected onto a 30 arc‐second 2 (~1 km 2 ) gridded map matching the predictor variable data for both Kūmarahou (105 grid cells) and Kuta (941 grid cells). To optimize SDMs, Kūmarahou and Kuta pseudo‐absences (210 and 1,882, respectively; Figures S2 and S3) were selected for a ratio of pseudo‐absence to presence of 2:1 (Barbet‐Massin et al., ; Liu, Newell, & White, ).…”
Section: Methodsmentioning
confidence: 99%
“…The recent potential distribution for each species was determined from the relationship between the gridded presence/pseudo‐absence data and environmental variables, including climate under recent (1961–1990) conditions (Anderson et al., ). We used seven standard SDMs available in the BIOMOD2 package (Thuiller, Georges, & Engler, ) to increase accuracy and ensure that our results were robust to differences in model type (Elith et al, ; Liu, Newell, & White, ): Artificial Neural Network (ANN), Flexible Discriminant Analysis (FDA), Generalized Boosting Models (GBM), Generalized Additive Model (GAM), Generalized Linear Model (GLM), Multiple Adaptive Regression Splines (MARS), and Random Forest (RF). For each species, SDMs were run in 10 replicates; each replicate was built using 70% of the presence and pseudo‐absence data (randomly sampled) and evaluated with remaining 30% of presence and pseudo‐absence data (Guisan et al., ).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation