2014
DOI: 10.1111/geb.12161
|View full text |Cite
|
Sign up to set email alerts
|

Spatial leave‐one‐out cross‐validation for variable selection in the presence of spatial autocorrelation

Abstract: Aim Processes and variables measured in ecology are almost always spatially autocorrelated, potentially leading to the choice of overly complex models when performing variable selection. One way to solve this problem is to account for residual spatial autocorrelation (RSA) for each subset of variables considered and then use a classical model selection criterion such as the Akaike information criterion (AIC). However, this method can be laborious and it raises other concerns such as which spatial model to use … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
88
1

Year Published

2015
2015
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 107 publications
(92 citation statements)
references
References 56 publications
3
88
1
Order By: Relevance
“…Early solutions were leave‐ n ‐out cross‐validation approaches (Stone , Picard and Cook ) that run iteratively, each time withholding a small randomly selected subset of the data for testing. Because these approaches have also been shown to produce biased error estimates (Shao , Kohavi , Telford et al , Amesbury et al ), further corrections have been proposed, for example by incorporating distance‐based buffers around hold‐out points (Bahn , Telford and Birks , Le Rest et al ).…”
Section: The Problem Of Structured Datamentioning
confidence: 99%
“…Early solutions were leave‐ n ‐out cross‐validation approaches (Stone , Picard and Cook ) that run iteratively, each time withholding a small randomly selected subset of the data for testing. Because these approaches have also been shown to produce biased error estimates (Shao , Kohavi , Telford et al , Amesbury et al ), further corrections have been proposed, for example by incorporating distance‐based buffers around hold‐out points (Bahn , Telford and Birks , Le Rest et al ).…”
Section: The Problem Of Structured Datamentioning
confidence: 99%
“…Spatial autocorrelation in residuals can be addressed by either including spatial structure in the model error term (e.g., Lichstein et al, 2002) or using additional predictor variable(s) that are ecologically meaningful and also match the spatial pattern of the residuals. The spatial leave-oneout procedure suggested by Le Rest et al (2014) , which is a special case of spatial cross-validation where each point left out is spatially independent from the others, can also be used to account for residual spatial autocorrelation. Higher spatial autocorrelation in species occurrence data can result in highly inflated test evaluation statistics and erroneous model projections (Boria et al, 2014).…”
Section: Accepted Manuscriptmentioning
confidence: 99%
“…When a diagnostic is evaluated using the same data that were used to fit the model, the results are well known to be biased (Bayarri and Berger 2000;Dahl 2006;Efron 1986;Hjort et al 2006). An alternative is to use cross-validation (Gelfand 1996;Le Rest et al 2014;Stone 1974;Zhang and Wang 2010), where the model is fitted to m < n observations and evaluated using the remaining n − m observations. While crossvalidation is considered a gold standard for diagnostics (Gelfand et al 1992;Marshall and Spiegelhalter 2003;Stern and Cressie 2000), it is computationally expensive and may be impractical for very large datasets.…”
Section: Discussionmentioning
confidence: 99%