2017
DOI: 10.1371/journal.pone.0175194
|View full text |Cite
|
Sign up to set email alerts
|

Model selection with multiple regression on distance matrices leads to incorrect inferences

Abstract: In landscape genetics, model selection procedures based on Information Theoretic and Bayesian principles have been used with multiple regression on distance matrices (MRM) to test the relationship between multiple vectors of pairwise genetic, geographic, and environmental distance. Using Monte Carlo simulations, we examined the ability of model selection criteria based on Akaike’s information criterion (AIC), its small-sample correction (AICc), and the Bayesian information criterion (BIC) to reliably rank cand… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 29 publications
(24 citation statements)
references
References 58 publications
0
24
0
Order By: Relevance
“…Linked‐based causal models (e.g., Cushman, McKelvey, Hayden, & Schwartz, ; Fourtune et al., ; Wang et al., ) are a promising alternative to the previously described models as they allow inferring causal relationships among the genetic response and landscape predictors beyond simple correlations, although they may also be sensitive to collinearity. Finally, most linked‐based statistical models may be subject to model selection procedures based on the comparison of model fit parameters (e.g., Keller, Holderegger, & van Strien, ) or Akaike information criterion (Burnham & Anderson, ; but see Prunier et al., ; Franckowiak et al., ).…”
Section: How To Infer the Environmental And Individual Effects On Nonmentioning
confidence: 99%
“…Linked‐based causal models (e.g., Cushman, McKelvey, Hayden, & Schwartz, ; Fourtune et al., ; Wang et al., ) are a promising alternative to the previously described models as they allow inferring causal relationships among the genetic response and landscape predictors beyond simple correlations, although they may also be sensitive to collinearity. Finally, most linked‐based statistical models may be subject to model selection procedures based on the comparison of model fit parameters (e.g., Keller, Holderegger, & van Strien, ) or Akaike information criterion (Burnham & Anderson, ; but see Prunier et al., ; Franckowiak et al., ).…”
Section: How To Infer the Environmental And Individual Effects On Nonmentioning
confidence: 99%
“…Canopy cover, frostfree period, and growing season precipitation were downloaded in the form of raster files. Compound topographic index, heat load index, roughness, and slope were calculated from the National Elevation Dataset digital elevation model (Gesch et al 2002;Gesch 2007) using the Geomorphometry and Gradient Metrics toolbox v2.0-0 for ArcGIS (Evans et al 2014). Compound topographic index is an estimate of soil wetness based on the expected movement of water downhill, heat load index reflects the amount of solar radiation reaching a surface, and roughness quantifies the change in elevation per unit area (Evans et al 2014).…”
Section: Landscape Genetic Analysismentioning
confidence: 99%
“…Compound topographic index, heat load index, roughness, and slope were calculated from the National Elevation Dataset digital elevation model (Gesch et al 2002;Gesch 2007) using the Geomorphometry and Gradient Metrics toolbox v2.0-0 for ArcGIS (Evans et al 2014). Compound topographic index is an estimate of soil wetness based on the expected movement of water downhill, heat load index reflects the amount of solar radiation reaching a surface, and roughness quantifies the change in elevation per unit area (Evans et al 2014). To test for nonlinear relationships between these continuous variables and genetic distance, after rescaling all variables to range from 0 to 1 we created two transformed resistance surfaces for each variable: T1 = 100 (originalcostsurface) , and T2 = 100 − 100 (1−originalcostsurface) .…”
Section: Landscape Genetic Analysismentioning
confidence: 99%
“…If the probability associated with X 2 and X 3 variables became non‐significant, we selected two‐factor models as the best models. Based on previous examination of possible biases introduced by classical model selection methods used in combination with MRM (Franckowiak et al., 2017), the best model was thus selected as the one explaining the largest proportion of genetic variation (i.e. with the best R 2 ).…”
Section: Methodsmentioning
confidence: 99%