2019
DOI: 10.1002/ece3.5654
|View full text |Cite
|
Sign up to set email alerts
|

The MIAmaxent R package: Variable transformation and model selection for species distribution models

Abstract: The widely used “Maxent” software for modeling species distributions from presence‐only data (Phillips et al., Ecological Modelling, 190, 2006, 231) tends to produce models with high‐predictive performance but low‐ecological interpretability, and implications of Maxent's statistical approach to variable transformation, model fitting, and model selection remain underappreciated. In particular, Maxent's approach to model selection through lasso regularization has been shown to give less parsimonious distribution… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
47
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 43 publications
(47 citation statements)
references
References 83 publications
0
47
0
Order By: Relevance
“…Figure S1: Frequency of Observed Presence (FOP) plots for the predictors of interest (WorldClim bio1-annual mean temperature; bio5-max temperature of warmest month; bio12-annual precipitation; bio10-mean temperature of warmest month; and bio11-mean temperature of coldest month (temperature in • C*10 and precipitation in mm); GEZ-global ecozone; ENVIREM continentality (in • C) for the East Asia region with 325 presences and 980 background points. These plots were created via the R package MIAmaxent [38], where the dots are the values of the predictors at the given locations, the red line a smoother regression line, and the background distribution approximate the data density; Table S1: Results of estimating the best MaxEnt model features and regularization (rm) for each geographic area of interest using the ENMeval R package [39]. Results are based on the random 5-fold method for data partitioning, where background points were randomly selected from the area of unsuitable habitat modeled from the BIOCLIM algorithm (a classic presence-only climate envelope model), and settings that primarily minimize AICc (i.e., ∆AIcc = 0) were selected for our best models (in bold); however, the AUC metrics and OR (threshold-based omission rates for test localities) metrics were calculated to select less complex models (when compared to Frequency of Observed Presence plots) and lowest number of parameters if those were giving similar or higher AUC and lowest OR because a low OR indicates less overfitting.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Figure S1: Frequency of Observed Presence (FOP) plots for the predictors of interest (WorldClim bio1-annual mean temperature; bio5-max temperature of warmest month; bio12-annual precipitation; bio10-mean temperature of warmest month; and bio11-mean temperature of coldest month (temperature in • C*10 and precipitation in mm); GEZ-global ecozone; ENVIREM continentality (in • C) for the East Asia region with 325 presences and 980 background points. These plots were created via the R package MIAmaxent [38], where the dots are the values of the predictors at the given locations, the red line a smoother regression line, and the background distribution approximate the data density; Table S1: Results of estimating the best MaxEnt model features and regularization (rm) for each geographic area of interest using the ENMeval R package [39]. Results are based on the random 5-fold method for data partitioning, where background points were randomly selected from the area of unsuitable habitat modeled from the BIOCLIM algorithm (a classic presence-only climate envelope model), and settings that primarily minimize AICc (i.e., ∆AIcc = 0) were selected for our best models (in bold); however, the AUC metrics and OR (threshold-based omission rates for test localities) metrics were calculated to select less complex models (when compared to Frequency of Observed Presence plots) and lowest number of parameters if those were giving similar or higher AUC and lowest OR because a low OR indicates less overfitting.…”
Section: Discussionmentioning
confidence: 99%
“…Results are based on the random 5-fold method for data partitioning, where background points were randomly selected from the area of unsuitable habitat modeled from the BIOCLIM algorithm (a classic presence-only climate envelope model), and settings that primarily minimize AICc (i.e., ∆AIcc = 0) were selected for our best models (in bold); however, the AUC metrics and OR (threshold-based omission rates for test localities) metrics were calculated to select less complex models (when compared to Frequency of Observed Presence plots) and lowest number of parameters if those were giving similar or higher AUC and lowest OR because a low OR indicates less overfitting. [L: linear, Q: Quadratics, H: hinge]; Table S2: Results of the nested MaxEnt-type models built during the forward DV selection using the MIAmaxent R package [38], where DV is the derived variables from the original ones using a specified transformation [Linear, Quadratics, Monotonous, Forward or Reverse Hinge, or Threshold for Continuous variable and binary for Categorical variable] that balance complexity of model with its fitness. Alpha = 0.005 was used to set the threshold for the amount of variation a DV must explain to be kept, i.e., P < alpha.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations