Using geospatial data of wildlife presence to predict a species distribution across a geographic area is among the most common tools in management and conservation. The collection of high‐quality presence–absence (PA) data through structured surveys is, however, expensive, and managers usually have access to larger amounts of low‐quality presence‐only (PO) data collected by citizen scientists, opportunistic observations and culling returns for game species. Integrated species distribution models (ISDMs) have been developed to make the most of the data available by combining the higher‐quality, but usually scarcer and more spatially restricted, PA data with the lower‐quality, unstructured, but usually more extensive PO datasets. Joint‐likelihood ISDMs can be run in a Bayesian context using integrated nested laplace approximation methods that allow the addition of a spatially structured random effect to account for data spatial autocorrelation. Here, we apply this innovative approach to fit ISDMs to empirical data, using PA and PO data for the three prevalent deer species in Ireland: red, fallow and sika deer. We collated all deer data available for the past 15 years and fitted models predicting distribution and relative abundance at a 25 km2 resolution across the island. Model predictions were associated to spatial estimate of uncertainty, allowing us to assess the quality of the model and the effect that data scarcity has on the certainty of predictions. Furthermore, we checked the performance of the three species‐specific models using two datasets, independent deer hunting returns and deer densities based on faecal pellet counts. Our work clearly demonstrates the applicability of spatially explicit ISDMs to empirical data in a Bayesian context, providing a blueprint for managers to exploit unexplored and seemingly unusable data that can, when modelled with the proper tools, serve to inform management and conservation policies.
The use of georeferenced information on the presence of a species to predict its distribution across a geographic area is one of the most common tools in management and conservation. The collection of high-quality presence-absence data through structured surveys is, however, expensive, and managers usually have more abundant low-quality presence-only data collected by citizen scientists, opportunistic observations, and culling returns for game species. Integrated Species Distribution Models (ISDMs) have been developed to make the most of the data available by combining the higher-quality, but usually less abundant and more spatially restricted presence-absence data, with the lower quality, unstructured, but usually more extensive and abundant presence-only data. Joint-likelihood ISDMs can be run in a Bayesian context using INLA (Integrated Nested Laplace Approximation) methods that allow the addition of a spatially structured random effect to account for data spatial autocorrelation. These models, however, have only been applied to simulated data so far. Here, for the first time, we apply this approach to empirical data, using presence-absence and presence-only data for the three main deer species in Ireland: red, fallow and sika deer. We collated all deer data available for the past 15 years and fitted models predicting distribution and relative abundance at a 25 km2 resolution across the island. Models' predictions were associated to spatial estimate of uncertainty, allowing us to assess the quality of the model and the effect that data scarcity has on the certainty of predictions. Furthermore, we validated the three species-specific models using independent deer hunting returns. Our work clearly demonstrates the applicability of spatially-explicit ISDMs to empirical data in a Bayesian context, providing a blueprint for managers to exploit unused and seemingly unusable data that can, when modelled with the proper tools, serve to inform management and conservation policies.
Ecological data are being collected at a large scale from a multitude of different sources, each with their own sampling protocols and assumptions. As a result, the integration of disparate datasets is a rapidly growing area in quantitative ecology, and is subsequently becoming a major asset in understanding the shifts and trends in species' distributions. However, the tools and software available to construct statistical models to integrate these disparate datasets into a unified framework is lacking. This has made these methods inaccessible to general practitioners and has stagnated the growth of data integration in more applied settings. We therefore present PointedSDMs: an easy to use R package used to construct integrated species distribution models. It provides functions to easily format the data, fit the models in a computationally efficient way and presents the output in a format that is convenient for additional work. This paper illustrates the different uses and functions available in the package, which are designed to simplify the modelling of integrated models. A case study using the package is also presented: combining three datasets coming from different sampling protocols, all containing records of Setophaga caerulescens across Pennsylvania state.
Integration of data is needed to address many of the problems currently threatening biodiversity. There has been an exponential increase in quantity and type of biodiversity data in recent years, including presence-absence, counts, and presence-only citizen science data. Species Distribution Models (SDMs) are frequently used in ecology to predict current and future ranges of species, and are a common tool used when making conservation prioritization decisions. Current SDM practice typically underutilizes the large amount of publicly available biodiversity data and does not follow a set of standard best practices. Integrating data types with open-source tools and reproducible workflows saves time, increases collaboration opportunities, and increases the power of data inference in SDMs. Here, we address the discipline-wide call for open science and standards in SDMs by (1) proposing methods and (2) generating a reproducible workflow to integrate different available data types to increase the power of SDMs. We provide an R package, intSDM, which reduces the learning curve for the use of integrated SDMs and makes them an accessible tool for use by non-programmers. We provide code and guidance on how to accommodate users diverse needs and ecological questions with different data types available on the Global Biodiversity Information Facility (GBIF), the largest biodiversity data aggregator in the world. Finally, we provide a case study of the application of our reproducible workflow by creating SDMs for vascular plants in Norway, integrating presence-only and presence-absence species occurrence data, climate, and habitat data.
Integration of disparate datasets is a rapidly growing area in quantitative ecology, and is subsequently becoming a major asset in understanding the shifts and trends in species' distributions. However, the tools and software available to construct statistical models to integrate these disparate datasets into a unified framework is lacking, stagnating the growth of data integration in applications. This paper presents PointedSDMs: an easy to use R package used to construct integrated species distribution models using the integrated nested Laplace Approximation methodology. The package is presented in this paper through an illustrative example for a selection of species found across Pennsylvania state.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.