Evaluating Bayesian spatial methods for modelling species distributions with clumped and restricted occurrence data.

PLoS One

Centre for Biodiversity and Environment Research, Department of Genetics, Evolution and Environment, University College London, London, United Kingdom.

Published: December 2017


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Statistical approaches for inferring the spatial distribution of taxa (Species Distribution Models, SDMs) commonly rely on available occurrence data, which is often clumped and geographically restricted. Although available SDM methods address some of these factors, they could be more directly and accurately modelled using a spatially-explicit approach. Software to fit models with spatial autocorrelation parameters in SDMs are now widely available, but whether such approaches for inferring SDMs aid predictions compared to other methodologies is unknown. Here, within a simulated environment using 1000 generated species' ranges, we compared the performance of two commonly used non-spatial SDM methods (Maximum Entropy Modelling, MAXENT and boosted regression trees, BRT), to a spatial Bayesian SDM method (fitted using R-INLA), when the underlying data exhibit varying combinations of clumping and geographic restriction. Finally, we tested how any recommended methodological settings designed to account for spatially non-random patterns in the data impact inference. Spatial Bayesian SDM method was the most consistently accurate method, being in the top 2 most accurate methods in 7 out of 8 data sampling scenarios. Within high-coverage sample datasets, all methods performed fairly similarly. When sampling points were randomly spread, BRT had a 1-3% greater accuracy over the other methods and when samples were clumped, the spatial Bayesian SDM method had a 4%-8% better AUC score. Alternatively, when sampling points were restricted to a small section of the true range all methods were on average 10-12% less accurate, with greater variation among the methods. Model inference under the recommended settings to account for autocorrelation was not impacted by clumping or restriction of data, except for the complexity of the spatial regression term in the spatial Bayesian model. Methods, such as those made available by R-INLA, can be successfully used to account for spatial autocorrelation in an SDM context and, by taking account of random effects, produce outputs that can better elucidate the role of covariates in predicting species occurrence. Given that it is often unclear what the drivers are behind data clumping in an empirical occurrence dataset, or indeed how geographically restricted these data are, spatially-explicit Bayesian SDMs may be the better choice when modelling the spatial distribution of target species.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5708625PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0187602PLOS

Publication Analysis

Top Keywords

spatial bayesian
16
bayesian sdm
12
sdm method
12
spatial
10
methods
9
data
8
occurrence data
8
approaches inferring
8
spatial distribution
8
geographically restricted
8

Similar Publications

Background: Between November 2023 and March 2024, coastal Kenya experienced another wave of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections detected through our continued genomic surveillance. Herein, we report the clinical and genomic epidemiology of SARS-CoV-2 infections from 179 individuals (a total of 185 positive samples) residing in the Kilifi Health and Demographic Surveillance System (KHDSS) area (~ 900 km).

Methods: We analyzed genetic, clinical, and epidemiological data from SARS-CoV-2 positive cases across pediatric inpatient, health facility outpatient, and homestead community surveillance platforms.

View Article and Find Full Text PDF

Motivation: The advent of next-generation sequencing-based spatially resolved transcriptomics (SRT) techniques has reshaped genomic studies by enabling high-throughput gene expression profiling while preserving spatial and morphological context. Understanding gene functions and interactions in different spatial domains is crucial, as it can enhance our comprehension of biological mechanisms, such as cancer-immune interactions and cell differentiation in various regions. It is necessary to cluster tissue regions into distinct spatial domains and identify discriminating genes that elucidate the clustering result, referred to as spatial domain-specific discriminating genes (DGs).

View Article and Find Full Text PDF

Spatial disparities in perfluoroalkyl and polyfluoroalkyl substances exposure and immunosuppressive effects on vaccine induced antibody levels in Guangzhou children.

Environ Pollut

September 2025

Guangdong-Hong Kong-Macao Joint Laboratory for Contaminants Exposure and Health, Guangzhou Center for Disease Control and Prevention, Guangzhou, 510440, China; School of Public Health, Southern Medical University, Guangzhou, 510515, China. Electronic address:

Perfluoroalkyl and polyfluoroalkyl substances (PFAS) are persistent environmental pollutants that are widely detected in human serum worldwide, and are associated with reduced vaccine-induced antibody responses. However, existing research has primarily focused on the effects of prenatal and adolescent PFAS exposures on antibody levels or disease incidence. A critical gap remains in understanding the association between serum PFAS concentrations and antibody levels in children.

View Article and Find Full Text PDF

Background: Long COVID affects a substantial portion of the U.S. population, yet its spatiotemporal distribution remains poorly characterized.

View Article and Find Full Text PDF