skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Guidelines for estimating occupancy from autocorrelated camera trap detections
Abstract Site occupancy models (SOMs) are a common tool for studying the spatial ecology of wildlife. When observational data are collected using passive monitoring field methods, including camera traps or autonomous recorders, detections of animals may be temporally autocorrelated, leading to biased estimates and incorrectly quantified uncertainty. We presently lack clear guidance for understanding and mitigating the consequences of temporal autocorrelation when estimating occupancy models with camera trap data.We use simulations to explore when and how autocorrelation gives rise to biased or overconfident estimates of occupancy. We explore the impact of sampling design and biological conditions on model performance in the presence of autocorrelation, investigate the usefulness of several techniques for identifying and mitigating bias and compare performance of the SOM to a model that explicitly estimates autocorrelation. We also conduct a case study using detections of 22 North American mammals.We show that a join count goodness‐of‐fit test previously proposed for identifying clustered detections is effective for detecting autocorrelation across a range of conditions. We find that strong bias occurs in the estimated occupancy intercept when survey durations are short and detection rates are low. We provide a reference table for assessing the degree of bias to be expected under all conditions. We further find that discretizing data with larger windows decreases the magnitude of bias introduced by autocorrelation. In our case study, we find that detections of most species are autocorrelated and demonstrate how larger detection windows might mitigate the resulting bias.Our findings suggest that autocorrelation is likely widespread in camera trap data and that many previous studies of occupancy based on camera trap data may have systematically underestimated occupancy probabilities. Moving forward, we recommend that ecologists estimating occupancy from camera trap data use the join count goodness‐of‐fit test to determine whether autocorrelation is present in their data. If it is, SOMs should use large detection windows to mitigate bias and more accurately quantify uncertainty in occupancy model parameters. Ecologists should not use gaps between detection periods, which are ineffective at mitigating temporal structure in data and discard useful data.  more » « less
Award ID(s):
2206784 2211768 2206783
PAR ID:
10530429
Author(s) / Creator(s):
; ; ; ; ; ; ;
Corporate Creator(s):
Editor(s):
NA
Publisher / Repository:
British Ecological Society
Date Published:
Journal Name:
Methods in Ecology and Evolution
Volume:
15
Issue:
7
ISSN:
2041-210X
Page Range / eLocation ID:
1177 to 1191
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Resource selection functions (RSFs) are among the most commonly used statistical tools in both basic and applied animal ecology. They are typically parameterized using animal tracking data, and advances in animal tracking technology have led to increasing levels of autocorrelation between locations in such data sets. Because RSFs assume that data are independent and identically distributed, such autocorrelation can cause misleadingly narrow confidence intervals and biased parameter estimates.Data thinning, generalized estimating equations and step selection functions (SSFs) have been suggested as techniques for mitigating the statistical problems posed by autocorrelation, but these approaches have notable limitations that include statistical inefficiency, unclear or arbitrary targets for adequate levels of statistical independence, constraints in input data and (in the case of SSFs) scale‐dependent inference. To remedy these problems, we introduce a method for likelihood weighting of animal locations to mitigate the negative consequences of autocorrelation on RSFs.In this study, we demonstrate that this method weights each observed location in an animal's movement track according to its level of non‐independence, expanding confidence intervals and reducing bias that can arise when there are missing data in the movement track.Ecologists and conservation biologists can use this method to improve the quality of inferences derived from RSFs. We also provide a complete, annotated analytical workflow to help new users apply our method to their own animal tracking data using thectmm Rpackage. 
    more » « less
  2. Environmental conditions are dynamic, and plants respond to those dynamics on multiple time scales. Disequilibrium occurs when a response occurs more slowly than the driving environmental changes. We review evidence regarding disequilibrium in plant distributions, including their responses to paleoclimate changes, recent climate change and new species introductions. There is strong evidence that plant species distributions are often in some disequilibrium with their environmental conditions.This disequilibrium poses a challenge when projecting future species distributions using species distribution models (SDMs). Classically, SDMs assume that the set of species occurrences is an unbiased sample of the suitable environmental conditions. However, a species in disequilibrium with the environment may have higher‐than‐expected occurrence probabilities (e.g. due to extinction debts) or lower‐than‐expected occurrence probabilities (e.g. due to dispersal limitation) in different areas. If unaccounted for, this will lead to biased estimates of the environmental suitability.We review methods for avoiding such biases in SDMs, ranging from simple thinning of the occurrence dataset to complex dynamic and process‐based models. Such models require large data inputs, natural history knowledge and technical expertise, so implementing them can be challenging. Despite this, we advocate for their increased use, since process‐based models provide the best potential to account for biases in model training data and to then represent the dynamics of species occupancy as ranges shift.Synthesis. Occurrence records for a species are often in disequilibrium with climate. SDMs trained on such data will produce biased estimates of a species' niche unless this disequilibrium is addressed in the modelling. A range of tools, spanning a wide gradient of complexity and realism, can resolve this bias. 
    more » « less
  3. Abstract Changes in land use and land cover (LULC) due to agricultural expansion, commercial land management and other human‐driven modifications significantly influence the ecology of pathogens and vectors. This underscores the urgent need to understand how these respond to rapid and dynamic land use changes in these ecosystems and, critically, to identify strategies for mitigating their impacts.In tropical Central and South America, palm trees serve as primary habitats forRhodniuskissing bugs, vectors ofTrypanosoma cruzi, the etiologic agent of Chagas disease. This study investigates how LULC, weather and traits of the palmAttalea butyraceapredict the occurrence and infection ofRhodnius pallescens, integrating field data collection, molecular detection and spatial and hierarchical analyses across a rural landscape in Panama.Rhodnius pallescenswere collected from 46 palms in 11 communities with different landscape compositions including native forests, grasslands, successional forests and artificial structures. Robust occupancy modelling using land cover data at 10 m2resolution revealed that successional forest cover at 300 m spatial scale predicted greater occurrence ofR. pallescens, whereas native forest predicted lower occurrence. Quadratic models outperformed linear models, indicating occupancy peaks at intermediate land covers and palm tree traits.Real‐time PCR assays detectedTrypanosomainfections in 70% ofR. pallescensacross communities. Spatial autocorrelation analyses showed significant spatial clustering forT. cruzibut not forTrypanosoma rangeli. We used generalized additive mixed models to assess the influence of palm‐level and landscape‐scale attributes on parasite infection and identified significant nonlinear positive associations betweenT.cruziinfection and native forest and grassland, with high predictive accuracy (AUC = 0.90).Synthesis and applications. Findings here show that successional forest predicts greater kissing bug infestation risk in palm trees, whereas native forest predicts lower kissing bug occurrence but greater infection withT. cruzi. These insights can guide land use planning towards vegetation management practices that help minimizeT. cruzitransmission risks for rural communities. Importantly, vector surveillance should target forest‐grassland ecotones and consider forest successional stages near settlements, with intensified monitoring after disturbances; this approach is applicable to other vector‐borne pathogen systems shaped by land use change. 
    more » « less
  4. Abstract Individual body size distributions (ISD) within communities are remarkably consistent across habitats and spatiotemporal scales and can be represented by size spectra, which are described by a power law. The focus of size spectra analysis is to estimate the exponent () of the power law. A common application of size spectra studies is to detect anthropogenic pressures.Many methods have been proposed for estimating most of which involve binning the data, counting the abundance within bins, and then fitting an ordinary least squares regression in log–log space. However, recent work has shown that binning procedures return biased estimates of compared to procedures that directly estimate using maximum likelihood estimation (MLE). While it is clear that MLE produces less biased estimates of site‐specificλ's, it is less clear how this bias affects the ability to test for changes inλacross space and time, a common question in the ecological literature.Here, we used simulation to compare the ability of two normalised binning methods (equal logarithmic and log2bins) and MLE to (1) recapture known values of , and (2) recapture parameters in a linear regression measuring the change in across a hypothetical environmental gradient. We also compared the methods using two previously published body size datasets across a natural temperature gradient and an anthropogenic pollution gradient.Maximum likelihood methods always performed better than common binning methods, which demonstrated consistent bias depending on the simulated values of . This bias carried over to the regressions, which were more accurate when was estimated using MLE compared to the binning procedures. Additionally, the variance in estimates using MLE methods is markedly reduced when compared to binning methods.The error induced by binning methods can be of similar magnitudes as the variation previously published in experimental and observational studies, bringing into question the effect sizes of previously published results. However, while the methods produced different regression slope estimates, they were in qualitative agreement on the sign of those slopes (i.e. all negative or all positive). Our results provide further support for the direct estimation of and its relative variation across environmental gradients using MLE over the more common methods of binning. 
    more » « less
  5. Abstract Numerous modelling techniques exist to estimate abundance of plant and animal populations. The most accurate methods account for multiple complexities found in ecological data, such as observational biases, spatial autocorrelation, and species correlations. There is, however, a lack of user‐friendly and computationally efficient software to implement the various models, particularly for large data sets.We developed thespAbundance Rpackage for fitting spatially explicit Bayesian single‐species and multi‐species hierarchical distance sampling models, N‐mixture models, and generalized linear mixed models. The models within the package can account for spatial autocorrelation using Nearest Neighbour Gaussian Processes and accommodate species correlations in multi‐species models using a latent factor approach, which enables model fitting for data sets with large numbers of sites and/or species.We provide three vignettes and three case studies that highlightspAbundancefunctionality. We used spatially explicit multi‐species distance sampling models to estimate density of 16 bird species in Florida, USA, an N‐mixture model to estimate black‐throated blue warbler (Setophaga caerulescens) abundance in New Hampshire, USA, and a spatial linear mixed model to estimate forest above‐ground biomass across the continental USA.spAbundanceprovides a user‐friendly, formula‐based interface to fit a variety of univariate and multivariate spatially explicit abundance models. The package serves as a useful tool for ecologists and conservation practitioners to generate improved inference and predictions on the spatial drivers of abundance in populations and communities. 
    more » « less