skip to main content

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 11:00 PM ET on Friday, December 13 until 2:00 AM ET on Saturday, December 14 due to maintenance. We apologize for the inconvenience.


Title: A multispecies hierarchical model to integrate count and distance‐sampling data
Abstract

Integrated community models—an emerging framework in which multiple data sources for multiple species are analyzed simultaneously—offer opportunities to expand inferences beyond the single‐species and single‐data‐source approaches common in ecology. We developed a novel integrated community model that combines distance sampling and single‐visit count data; within the model, information is shared among data sources (via a joint likelihood) and species (via a random‐effects structure) to estimate abundance patterns across a community. Parameters relating to abundance are shared between data sources, and the model can specify either shared or separate observation processes for each data source. Simulations demonstrated that the model provided unbiased estimates of abundance and detection parameters even when detection probabilities varied between the data types. The integrated community model also provided more accurate and more precise parameter estimates than alternative single‐species and single‐data‐source models in many instances. We applied the model to a community of 11 herbivore species in the Masai Mara National Reserve, Kenya, and found considerable interspecific variation in response to local wildlife management practices: Five species showed higher abundances in a region with passive conservation enforcement (median across species: 4.5× higher), three species showed higher abundances in a region with active conservation enforcement (median: 3.9× higher), and the remaining three species showed no abundance differences between the two regions. Furthermore, the community average of abundance was slightly higher in the region with active conservation enforcement but not definitively so (posterior mean: higher by 0.20 animals; 95% credible interval: 1.43 fewer animals, 1.86 more animals). Our integrated community modeling framework has the potential to expand the scope of inference over space, time, and levels of biological organization, but practitioners should carefully evaluate whether model assumptions are met in their systems and whether data integration is valuable for their applications.

 
more » « less
Award ID(s):
1954406 2208894 1853934 1556407
PAR ID:
10513057
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Ecology
Volume:
105
Issue:
7
ISSN:
0012-9658
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    1. The occurrence and distributions of wildlife populations and communities are shifting as a result of global changes. To evaluate whether these shifts are negatively impacting biodiversity processes, it is critical to monitor the status, trends and effects of environmental variables on entire communities. However, modelling the dynamics of multiple species simultaneously can require large amounts of diverse data, and few modelling approaches exist to simultaneously provide species and community‐level inferences.

    2. We present an ‘integrated community occupancy model’ (ICOM) that unites principles of data integration and hierarchical community modelling in a single framework to provide inferences on species‐specific and community occurrence dynamics using multiple data sources. The ICOM combines replicated and nonreplicated detection–nondetection data sources using a hierarchical framework that explicitly accounts for different detection and sampling processes across data sources. We use simulations to compare the ICOM to previously developed hierarchical community occupancy models and single species integrated distribution models. We then apply our model to assess the occurrence and biodiversity dynamics of foliage‐gleaning birds in the White Mountain National Forest in the northeastern USA from 2010 to 2018 using three independent data sources.

    3. Simulations reveal that integrating multiple data sources in the ICOM increased precision and accuracy of species and community‐level inferences compared to single data source models, although benefits of integration were dependent on the information content of individual data sources (e.g. amount of replication). Compared to single species models, the ICOM yielded more precise species‐level estimates. Within our case study, the ICOM had the highest out‐of‐sample predictive performance compared to single species models and models that used only a subset of the three data sources.

    4. The ICOM provides more precise estimates of occurrence dynamics compared to multi‐species models using single data sources or integrated single‐species models. We further found that the ICOM had improved predictive performance across a broad region of interest with an empirical case study of forest birds. The ICOM offers an attractive approach to estimate species and biodiversity dynamics, which is additionally valuable to inform management objectives of both individual species and their broader communities.

     
    more » « less
  2. Abstract

    Precise and accurate estimates of abundance and demographic rates are primary quantities of interest within wildlife conservation and management. Such quantities provide insight into population trends over time and the associated underlying ecological drivers of the systems. This information is fundamental in managing ecosystems, assessing species conservation status and developing and implementing effective conservation policy. Observational monitoring data are typically collected on wildlife populations using an array of different survey protocols, dependent on the primary questions of interest. For each of these survey designs, a range of advanced statistical techniques have been developed which are typically well understood. However, often multiple types of data may exist for the same population under study. Analyzing each data set separately implicitly discards the common information contained in the other data sets. An alternative approach that aims to optimize the shared information contained within multiple data sets is to use a “model-based data integration” approach, or more commonly referred to as an “integrated model.” This integrated modeling approach simultaneously analyzes all the available data within a single, and robust, statistical framework. This paper provides a statistical overview of ecological integrated models, with a focus on integrated population models (IPMs) which include abundance and demographic rates as quantities of interest. Four main challenges within this area are discussed, namely model specification, computational aspects, model assessment and forecasting. This should encourage researchers to explore further and develop new practical tools to ensure that full utility can be made of IPMs for future studies.

     
    more » « less
  3. Abstract

    Monitoring wildlife abundance across space and time is an essential task to study their population dynamics and inform effective management. Acoustic recording units are a promising technology for efficiently monitoring bird populations and communities. While current acoustic data models provide information on the presence/absence of individual species, new approaches are needed to monitor population abundance, ideally across large spatio‐temporal regions.

    We present an integrated modelling framework that combines high‐quality but temporally sparse bird point count survey data with acoustic recordings. Our models account for imperfect detection in both data types and false positive errors in the acoustic data. Using simulations, we compare the accuracy and precision of abundance estimates using differing amounts of acoustic vocalizations obtained from a clustering algorithm, point count data, and a subset of manually validated acoustic vocalizations. We also use our modelling framework in a case study to estimate abundance of the Eastern Wood‐Pewee (Contopus virens) in Vermont, USA.

    The simulation study reveals that combining acoustic and point count data via an integrated model improves accuracy and precision of abundance estimates compared with models informed by either acoustic or point count data alone. Improved estimates are obtained across a wide range of scenarios, with the largest gains occurring when detection probability for the point count data is low. Combining acoustic data with only a small number of point count surveys yields estimates of abundance without the need for validating any of the identified vocalizations from the acoustic data. Within our case study, the integrated models provided moderate support for a decline of the Eastern Wood‐Pewee in this region.

    Our integrated modelling approach combines dense acoustic data with few point count surveys to deliver reliable estimates of species abundance without the need for manual identification of acoustic vocalizations or a prohibitively expensive large number of repeated point count surveys. Our proposed approach offers an efficient monitoring alternative for large spatio‐temporal regions when point count data are difficult to obtain or when monitoring is focused on rare species with low detection probability.

     
    more » « less
  4. Abstract

    The characterization of species’ environmental niches and spatial distribution predictions based on them are now central to much of ecology and conservation, but implicitly requires decisions about the appropriate spatial scale (i.e.,grain) of analysis. Ecological theory and empirical evidence suggest that range‐resident species respond to their environment at two characteristic, hierarchical spatial grains: (1)response grain, the (relatively fine) grain at which an individual uses environmental resources, and (2)occupancy grain, the (relatively coarse) grain equivalent to a typical home range. We use a multi‐grain (MG) occupancy model, aided by fine‐grain remotely sensed imagery, to simultaneously estimate species–environment associations at both grains, conduct grain optimization to measure response grain, and apply this analysis framework to an example species: a medium‐sized bird (Tockus deckeni) in a heterogeneous East African landscape. Based on home range analysis of movement data, we calculate an occupancy grain of 1 km forT. deckeni. Using a grain optimization procedure across 32 grains from 10 to 500 m, we identify 60 m as the most strongly supported response grain for a suite of environmental variables, slightly coarser than opportunistic behavioral observations would have suggested. Validation confirms that the accuracy of the optimized MG occupancy model substantially exceeds that of equivalent single‐grain (SG) occupancy models. We further use a simulation approach to assess the potential impacts of accounting for the multi‐scale structure of species’ environmental requirements on estimates of population size. We find that the more strongly supported MG approach consistently predicts a minimum population size in the study landscape that is much lower than that provided by the SG model. This suggests that SG approaches commonly used in conservation applications could lead to overly optimistic abundance and population estimates, and that the MG approach may be more appropriate for supporting species conservation goals. More generally, we conclude that multi‐grain approaches of the sort presented, and increasingly enabled by growing high‐resolution remotely sensed data, hold great promise for offering a more mechanistic framework for assessing the appropriate grain(s) for population monitoring and management and enable more reliable estimates of abundances and species’ distributions.

     
    more » « less
  5. Abstract Aim

    Quantifying abundance distributions is critical for understanding both how communities assemble, and how community structure varies through time and space, yet estimating abundances requires considerable investment in fieldwork. Community‐level population genetic data potentially offer a powerful way to indirectly infer richness, abundance and the history of accumulation of biodiversity within a community. Here we introduce a joint model linking neutral community assembly and comparative phylogeography to generate both community‐level richness, abundance and genetic variation under a neutral model, capturing both equilibrium and non‐equilibrium dynamics.

    Location

    Global.

    Methods

    Our model combines a forward‐time individual‐based community assembly process with a rescaled backward‐time neutral coalescent model of multi‐taxa population genetics. We explore general dynamics of genetic and abundance‐based summary statistics and use approximate Bayesian computation (ABC) to estimate parameters underlying the model of island community assembly. Finally, we demonstrate two applications of the model using community‐scale mtDNAsequence data and densely sampled abundances of an arachnid community on La Réunion. First, we use genetic data alone to estimate a summary of the abundance distribution, ground‐truthing this against the observed abundances. Then, we jointly use the observed genetic data and abundances to estimate the proximity of the community to equilibrium.

    Results

    Simulation experiments of ourABCprocedure demonstrate that coupling abundance with genetic data leads to improved accuracy and precision of model parameter estimates compared with using abundance‐only data. We further demonstrate reasonable precision and accuracy in estimating a metric underlying the shape of the abundance distribution, temporal progress towards local equilibrium and several key parameters of the community assembly process. For the insular arachnid assemblage, we find the joint distribution of genetic diversity and abundance approaches equilibrium expectations, and that the Shannon entropy of the observed abundances can be estimated using genetic data alone.

    Main conclusions

    The framework that we present unifies neutral community assembly and comparative phylogeography to characterize the community‐level distribution of both abundance and genetic variation through time, providing a resource that should greatly enhance understanding of both the processes structuring ecological communities and the associated aggregate demographic histories.

     
    more » « less