Species traits and observer behaviors that bias data assimilation and how to accommodate them

Scher, C. Lane  (ORCID:0000000336895769); Clark, James S.

doi:10.1002/eap.2815

Abstract

Datasets that monitor biodiversity capture information differently depending on their design, which influences observer behavior and can lead to biases across observations and species. Combining different datasets can improve our ability to identify and understand threats to biodiversity, but this requires an understanding of the observation bias in each. Two datasets widely used to monitor bird populations exemplify these general concerns: eBird is a citizen science project with high spatiotemporal resolution but variation in distribution, effort, and observers, whereas the Breeding Bird Survey (BBS) is a structured survey of specific locations over time. Analyses using these two datasets can identify contradictory population trends. To understand these discrepancies and facilitate data fusion, we quantify species‐level reporting differences across eBird and the BBS in three regions across the United States by jointly modeling bird abundances using data from both datasets. First, we fit a joint Species Distribution Model that accounts for environmental conditions and effort to identify reporting differences across the datasets. We then examine how these differences in reporting are related to species traits. Finally, we analyze species reported to one dataset but not the other and determine whether traits differ between reported and unreported species. We find that most species are reported more in the BBS than eBird. Specifically, we find that compared to eBird, BBS observers tend to report higher counts of common species and species that are usually detected by sound. We also find that species associated with water are reported less in the BBS. Species typically identified by sound are reported more at sunrise than later in the morning. Our results quantify reporting differences in eBird and the BBS to enhance our understanding of how each captures information and how they should be used. The reporting rates we identify can also be incorporated into observation models through detectability or effort to improve analyses across species and datasets. The method demonstrated here can be used to compare reporting rates across any two or more datasets to examine biases.

More Like this