Spatial Clustering of Citizen Science Data Improves Downstream Species Distribution Models

Ahmed, Nahian; Roth, Mark; Hallman, Tyler A; Robinson, W Douglas; Hutchinson, Rebecca A

doi:10.1609/aaai.v39i27.34993

Citation Details

This content will become publicly available on April 11, 2026

Spatial Clustering of Citizen Science Data Improves Downstream Species Distribution Models

Citizen science biodiversity data present great opportunities for ecology and conservation across vast spatial and temporal scales. However, the opportunistic nature of these data lacks the sampling structure required by modeling methodologies that address a pervasive challenge in ecological data collection: imperfect detection, i.e., the likelihood of under-observing species on field surveys. Occupancy modeling is an example of an approach that accounts for imperfect detection by explicitly modeling the observation process separately from the biological process of habitat selection. This produces species distribution models that speak to the pattern of the species on a landscape after accounting for imperfect detection in the data, rather than the pattern of species observations corrupted by errors. To achieve this benefit, occupancy models require multiple surveys of a site across which the site's status (i.e., occupied or not) is assumed constant. Since citizen science data are not collected under the required repeated-visit protocol, observations may be grouped into sites post hoc. Existing approaches for constructing sites discard some observations and/or consider only geographic distance and not environmental similarity. In this study, we compare ten approaches for site construction in terms of their impact on downstream species distribution models for 31 bird species in Oregon, using observations recorded in the eBird database. We find that occupancy models built on sites constructed by spatial clustering algorithms perform better than existing alternatives. more »

Award ID(s):: 2046678

PAR ID:: 10585197

Author(s) / Creator(s):: Ahmed, Nahian; Roth, Mark; Hallman, Tyler A; Robinson, W Douglas; Hutchinson, Rebecca A

Publisher / Repository:: Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence

Date Published:: 2025-04-11

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 39

Issue:: 27

ISSN:: 2159-5399

Page Range / eLocation ID:: 27775 to 27783

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on April 11, 2026
Journal Article:
https://doi.org/10.1609/aaai.v39i27.34993

More Like this