skip to main content


Title: Does adding community science observations to museum records improve distribution modeling of a rare endemic plant?
Abstract

Understanding the ranges of rare and endangered species is central to conserving biodiversity in the Anthropocene. Species distribution models (SDMs) have become a common and powerful tool for analyzing species–environment relationships across geographic space. Although evaluating the distribution of rare species is integral to their conservation, this can be difficult when limited distribution data are available. Community science platforms, such as iNaturalist, have emerged as alternative sources for species occurrence data. Although these observations are often thought to be of lower quality than those of natural history collections, they may have potential for improving SDMs for species with few occurrence records from collections. Here, we investigate the utility of iNaturalist data for developing SDMs for a rare high‐elevation plant,Telesonix jamesii. Because methods for modeling rare species are limited in the literature, five different modeling techniques were considered, including profile methods, statistical models, and machine learning algorithms. The inclusion of iNaturalist data doubled the number of usable records forT. jamesii.We found that a random forest (RF) model using ensemble training data performed the highest of any model (area under curve = 0.98). We then compared the performance of RF models that use only natural history training data and those that use a combination of natural history (herbarium specimens) and iNaturalist training data. All models heavily relied on climate data (mean temperature of driest quarter, and precipitation of the warmest quarter), indicating that this species is under threat as climate continues to change. Validation datasets affected model fits as well. Models using only herbarium data performed slightly poorer when evaluated with cross‐validation than when validated externally with iNaturalist data. This study can serve as a model for future SDM studies of species with similar data limitations.

 
more » « less
Award ID(s):
2102974
NSF-PAR ID:
10403824
Author(s) / Creator(s):
 ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Ecosphere
Volume:
14
Issue:
3
ISSN:
2150-8925
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Species distribution models (SDMs) that rely on regional‐scale environmental variables will play a key role in forecasting species occurrence in the face of climate change. However, in the Anthropocene, a number of local‐scale anthropogenic variables, including wildfire history, land‐use change, invasive species, and ecological restoration practices can override regional‐scale variables to drive patterns of species distribution. Incorporating these human‐induced factors into SDMs remains a major research challenge, in part because spatial variability in these factors occurs at fine scales, rendering prediction over regional extents problematic. Here, we used big sagebrush (Artemisia tridentataNutt.) as a model species to explore whether including human‐induced factors improves the fit of the SDM. We applied a Bayesian hurdle spatial approach using 21,753 data points of field‐sampled vegetation obtained from the LANDFIRE program to model sagebrush occurrence and cover by incorporating fire history metrics and restoration treatments from 1980 to 2015 throughout the Great Basin of North America. Models including fire attributes and restoration treatments performed better than those including only climate and topographic variables. Number of fires and fire occurrence had the strongest relative effects on big sagebrush occurrence and cover, respectively. The models predicted that the probability of big sagebrush occurrence decreases by 1.2% (95% CI: −6.9%, 0.6%) when one fire occurs and cover decreases by 44.7% (95% CI: −47.9%, −41.3%) if at least one fire occurred over the 36 year period of record. Restoration practices increased the probability of big sagebrush occurrence but had minimal effect on cover. Our results demonstrate the potential value of including disturbance and land management along with climate in models to predict species distributions. As an increasing number of datasets representing land‐use history become available, we anticipate that our modeling framework will have broad relevance across a range of biomes and species.

     
    more » « less
  2. Lozier, Jeffrey (Ed.)
    Abstract The advent of community-science databases in conjunction with museum specimen locality information has exponentially increased the power and accuracy of ecological niche modeling (ENM). Increased occurrence data has provided colossal potential to understand the distributions of lesser known or endangered species, including arthropods. Although niche modeling of termites has been conducted in the context of invasive and pest species, few studies have been performed to understand the distribution of basal termite genera. Using specimen records from the American Museum of Natural History (AMNH) as well as locality databases, we generated ecological niche models for 12 basal termite species belonging to six genera and three families. We extracted environmental data from the Worldclim 19 bioclimatic dataset v2, along with SoilGrids datasets and generated models using MaxEnt. We chose Optimal models based on partial Receiving Operating characteristic (pROC) and omission rate criterion and determined variable importance using permutation analysis. We also calculated response curves to understand changes in suitability with changes in environmental variables. Optimal models for our 12 termite species ranged in complexity, but no discernible pattern was noted among genera, families, or geographic range. Permutation analysis revealed that habitat suitability is affected predominantly by seasonal or monthly temperature and precipitation variation. Our findings not only highlight the efficacy of largely community-science and museum-based datasets, but our models provide a baseline for predictions of future abundance of lesser-known arthropod species in the face of habitat destruction and climate change. 
    more » « less
  3. Abstract

    Spatial biases are an intrinsic feature of occurrence data used in species distribution models (SDMs). Thinning species occurrences, where records close in the geographic or environmental space are removed from the modeling procedure, is an approach often used to address these biases. However, thinning occurrence data can also negatively affect SDM performance, given that the benefits of removing spatial biases might be outweighed by the detrimental effects of data loss caused by this approach. We used real and virtual species to evaluate how spatial and environmental thinning affected different performance metrics of four SDM methods. The occurrence data of virtual species were sampled randomly, evenly spaced, and clustered in the geographic space to simulate different types of spatial biases, and several spatial and environmental thinning distances were used to thin the occurrence data. Null datasets were also generated for each thinning distance where we randomly removed the same number of occurrences by a thinning distance and compared the results of the thinned and null datasets. We found that spatially or environmentally thinned occurrence data is no better than randomly removing them, given that thinned datasets performed similarly to null datasets. Specifically, spatial and environmental thinning led to a general decrease in model performances across all SDM methods. These results were observed for real and virtual species, were positively associated with thinning distance, and were consistent across the different types of spatial biases. Our results suggest that thinning occurrence data usually fails to improve SDM performance and that the use of thinning approaches when modeling species distributions should be considered carefully.

     
    more » « less
  4. The phenology of critical biological events in aquatic ecosystems are rapidly shifting due to climate change. Growing variability in phenological cues can increase the likelihood of trophic mismatches, causing recruitment failures in commercially, culturally, and recreationally important fisheries. We tested for changes in spawning phenology of regionally important walleye (Sander vitreus) populations in 194 Midwest US lakes in Minnesota, Michigan, and Wisconsin spanning 1939-2019 to investigate factors influencing walleye phenological responses to climate change and associated climate variability, including ice-off timing, lake physical characteristics, and population stocking history. Data from Wisconsin and Michigan lakes (185 and 5 out of 194 total lakes, respectively) were collected by the Wisconsin Department of Natural Resources (WDNR) and the Great Lakes Indian Fish and Wildlife Commission (GLIFWC) through standardized spring walleye mark-recapture surveys and spring tribal harvest season records. Standardized spring mark-recapture population estimates are performed shortly after ice-off, where following a marking event, a subsequent recapture sampling event is conducted using nighttime electrofishing (typically AC – WDNR, pulsed-DC – GLIFWC) of the entire shoreline including islands for small lakes and index stations for large lakes (Hansen et al. 2015) that is timed to coincide with peak walleye spawning activity (G. Hatzenbeler, WDNR, personal communication; M. Luehring, GLIFWC, personal communication; Beard et al. 1997). Data for four additional Minnesota lakes were collected by the Minnesota Department of Natural Resources (MNDNR) beginning in 1939 during annual collections of walleye eggs and broodstock (Schneider et al. 2010), where date of peak egg take was used to index peak spawning activity. For lakes where spawning location did not match the lake for which the ice-off data was collected, the spawning location either flowed into (Pike River) or was within 50 km of a lake where ice-off data were available (Pine River) and these ice-off data were used. Following the affirmation of off-reservation Ojibwe tribal fishing rights in the Ceded Territories of Wisconsin and the Upper Peninsula of Michigan in 1987, tribal spearfishers have targeted walleye during spring spawning (Mrnak et al. 2018). Nightly harvests are recorded as part of a compulsory creel survey (US Department of the Interior 1991). Using these records, we calculated the date of peak spawning activity in a given lake-year as the day of maximum tribal harvest. Although we were unable to account for varying effort in these data, a preliminary analysis comparing spawning dates estimated using tribal harvest to those determined from standardized agency surveys in the same lake and year showed that they were highly correlated (Pearson’s correlation: r = 0.91, P < 0.001). For lakes that had walleye spawning data from both agency surveys and tribal harvest, we used the data source with the greatest number of observation years. Ice-off phenology data was collected from two sources – either observed from the Global Lake and River Ice Phenology database (Benson et al. 2000)t, or modeled from a USGS region-wide machine-learning model which used North American Land Data Assimilation System (NLDAS) meteorological inputs combined with lake characteristics (lake position, clarity, size, depth, hypsography, etc.) to predict daily water column temperatures from 1979 - 2022, from which ice-off dates could be derived (https://www.sciencebase.gov/catalog/item/6206d3c2d34ec05caca53071; see Corson-Dosch et al. 2023 for details). Modeled data for our study lakes (see (Read et al. 2021) for modeling details), which performed well in reflecting ice phenology when compared to observed data (i.e., highly significant correlation between observed and modeled ice-off dates when both were available; r = 0.71, p < 0.001). Lake surface area (ha), latitude, and maximum depth (m) were acquired from agency databases and lake reports. Lake class was based on a WDNR lakes classification system (Rypel et al. 2019) that categorized lakes based on temperature, water clarity, depth, and fish community. Walleye stocking history was defined using the walleye stocking classification system developed by the Wisconsin Technical Working Group (see also Sass et al. 2021), which categorized lakes based on relative contributions of naturally-produced and stocked fish to adult recruitment by relying heavily on historic records of age-0 and age-1 catch rates and stocking histories. Wisconsin lakes were divided into three groups: natural recruitment (NR), a combination of stocking and natural recruitment (C-ST), and stocked only (ST). Walleye natural recruitment was indexed as age-0 walleye CPE (number of age-0 walleye captured per km of shoreline electrofished) from WDNR and GLIFWC fall electrofishing surveys (see Hansen et al. 2015 for details). We excluded lake-years where stocking of age-0 fish occurred before age-0 surveys to only include measurements of naturally-reproduced fish. 
    more » « less
  5. Elmer Ottis Wooton (1865–1945) was one of the most important early botanists to work in the Southwestern United States, contributing a great deal of natural history knowledge and botanical research on the flora of New Mexico that shaped many naturalists and scientists for generations. The extensive Wooton legacy includes herbarium collections that he and his famous student Paul Carpenter Standley (1884–1963), prolific botanist and explorer, used for the first Flora of New Mexi co by Wooten and Standley 1915 , along with resources covering botany and range management strategies for the northern Chihuahuan Desert, and an extensive, yet to be digitized, historical archive of correspondence, field notes, vegetation sketches, photographs, and lantern slides, all from his travels and field work in the region. Starting in 1890, the most complete set of Wooton’s herbarium collections were deposited in the NMC herbarium at New Mexico State University (NMSU), and his archives, now stored in a Campus library, have together been underutilized, offline resources. The goals of this ongoing project are to secure, preserve, and promote Wooton’s important historical resources, by fleshing out the botanical history of the region, raising appreciation of herbarium collections within the community, and emphasizing their unique role in facilitating contemporary research aimed at addressing pressing scientific questions such as vegetation responses to global climate change. Students and the general public involved in this project are engaged through hands-on activities including cataloging, databasing and digitization of nearly 10,000 herbarium specimens and Wooton’s archives. These outputs, combined with contemporary data collection and computational biology techniques from an ecological perspective, are being used to document vegetation changes in iconic, climate-sensitive, high-elevation mountainous ecosystems present in southwestern New Mexico. In a later phase of the project, a variety of public audiences will participate through interactive online story maps and citizen science programs such as iNaturalist , Notes from Nature , and BioBlitz . Images of herbarium specimens will be shared via an online database and other relevant biodiversity portals ( Symbiota , iDigBio , JStor ) Community members reached through this project will be better-informed citizens, who may go on to become new stewards of natural history collections, with the potential to influence policies safeguarding the future of our planet’s biodiversity. More locally, the project will support the management of Organ Mountains Desert Peaks National Monument, which was established in 2014 to protect the area's human and environmental resources, and for which knowledge and data are currently limited. 
    more » « less