Geographic And Taxonomic Occurrence R‐based Scrubbing (gatoRs): An R package and workflow for processing biodiversity data

Patten, Natalie N; Gaynor, Michelle L; Soltis, Douglas E; Soltis, Pamela S

doi:10.1002/aps3.11575

Citation Details

Geographic And Taxonomic Occurrence R‐based Scrubbing (gatoRs): An R package and workflow for processing biodiversity data

Abstract PremiseDigitized biodiversity data offer extensive information; however, obtaining and processing biodiversity data can be daunting. Complexities arise during data cleaning, such as identifying and removing problematic records. To address these issues, we created the R package Geographic And Taxonomic Occurrence R‐based Scrubbing (gatoRs). Methods and ResultsThe gatoRs workflow includes functions that streamline downloading records from the Global Biodiversity Information Facility (GBIF) and Integrated Digitized Biocollections (iDigBio). We also created functions to clean downloaded specimen records. Unlike previous R packages, gatoRs accounts for differences in download structure between GBIF and iDigBio and allows for user control via interactive cleaning steps. ConclusionsOur pipeline enables the scientific community to process biodiversity data efficiently and is accessible to the R coding novice. We anticipate that gatoRs will be useful for both established and beginning users. Furthermore, we expect our package will facilitate the introduction of biodiversity‐related concepts into the classroom via the use of herbarium specimens. more »

Award ID(s):: 2027654

PAR ID:: 10518749

Author(s) / Creator(s):: Patten, Natalie N; Gaynor, Michelle L; Soltis, Douglas E; Soltis, Pamela S

Publisher / Repository:: NSF Public Access Repository (NSF-PAR)

Date Published:: 2024-03-01

Journal Name:: Applications in Plant Sciences

Volume:: 12

Issue:: 2

ISSN:: 2168-0450

Subject(s) / Keyword(s):: GBIF basis cleaning biodiversity data download herbaria iDigBio locality cleaning spatial correction taxonomic harmonization

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1002/aps3.11575

More Like this