Title: Arctos: Community-driven innovations for managing biodiversity and cultural collections
Abstract: Museum collections house millions of objects and associated data records that document biological and cultural diversity. In recent decades, digitization efforts have greatly increased accessibility to these data, thereby revolutionizing interdisciplinary studies in evolutionary biology, biogeography, epidemiology, cultural change, and human-mediated environmental impacts. Curators and collection managers can make museum data as accessible as possible to scientists and learners by using a collection management system. However, selecting a system can be a challenging task. Here, we describe Arctos, a community solution for managing and accessing collections data for research and education. Specific goals are to: (1) Describe the core elements of Arctos for a broad audience with respect to the biodiversity informatics principles that enable high quality research; (2) Highlight the unique aspects of Arctos; (3) Illustrate Arctos as a model for supporting and enhancing the Digital Extended Specimen; and (4) Emphasize the role of the Arctos community for improving data discovery and enabling cross-disciplinary, integrative studies within a sustainable governance model. In addition to detailing Arctos as both a community of museum professionals and a collection database platform, we discuss how Arctos achieves its richly annotated data by creating a web of knowledge with deep connections between catalog records and derived or associated data. We also highlight the value of Arctos as an educational resource. Finally, we present a financial model of fiscal sponsorship by a non-profit organization, implemented in 2022, to ensure the long-term success and sustainability of Arctos. We attribute Arctos’ longevity of nearly three decades to its core development principles of standardization, flexibility, interdisciplinarity, and connectivity within a nimble development model for addressing novel needs and information types in response to changing technology, workflows, ethical considerations, and regulations.
Award ID(s):
2308707
PAR ID:
10538342
Publisher / Repository:
bioRxiv
Format(s):
Medium: X
Institution:
bioRxiv
Sponsoring Org:
National Science Foundation
More Like this
  1. Meloro, Carlo (Ed.)
    More than tools for managing physical and digital objects, museum collection management systems (CMS) serve as platforms for structuring, integrating, and making accessible the rich data embodied by natural history collections. Here we describe Arctos, a scalable community solution for managing and publishing global biological, geological, and cultural collections data for research and education. Specific goals are to: (1) Describe the core features and implementation of Arctos for a broad audience with respect to the biodiversity informatics principles that enable high quality research; (2) Highlight the unique aspects of Arctos; (3) Illustrate Arctos as a model for supporting and enhancing the Digital Extended Specimen concept; and (4) Emphasize the role of the Arctos community for improving data discovery and enabling cross-disciplinary, integrative studies within a sustainable governance model. In addition to detailing Arctos as both a community of museum professionals and a collection database platform, we discuss how Arctos achieves its richly annotated data by creating a web of knowledge with deep connections between catalog records and derived or associated data. We also highlight the value of Arctos as an educational resource. Finally, we present the financial model of fiscal sponsorship by a nonprofit organization, implemented in 2022, to ensure the long-term success and sustainability of Arctos. 
  2. Leal, JH; Bieler, R (Ed.)
    Among biocollections, mollusks are a particularly powerful resource for a wide range of studies, including biogeography, conservation, ecology, environmental monitoring, evolutionary biology, and systematics. U.S. mollusk collections are housed in stand-alone natural history museums, at universities, and in a variety of governmental and non-governmental institutions. Differing in their histories, specializations, and uses, they share common needs for long-term development, and collectively contribute to biodiversity knowledge at regional, national, and global scales. Commitment by dedicated staff, collectors, and volunteers, institutional investments, philanthropy, and governmental funding have built and maintained these collections and their support infrastructure. Efforts by the North American malacological collection community since the early 1970s led to coordination in database design but left the data isolated in individual institutions. Collection digitization developed through a combination of individual/institutional initiatives and federally supported projects funded by the National Science Foundation (NSF) and the Institute of Museum and Library Services (IMLS). Advances in digital technology enabled the shift toward nationally and globally unified collections. Networking and collaboration were greatly accelerated by NSF’s Advancing Digitization of Biodiversity Collections (ADBC) program, which created a central coordinating organization (iDigBio) and funded Thematic Collections Network (TCN) projects. One such TCN was developed to mobilize nearly 90% of the known U.S. museum-collections-based data of the U.S. Atlantic and Gulf coasts (Mobilizing Millions of Marine Mollusks of the Eastern Seaboard—ESB). 
    The project, involving 16 museum collections (plus the Smithsonian Institution as federal partner), combines data from approximately 4.5 million specimens collected from the ESB region and makes them available to the TCN portal InvertEBase and other aggregators such as iDigBio and GBIF. In addition to fostering community and expanding the corpus of available digitized mollusk records through new data entry and georeferencing (GEOLocate, CoGe) and standardizing taxonomy, the project drove key innovations for the invertebrate collections community. For instance, it worked with the Biodiversity Information Standards (TDWG) group to create a new Darwin Core standard term, “Vitality”, expanded GEOLocate to support complex geospatial types, integrated global elevation and bathymetric datasets directly into the georeferencing workflow, and developed various education and public outreach products. Synthesizing from the 15 following articles with individual histories of ESB-participating mollusk collections, several topics are discussed, such as what defines a “good” mollusk collection in the digital age and the importance of federal support for this national resource.
  3. Among biocollections, mollusks are a particularly powerful resource for a wide range of studies, including biogeography, conservation, ecology, environmental monitoring, evolutionary biology, and systematics. U.S. mollusk collections are housed in stand-alone natural history museums, at universities, and in a variety of governmental and non-governmental institutions. Differing in their histories, specializations, and uses, they share common needs for long-term development, and collectively contribute to biodiversity knowledge at regional, national, and global scales. Commitment by dedicated staff, collectors, and volunteers, institutional investments, philanthropy, and governmental funding have built and maintained these collections and their support infrastructure. Efforts by the North American malacological collection community since the early 1970s led to coordination in database design but left the data isolated in individual institutions. Collection digitization developed through a combination of individual/institutional initiatives and federally supported projects funded by the National Science Foundation (NSF) and the Institute of Museum and Library Services (IMLS). Advances in digital technology enabled the shift toward nationally and globally unified collections. Networking and collaboration were greatly accelerated by NSF’s Advancing Digitization of Biodiversity Collections (ADBC) program, which created a central coordinating organization (iDigBio) and funded Thematic Collections Network (TCN) projects. One such TCN was developed to mobilize nearly 90% of the known U.S. museum-collections-based data of the U.S. Atlantic and Gulf coasts (Mobilizing Millions of Marine Mollusks of the Eastern Seaboard—ESB). 
    The project, involving 16 museum collections (plus the Smithsonian Institution as federal partner), combines data from approximately 4.5 million specimens collected from the ESB region and makes them available to the TCN portal InvertEBase and other aggregators such as iDigBio and GBIF. In addition to fostering community and expanding the corpus of available digitized mollusk records through new data entry and georeferencing (GEOLocate, CoGe) and standardizing taxonomy, the project drove key innovations for the invertebrate collections community. For instance, it worked with the Biodiversity Information Standards (TDWG) group to create a new Darwin Core standard term, “Vitality”, expanded GEOLocate to support complex geospatial types, integrated global elevation and bathymetric datasets directly into the georeferencing workflow, and developed various education and public outreach products. Synthesizing from the 15 following articles with individual histories of ESB-participating mollusk collections, several topics are discussed, such as what defines a “good” mollusk collection in the digital age and the importance of federal support for this national resource.
  4.
    A wealth of information about how parasites interact with their hosts already exists in collections, scientific publications, specialized databases, and grey literature. The US National Science Foundation-funded Terrestrial Parasite Tracker Thematic Collection Network (TPT) project began in 2019 to help build a comprehensive picture of arthropod ectoparasites including the evolution of these parasite-host biotic associations, distributions, and the ecological interactions of disease vectors. TPT is a network of biodiversity collections whose data can assist scientists, educators, land managers, and policymakers to better understand the complex relationship between hosts and parasites including emergent properties that may explain the causes and frequency of human and wildlife pathogens. TPT member collections make their association information easier to access via Global Biotic Interactions (GloBI, Poelen et al. 2014), which is periodically archived through Zenodo to track progress in the TPT project. TPT leverages GloBI's ability to index biotic associations from specimen occurrence records that come from existing management systems (e.g., Arctos, Symbiota, EMu, Excel, MS Access) to avoid having to completely rework existing, or build new, cyber-infrastructures before collections can share data. TPT-affiliated collection managers use collection-specific translation tables to connect their verbatim (or original) terms used to describe associations (e.g., "ex", "found on", "host") to their interpreted, machine-readable terms in the OBO Relations Ontology (RO). These interpreted terms enable searches across previously siloed association record sets, while the original verbatim values remain accessible to help retain provenance and allow for interpretation improvements. TPT is an ambitious project, with the goal to database label data from over 1.2 million specimens of arthropod parasites of vertebrates coming from 22 collections across North America. 
    In the first year of the project, the TPT collections created over 73,700 new records and 41,984 images. In addition, 17 TPT data providers and three other collaborators shared datasets that are now indexed by GloBI, visible on the TPT GloBI project page. These datasets came from collection specimen occurrence records and literature sources. Two TPT data archives that capture and preserve the changes in the data coming from TPT to GloBI were published through Zenodo (Poelen et al. 2020a, Poelen et al. 2020b). The archives document the changes in how data are shared by collections including the biotic association data format and quantity of data captured. The Poelen et al. 2020b report included all TPT collections and biotic interactions from Arctos collections in VertNet and the Symbiota Collection of Arthropods Network (SCAN). The total number of interactions included in this report was 376,671 records (500,000 interactions is the overall goal for TPT). In addition, close coordination with TPT collection data managers including many one-on-one conversations, a workshop, and a webinar (Sullivan et al. 2020) was conducted to help guide the data capture of biotic associations. GloBI is an effective tool to help integrate biotic association data coming from occurrence records into an openly accessible, global, linked view of existing species interaction records. The results gleaned from the TPT workshop and Zenodo data archives demonstrate that minimizing changes to existing workflows allows for custom interpretation of collection-specific interaction terms. In addition, including collection data managers in the development of the interaction term vocabularies is an important part of the process that may improve data sharing and the overall downstream data quality.
  5. A comprehensive overview of volunteer-driven public programs focused on activities to enhance natural history collections (NHCs) is provided. The initiative revolves around the WeDigBio events and the Collections Club at the Field Museum, aiming to deepen the public’s connection with scientific collections, enhance participatory science, and improve data associated with natural history specimens. The implementation and journey of these programs are outlined, including surveys conducted from 2015 through 2021 to gauge participant motivation, satisfaction, and the impact of these events on public engagement with NHCs. Results show trends in on-site and virtual volunteer participation over the years, especially during the peak period of the COVID-19 pandemic. The majority of participants expressed high satisfaction, indicating a willingness to continue participating in similar activities. The surveys revealed a shift towards more altruistic motivations for participation over time, with increased emphasis on supporting the Field Museum and contributing to the scientific community. The success of participatory science events demonstrates the potential of volunteer-driven programs to contribute meaningfully to the preservation, digitisation, and understanding of biodiversity collections, ultimately transforming spectators into stewards of natural history. From 2015 to the present, participants have reached a significant milestone: over a thousand community scientists have contributed to the inventorying, collection care, curation, databasing, or transcription of 286,071 specimens, objects, or records. We also discuss accuracy and quality control as well as a checklist and recommendations for similar activities.