Title: Improving discoverability of tephra data through development of data upload templates and collection tools using community-driven best practices recommendations
Tephra is a unique volcanic product that plays an unparalleled role in understanding past eruptions, the long-term behavior of volcanoes, and the effects of volcanism on climate and the environment. Tephra deposits also provide spatially widespread, extremely high-resolution time-stratigraphic markers across a range of sedimentary settings and are used by many disciplines (e.g. volcanology, seismotectonics, climate science, archaeology, ecology, public health and ash impact assessment). In the last two decades, tephra studies have become more interdisciplinary in nature but are challenged by a lack of standardization that often prevents comparison among regions and across disciplines. To address this challenge, the global tephra community has come together through a series of workshops to establish best practice recommendations for tephra studies from sample collection through analysis and data reporting. This new standardized framework will facilitate consistent tephra documentation and parametrization, foster interdisciplinary communication, and improve the effectiveness of data sharing among diverse communities of researchers. One specific goal is to use the best practice guidelines to inform digital tool and data repository development. Here we report on 1) a new set of templates for tephra sample documentation, geochemical method documentation and data reporting using recommended best-practice data and metadata fields, 2) a new tephra module added to StraboSpot, an open source geologic mapping and data-recording multi-platform software application, and 3) new implementations and cross-mapping of metadata requirements at SESAR (System for Earth Sample Registration) and EarthChem. The addition of tephra-specific fields to StraboSpot enables users to consistently collect and report essential tephra data in the field, which are then automatically saved to an online data repository. A new tephra portal on the EarthChem website will allow users to follow simple workflows to register tephra samples at SESAR and submit microanalytical data to EarthChem.
Award ID(s):
1846400
NSF-PAR ID:
10447838
Author(s) / Creator(s):
Date Published:
Journal Name:
Goldschmidt2021
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Tephra is a unique volcanic product with an unparalleled role in understanding past eruptions, the long-term behavior of volcanoes, and the effects of volcanism on climate and the environment. Tephra deposits also provide spatially widespread, extremely high-resolution time-stratigraphic markers across a range of sedimentary settings and are used by many disciplines (e.g. volcanology, seismotectonics, climate science, archaeology, ecology, public health, ash impact assessment). The interdisciplinary shift in tephra studies over the last two decades is challenged by the lack of standardization that often prevents comparison among regions and across disciplines. To address this challenge, the global tephra community has united through a series of workshops to establish best practice recommendations for tephra studies, including sample collection, analysis and data reporting (https://doi.org/10.5281/zenodo.3866266). This new standardized framework is being incorporated into digital tools and data repositories and supports FAIR (findable, accessible, interoperable and reusable) data principles. Widespread adoption will facilitate consistent tephra documentation and parametrization, foster interdisciplinary communication and improve the effectiveness of data sharing among diverse communities of researchers. Here we report on recent implementations of the best-practice recommendations including: 1) a set of templates for samples, methods documentation, and data reporting, 2) a tephra module in the StraboSpot field app (https://strabospot.org), 3) implementations at SESAR and EarthChem, including a tephra community portal (https://earthchem.org/communities/tephra/), 4) implementation in the Sparrow laboratory data system (https://sparrow-data.org/), and 5) a new manuscript supporting the framework. Data linking is facilitated by extensive use of unique identifiers, including ORCIDs for people; IGSNs for field sites and samples; DOIs for publications, data, and methods; and Smithsonian IDs for volcanoes and eruptions. These developments allow users to follow simple workflows to archive data and facilitate faster access to key research by secondary users.
  2. An implementation of the Sparrow data system (https://sparrow-data.org) is currently being developed to support laboratory workflows for sample preparation, geochemical analysis, and SEM imaging in support of tephra research. Tephra, consisting of fragmental material ejected from volcanoes, has a multidisciplinary array of applications from volcanology to geochronology, archaeology, environmental change, and more. The international tephra research community has developed a comprehensive set of recommendations for data and metadata collection and reporting (https://doi.org/10.5281/zenodo.3866266) as part of a broader effort to adopt FAIR practices. Implementations of these recommendations now exist for field data via StraboSpot (https://strabospot.org/files/StraboSpotTephraHelp.pdf) and for samples, analytical methods, and geochemistry via SESAR and EarthChem (https://earthchem.org/communities/tephra/). Implementing these recommended practices in Sparrow helps to (1) cover laboratory workflows between field sample collection and project data archiving and (2) address a key researcher pain point. As re-emphasized by participants in the Tephra Fusion 2022 workshop earlier this year (Wallace et al., this meeting), the huge workload currently needed to capture and organize data and metadata in preparation for archiving in community data repositories is a major obstacle to achieving FAIR practices. Capturing this information on the fly during laboratory workflows and integrating it in a single data system may overcome this challenge. We are implementing the tephra community recommendations as extensions to Sparrow’s core database schema. Data import pipelines and user interfaces to streamline metadata capture are also being developed. In the longer term, we aim to achieve interoperability with an ecosystem of tools and repositories like StraboSpot, SESAR, EarthChem, and Throughput. The results of these developments will be applicable not just to tephra but also to other research areas that use similar laboratory and analytical methods, such as sedimentology, mineralogy, and petrology.
  3. A series of international workshops held in 2014, 2017, 2019, and 2022 focused on improving tephra studies from field collection through publication and encouraging FAIR (findable, accessible, interoperable, reusable) data practices for tephra data and metadata. Two consensus needs for tephra studies emerged from the 2014 and 2017 workshops: (a) standardization of tephra field data collection, geochemical analysis, correlation, and data reporting, and (b) development of next generation computer tools and databases to facilitate information access across multidisciplinary communities. To achieve (a), we developed a series of recommendations for best practices in tephra studies, from sample collection through analysis and data reporting (https://zenodo.org/record/3866266). A 4-part virtual workshop series (https://tephrochronology.org/cot/Tephra2022/) was held in February and March 2022 to update the tephra community on these developments, to get community feedback, to learn of unmet needs, and to plan a future roadmap for open and FAIR tephra data. More than 230 people from 25 nations registered for the workshop series. The community strongly emphasized the need for better computer systems, including physical infrastructure (repositories and servers), digital infrastructure (software and tools) and human infrastructure (people, training, and professional assistance), to store, manage and serve global tephra datasets. Some desired attributes of improved computer systems include: 1) user friendliness; 2) ability to easily ingest multiparameter tephra data (using best practice recommended data fields); 3) interoperability with existing data repositories; 4) development of tool add-ons (plotting and statistics); 5) improved searchability; 6) development of a tephra portal with access to distributed data systems; and 7) commitments to long-term support from funding agencies, publishers and the cyberinfrastructure community.
  4. Tephra is a unique volcanic product with an unparalleled role in understanding past eruptions, long-term behavior of volcanoes, and the effects of volcanism on climate and the environment. Tephra deposits also provide spatially widespread, high-resolution time-stratigraphic markers across a range of sedimentary settings and thus are used in numerous disciplines (e.g., volcanology, climate science, archaeology). Nonetheless, the study of tephra deposits is challenged by a lack of standardization that inhibits data integration across geographic regions and disciplines. We present comprehensive recommendations for tephra data gathering and reporting that were developed by the tephra science community to guide future investigators and to ensure that sufficient data are gathered for interoperability. Recommendations include standardized field and laboratory data collection, reporting and correlation guidance. These are organized as tabulated lists of key metadata with their definition and purpose. They are system independent and usable for template, tool, and database development. This standardized framework promotes consistent documentation and archiving, fosters interdisciplinary communication, and improves effectiveness of data sharing among diverse communities of researchers.
  5. Two programs that provide high-quality long-term ecological data, the Environmental Data Initiative (EDI) and the National Ecological Observatory Network (NEON), have recently teamed up with data users interested in synthesizing biodiversity data, such as ecological synthesis working groups supported by the US Long Term Ecological Research (LTER) Network Office, to make their data more Findable, Accessible, Interoperable, and Reusable (FAIR). To this end, we have developed a flexible intermediate data design pattern for ecological community data (L1 formatted data in Fig. 1, see Fig. 2 for design details) called "ecocomDP" (O'Brien et al. 2021), and we provide tools to work with data packages in which this design pattern has been implemented. The ecocomDP format provides a data pattern commonly used for reporting community level data, such as repeated observations of species-level measures of biomass, abundance, percent cover, or density across multiple locations. The ecocomDP library for R includes tools to search for data packages, to download or import data packages into an R (programming language) session in a standard format, and to visualize data during the exploration steps that are recommended for data users prior to any cross-study synthesis work. To date, EDI has created 70 ecocomDP data packages derived from their holdings, which include data from the US Long Term Ecological Research (US LTER) program, Long Term Research in Environmental Biology (LTREB) program, and other projects, which are now discoverable and accessible using the ecocomDP library. Similarly, NEON data products for 12 taxonomic groups are discoverable using the ecocomDP search tool. Input from data users provided guidance for the ecocomDP developers in mapping the NEON data products to the ecocomDP format to facilitate interoperability with the ecocomDP data packages available from the EDI repository. The standardized data design pattern allows common data visualizations across data packages, and has the potential to facilitate the development of new tools and workflows for biodiversity synthesis. The broader impacts of this collaboration are intended to lower the barriers for researchers in ecology and the environmental sciences to access and work with long-term biodiversity data and provide a hub around which data providers and data users can develop best practices that will build a diverse and inclusive community of practice.