Pangeo Forge: Crowdsourcing Analysis-Ready, Cloud Optimized Data Production

Stern, Charles; Abernathey, Ryan; Hamman, Joseph; Wegener, Rachel; Lepore, Chiara; Harkins, Sean; Merose, Alexander

doi:10.3389/fclim.2021.782909

Citation Details

Pangeo Forge: Crowdsourcing Analysis-Ready, Cloud Optimized Data Production

Pangeo Forge is a new community-driven platform that accelerates science by providing high-level recipe frameworks alongside cloud compute infrastructure for extracting data from provider archives, transforming it into analysis-ready, cloud-optimized (ARCO) data stores, and providing a human- and machine-readable catalog for browsing and loading. In abstracting the scientific domain logic of data recipes from cloud infrastructure concerns, Pangeo Forge aims to open a door for a broader community of scientists to participate in ARCO data production. A wholly open-source platform composed of multiple modular components, Pangeo Forge presents a foundation for the practice of reproducible, cloud-native, big-data ocean, weather, and climate science without relying on proprietary or cloud-vendor-specific tooling. more »

Award ID(s):: 1928406 2026932

PAR ID:: 10352851

Author(s) / Creator(s):: Stern, Charles; Abernathey, Ryan; Hamman, Joseph; Wegener, Rachel; Lepore, Chiara; Harkins, Sean; Merose, Alexander

Date Published:: 2022-02-10

Journal Name:: Frontiers in Climate

Volume:: 3

ISSN:: 2624-9553

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.3389/fclim.2021.782909

More Like this