Toward reproducible and interoperable environmental modeling: Integration of HydroShare with server-side methods for exposing large-extent spatial datasets to models

Choi, Young-Don; Maghami, Iman; Goodall, Jonathan L; Band, Lawrence; Nassar, Ayman; Lin, Laurence; Saby, Linnea; Li, Zhiyu; Wang, Shaowen; Calloway, Chris; Yi, Hong; Seul, Martin; Ames, Daniel P; Tarboton, David G

doi:10.1016/j.envsoft.2024.106239

Citation Details

Toward reproducible and interoperable environmental modeling: Integration of HydroShare with server-side methods for exposing large-extent spatial datasets to models

Reproducible environmental modelling often relies on spatial datasets as inputs, typically manually subset for specific areas. Yet, models can benefit from a data distribution approach facilitated by online repositories, and automating processes to foster reproducibility. This study introduces a method leveraging diverse state-scale spatial datasets to create cohesive packages for GIS-based environmental modelling. These datasets were generated and shared via GeoServer and THREDDS Data Server Connected to HydroShare, contrasting with conventional distribution methods. Using the Regional Hydro-Ecologic Simulation System (RHESSys) across three U.S. catchment-scale watersheds, we demonstrate minimal errors in spatial inputs and model streamflow outputs compared to traditional approaches. This spatial data-sharing method facilitates consistent model creation, fostering reproducibility. Its broader impact allows scientists to tailor the method to various use cases, such as exploring different scales beyond state-scale or applying it to other online repositories using existing data distribution systems, eliminating the need to develop their own. more »