skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Fourth of July Creek Ensemble Simulation Outputs
These are the simulation outputs for an ensemble simulation of the expected response of the Fourth of July Basin in the White Clouds Mountains of central Idaho. The three files represent a base-case recharge scenario and two perturbations of +/-10%. All files inside the compressed tarball archives are ParFlow binary, a description of which is available at https://parflow.readthedocs.io/en/latest/index.html and https://github.com/parflow/parflow. Each realization in the ensemble simulation contains the steady-state pressure field, velocity field components, and the associated permeability field.  more » « less
Award ID(s):
2049687
PAR ID:
10648369
Author(s) / Creator(s):
Publisher / Repository:
Washington State University
Date Published:
Subject(s) / Keyword(s):
Watershed models Uncertainty Integrated hydrologic models
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Physical aquifer models are a highly effective teaching tool for hydrology education, however they come with inherent limitations that include the high cost to purchase, the static configuration of the model materials, the time required to visualize hydrogeological phenomena, and the effort to reset and clean them over time. To address these and other limitations, we have developed an interactive computer simulation of a physical aquifer model called the ParFlow Sandtank. In this gamified interface, users run the simulation using a familiar web-app like interface with sliders and buttons while learning real hydrologic concepts. Our user interface allows participants to dive into the world of hydrology, understanding assumptions about model parameters such as hydraulic conductivity, making decisions about inputs to groundwater aquifer systems such as pumping rates, visualizing outputs such as stream flow, transport, and saturation, and exploring various factors that impact real environmental systems such as climate change. The ParFlow Sandtank has already been used in a variety of educational settings with more than 9,000 users per year, and we feel this emerging educational tool can be used broadly in educational environments and can be scaled-up to provide greater accessibility for students and educators. Here we present the capabilities and workflow of the ParFlow Sandtank, two use cases, and additional tools and custom templates that have been developed to support and enhance the reach of the ParFlow Sandtank. 
    more » « less
  2. This is the simulation data set for the manuscript: Arvelo DM, Comer J, Schmit J, Garcia R (2024) Interfacial water is separated from a hydrophobic silica surface by a gap of 1.2 nm. ACS Nano 18:18683–18692. https://doi.org/10.1021/acsnano.4c05689 This data set includes all files needed to run and analyze the simulations described in the this manuscript in the molecular dynamics software NAMD, as well as the output of the simulations. LAMMPS input files for the ReaxFF simulations are also included. The files are organized into directories corresponding to the figures of the main text and supporting information. They include molecular model structure files (NAMD psf or LAMMPS data), force field parameter files (in CHARMM format or ReaxFF format), initial atomic coordinates (pdb format), NAMD or LAMMPS configuration files, Colvars configuration files, NAMD or LAMMPS log files, and output including restart files (in binary NAMD format) and trajectories in dcd format (downsampled with a stride of 100 to 20 ns per frame). Analysis is controlled by shell scripts (Bash-compatible) that call VMD Tcl scripts or python scripts. These scripts and their output are also included. Version: 1.0. Figure5AC: Simulation of pentadecane on a 5 chains/nm^2 OTS layer. Figure5B_FigureS7: Calculation of force profile for an SiO2 tip asperity model using adaptive biasing force. Systems: octane with 5 chains/nm^2 OTS, octane with 4 chains/nm^2 OTS, decane with 5 chains/nm^2 OTS, water with 5 chains/nm^2 OTS. FigureS6: Simulations showing the effect of octadecane on the structure of the OTS layer for 3 and 5 chains/nm^2 densities. FigureS8: Calculation of the adsorption free energy of tetracosane (C24) at the OTS–water interface using ABF. FigureS9: Python script for estimating the critical concentration to form an alkane layer at the OTS–water interface using the mean-field Ising model. FigureS10: ReaxFF simulation and modeling to create the silanol-terminated amorphous silica model of an AFM tip asperity. FigureS11: Molecular dynamics simulations showing spontaneous assembly of twelve or twenty-four tetracosane (C24) molecules at the interface between water and the alkyl groups of an OTS-conjugated silica surface. 
    more » « less
  3. This dataset contains the output files of numerical simulations of the model GEOCLIM-DynSoil-steady-state, that were conducted for the study "The rise of New Guinea and the fall of Neogene global temperatures". Those outputs are modeled fields of erosion rate, silicate weathering flux, regolith thickness and primary phases depletion, on the island of New Guinea, at 30 minutes resolution, and for 19 time slices from 15 Ma to present-day. This dataset also contains additional Python scripts for generating inputs of the model (namely, slope field) and for projecting the 2D New Guinea erosion field on the 1D transect highlighted in the above-mentioned study. files description The 19 files "gdss_output_NG_slope_X.X-X.X.nc" are output files from the model GEOCLIM-DynSoil-steady-state. "X.X-X.X" indicates the time slice targeted by the simulation (e.g., "14.4-13.2" is "14.4 Ma -- 13.2 Ma). Metadata in each file (netCDF format) describes the fields computed by the model and outputted in the files. Instruction to reproduce those outputs are given in the corresponding branch of the model's Github repository (https://doi.org/10.5281/zenodo.8245945) "Total_erosion.csv" is a 3-column table indicating, for each time interval targeted by the modeling study, the beginning and ending of the time interval, and the total eroded material during that interval, as computed by the model MOVE2D, on the New Guinea 1D transect (see "Transect_locations" files) Because the transect is 1D, the "total eroded material" during a time interval is a 2D value, and its units is km^2. "make_slope_inputs.py" is the Python script that generates the slope field for each time interval, accordingly to the erosion rate during that interval. It uses, as input, a modern (0.5-0Ma) slope field, that can be found in the model's Github repository https://doi.org/10.5281/zenodo.8245945. "erosion_projection.py": script to project the erosion field computed by GEOCLIM-DynSoil-steady-state on the 1D transect, and generate the 2 figures (already present in the current repertory): "erosion_transect_map.png": map the New Guinea erosion with the transect location. "transect_erosion.pdf": plot of the erosion rate along the transect. "geoclim.py": Python script storing the erosion function taken from the GEOCLIM model. This function may be used by "erosion_projection.py", as an alternative way to recompute erosion "from scratch". "Transect_location.*": shapefile indicating the location of 1D New Guinea transect (line). 
    more » « less
  4. Weinstein, Ben (Ed.)
    # Individual Tree Predictions for 100 million trees in the National Ecological Observatory Network Preprint: https://www.biorxiv.org/content/10.1101/2023.10.25.563626v1 ## Manuscript Abstract The ecology of forest ecosystems depends on the composition of trees. Capturing fine-grained information on individual trees at broad scales allows an unprecedented view of forest ecosystems, forest restoration and responses to disturbance. To create detailed maps of tree species, airborne remote sensing can cover areas containing millions of trees at high spatial resolution. Individual tree data at wide extents promises to increase the scale of forest analysis, biogeographic research, and ecosystem monitoring without losing details on individual species composition and abundance. Computer vision using deep neural networks can convert raw sensor data into predictions of individual tree species using ground truthed data collected by field researchers. Using over 40,000 individual tree stems as training data, we create landscape-level species predictions for over 100 million individual trees for 24 sites in the National Ecological Observatory Network. Using hierarchical multi-temporal models fine-tuned for each geographic area, we produce open-source data available as 1km^2 shapefiles with individual tree species prediction, as well as crown location, crown area and height of 81 canopy tree species. Site-specific models had an average performance of 79% accuracy covering an average of six species per site, ranging from 3 to 15 species. All predictions were uploaded to Google Earth Engine to benefit the ecology community and overlay with other remote sensing assets. These data can be used to study forest macro-ecology, functional ecology, and responses to anthropogenic change. ## Data Summary Each NEON site is a single zip archive with tree predictions for all available data. For site abbreviations see: https://www.neonscience.org/field-sites/explore-field-sites. For each site, there is a .zip and .csv. The .zip is a set 1km .shp tiles. The .csv is all trees in a single file. ## Prediction metadata *Geometry* A four pointed bounding box location in utm coordinates. *indiv_id* A unique crown identifier that combines the year, site and geoindex of the NEON airborne tile (e.g. 732000_4707000) is the utm coordinate of the top left of the tile.  *sci_name* The full latin name of predicted species aligned with NEON's taxonomic nomenclature.  *ens_score* The confidence score of the species prediction. This score is the output of the multi-temporal model for the ensemble hierarchical model.  *bleaf_taxa* Highest predicted category for the broadleaf submodel *bleaf_score* The confidence score for the broadleaf taxa submodel  *oak_taxa* Highest predicted category for the oak model  *dead_label* A two class alive/dead classification based on the RGB data. 0=Alive/1=Dead. *dead_score* The confidence score of the Alive/Dead prediction.  *site_id* The four letter code for the NEON site. See https://www.neonscience.org/field-sites/explore-field-sites for site locations. *conif_taxa* Highest predicted category for the conifer model *conif_score* The confidence score for the conifer taxa submodel *dom_taxa* Highest predicted category for the dominant taxa mode submodel *dom_score* The confidence score for the dominant taxa submodel ## Training data The crops.zip contains pre-cropped files. 369 band hyperspectral files are numpy arrays. RGB crops are .tif files. Naming format is __, for example. "NEON.PLA.D07.GRSM.00583_2022_RGB.tif" is RGB crop of the predicted crown of NEON data from Great Smoky Mountain National Park (GRSM), flown in 2022.Along with the crops are .csv files for various train-test split experiments for the manuscript. ### Crop metadata There are 30,042 individuals in the annotations.csv file. We keep all data, but we recommend a filtering step of atleast 20 records per species to reduce chance of taxonomic or data cleaning errors. This leaves 132 species. *score* This was the DeepForest crown score for the crop. *taxonID*For letter species code, see NEON plant taxonomy for scientific name: https://data.neonscience.org/taxonomic-lists *individual*unique individual identifier for a given field record and crown crop *siteID*The four letter code for the NEON site. See https://www.neonscience.org/field-sites/explore-field-sites for site locations. *plotID* NEON plot ID within the site. For more information on NEON sampling see: https://www.neonscience.org/data-samples/data-collection/observational-sampling/site-level-sampling-design *CHM_height* The LiDAR derived height for the field sampling point. *image_path* Relative pathname for the hyperspectral array, can be read by numpy.load -> format of 369 bands * Height * Weight *tile_year*  Flight year of the sensor data *RGB_image_path* Relative pathname for the RGB array, can be read by rasterio.open() # Code repository The predictions were made using the DeepTreeAttention repo: https://github.com/weecology/DeepTreeAttentionKey files include model definition for a [single year model](https://github.com/weecology/DeepTreeAttention/blob/main/src/models/Hang2020.py) and [Data preprocessing](https://github.com/weecology/DeepTreeAttention/blob/cae13f1e4271b5386e2379068f8239de3033ec40/src/utils.py#L59). 
    more » « less
  5. {"Abstract":["The compressed tarball archive contains the simulation files for the Fourth of July Creek basin in the White Clouds mountains of central Idah0, USA under wet, normal, and dry flow conditions using reconstructed, transient inputs. This archive builds on the "Fourth of July Creek Ensemble Simulation Outputs" (doi: 10.7273/000004796) by adding consideration of fully transient conditions and the associated uncertainty. Combined with that archive, all the files necessary to reproduce the runs are provided here. The runs are computationally demanding so this archive also contains the minimally processed datasets of the large outputs including pressure heads and/or streamflow at the monitoring locations and streamflow along the main stem of the creek. Contained in the "Output" folder is the dataset (saved as a Matlab data object) and scripts to plot the responses at monitoring locations or along the main stem."]} 
    more » « less