skip to main content


Title: Monthly and Annual contour lines of the zero and the positive maximum of the Wind Stress Curl over Western North Atlantic during 1980-2019 and the Gulf Stream path during 1993-2019.

This dataset includes multiple fields: (i) files for monthly and annual fields for the max curl line and the zero curl line at 0.1 degree longitudinal resolutions; (ii) files for monthly and annual GS path obtained from Altimetry and originally processed by Andres (2016) at 0.1 degree longitudinal resolution. The maximum curl line (MCL) and the zero curl line (ZCL) calculations are briefly described here and are based on the original wind data (at 1.25 x 1.25 degree) provided by the Japanese reanalysis (JRA-55; Kobayashi et al., 2015) and available at https://zenodo.org/record/8200832 (Gifford et al. 2023). For details see Gifford, 2023. 

The wind stress curl (WSC) fields used for the MCL and ZCL calculations extend from 80W to 45W and 30N to 45N at the 1.25 by 1.25-degree resolution.  The MCL is defined as the maximum WSC values greater than zero within the domain per 1.25 degree longitude. As such, it is a function of longitude and is not a constant WSC value unlike the zero contour. High wind stress curl values that occurred near the coast were not included within this calculation. After MCL at the 1.25 resolution was obtained the line was smoothed with a gaussian smoothing and interpolated on to a 0.1 longitudinal resolution. The smoothed MCL lines at 0.1 degree resolution are provided in separate files for monthly and annual averages (2 files). Similarly, 2 other files (monthly and annual) are provided for the ZCL.    

Like the MCL, the ZCL is a line derived from 1.25 degree longitude throughout the domain under the condition that it's the line of zero WSC. The ZCL is constant at 0 and does not vary spatially like the MCL. If there are more than one location of zero curl for a given longitude the first location south of the MCL is selected. Similar to the MCL, the ZCL was smoothed with a gaussian smoothing and interpolated on to a 0.1 longitudinal resolution.   

The above files span the years from 1980 through 2019. So, the monthly files have 480 months starting January 1980, and the annual files have 40 years of data. The files are organized with each row being a new time step and each column being a different longitude. Therefore, the monthly MCL and ZCL files are each 480 x 351 for the 0.1 resolution data. Similarly, the annual files are 40 x 351 for the 0.1 degree resolution data.  

Note that the monthly MCLs and ZCLs are obtained from the monthly wind-stress curl fields. The annual MCLs and ZCLs are obtained from the annual wind-stress curl fields.

Since the monthly curl fields preserves more atmospheric mesoscales than the annual curl fields, the 12-month average of the monthly MCLs and ZCLs will not match with the annual MCLs and ZCLs derived from the annual curl field.  The annual MCLs and ZCLs provided here are obtained from the annual curl fields and representative metrics of the wind forcing on an annual time-scale. 

Furthermore, the monthly Gulf Stream axis path (25 cm isoheight from Altimeter, reprocessed by Andres (2016) technique) from 1993 through 2019 have been made available here. A total of 324 monthly paths of the Gulf Stream are tabulated. In addition, the annual GS paths for these 27 years (1993-2019) of altimetry era have been put together for ease of use. The monthly Gulf Stream paths have been resampled and reprocessed for uniqueness at every 0.1 degree longitude from 75W to 50W and smoothed with a 100 km (10 point) running average via matlab. The uniqueness has been achieved by using Consolidator algorithm (D’Errico, 2023). 

Each monthly or annual GS path has 251 points between 75W to 50W at 0.1 degree resolution.  

Please contact igifford@earth.miami.edu for any queries. {"references": ["Andres, M., 2016. On the recent destabilization of the Gulf Stream path downstream of Cape Hatteras. Geophysical Research Letters, 43(18), 9836-9842.", "D'Errico, J., 2023. Consolidator (https://www.mathworks.com/matlabcentral/fileexchange/ 8354-consolidator), MATLAB Central File Exchange. Retrieved June 17, 2023.", "Gifford, Ian. H., 2023. The Synchronicity of the Gulf Stream Free Jet and the Wind Induced Cyclonic Vorticity Pool. MS Thesis, University of Massachusetts Dartmouth. 75pp.", "Gifford, Ian, H., Avijit Gangopadhyay, Magdalena Andres, Glen Gawarkiewicz, Hilde Oliver, Adrienne Silver, 2023. Wind Stress, Wind Stress Curl, and Upwelling Velocities in the Northwest Atlantic (80-45W, 30-45N) during 1980-2019, https://zenodo.org/record/8200832.", "Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi, K., Kamahori, H., Kobayashi, C., Endo, H. and Miyaoka, K., 2015. The JRA-55 reanalysis: General specifications and basic characteristics.\u202fJournal of the Meteorological Society of Japan. Ser. II,\u202f93(1), pp.5-48. Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi, K., Kamahori, H., Kobayashi, C., Endo, H. and Miyaoka, K., 2015. The JRA-55 reanalysis: General specifications and basic characteristics.\u202fJournal of the Meteorological Society of Japan. Ser. II,\u202f93(1), pp.5-48."]} 
more » « less
Award ID(s):
2123283 1851242
PAR ID:
10448887
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Zenodo
Date Published:
Edition / Version:
1.0.0
Subject(s) / Keyword(s):
Ocean-Air Interaction Gulf Stream Force and Response Ocean Circulation Wind Stress Curl Altimetry Zero Wind Stress Curl Maximum Wind Stress Curl Gulf Stream Path
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This dataset contains three netcdf files that pertain to monthly, seasonal, and annual fields of surface wind stress, wind stress curl, and curl-derived upwelling velocities over the Northwest Atlantic (80-45W, 30-45N) covering a forty year period from 1980 to 2019. Six-hourly surface (10 m) wind speed components from the Japanese 55-year reanalysis (JRA-55; Kobayashi et al., 2015) were processed from 1980 to 2019 over a larger North Atlantic domain of 100W to 10E and 10N to 80N. Wind stress was computed using a modified step-wise formulation, originally based on (Gill, 1982) and a non-linear drag coefficient (Large and Pond, 1981), and later modified for low speeds (Trenberth et al., 1989). See Gifford (2023) for more details.   

    After the six-hourly zonal and meridional wind stresses were calculated, the zonal change in meridional stress (curlx) and the negative meridional change in zonal stress (curly) were found using NumPy’s gradient function in Python (Harris et al., 2020) over the larger North Atlantic domain (100W-10E, 10-80N). The curl (curlx + curly) over the study domain (80-45W, 10-80N) is then extracted, which maintain a constant order of computational accuracy in the interior and along the boundaries for the smaller domain in a centered-difference gradient calculation. 

    The monthly averages of the 6-hour daily stresses and curls were then computed using the command line suite climate data operators (CDO, Schulzweida, 2022) monmean function. The seasonal (3-month average) and annual averages (12-month average) were calculated in Python using the monthly fields with NumPy (NumPy, Harris et al., 2020). 

    Corresponding upwelling velocities at different time-scales were obtained from the respective curl fields and zonal wind stress by using the Ekman pumping equation of the study by Risien and Chelton (2008; page 2393). Please see Gifford (2023) for more details.   

    The files each contain nine variables that include longitude, latitude, time, zonal wind stress, meridional wind stress, zonal change in meridional wind stress (curlx), the negative meridional change in zonal wind stress (curly), total curl, and upwelling. Units of time begin in 1980 and are months, seasons (JFM etc.), and years to 2019. The longitude variable extends from 80W to 45W and latitude is 30N to 45N with uniform 1.25 degree resolution.  

    Units of stress are in Pascals, units of curl are in Pascals per meter, and upwelling velocity is described by centimeters per day. The spatial grid is a 29 x 13 longitude x latitude array. 

    Filenames: 

    monthly_windstress_wsc_upwelling.nc: 480 time steps from 80W to 45W and 30N to 45N.

    seasonal_windstress_wsc_upwelling.nc: 160 time steps from 80W to 45W and 30N to 45N.

    annual_windstress_wsc_upwelling.nc: 40 time steps from 80W to 45W and 30N to 45N.

    Please contact igifford@earth.miami.edu for any queries. {"references": ["Gifford, I.H., 2023. The Synchronicity of the Gulf Stream Free Jet and the Wind Induced Cyclonic Vorticity Pool. MS Thesis, University of Massachusetts Dartmouth. 75pp.", "Gill, A. E. (1982). Atmosphere-ocean dynamics (Vol. 30). Academic Press.", "Harris, C.R., Millman, K.J., van der Walt, S.J. et al. Array programming with NumPy. Nature 585, 357\u2013362 (2020). DOI: 10.1038/s41586-020-2649-2.", "Japan Meteorological Agency/Japan (2013), JRA-55: Japanese 55-year Reanalysis, Daily 3-Hourly and 6-Hourly Data, https://doi.org/10.5065/D6HH6H41, Research Data Archive at the National Center for Atmospheric Research, Computational and Information Systems Laboratory, Boulder, Colo. (Updated monthly.)", "Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi, K., Kamahori, H., Kobayashi, C., Endo, H. and Miyaoka, K., 2015. The JRA-55 reanalysis: General specifications and basic characteristics.\u202fJournal of the Meteorological Society of Japan. Ser. II,\u202f93(1), pp.5-48.", "Large, W.G. and Pond, S., 1981. Open ocean momentum flux measurements in moderate to strong winds.\u202fJournal of physical oceanography,\u202f11(3), pp.324-336.", "Risien, C.M. and Chelton, D.B., 2008. A global climatology of surface wind and wind stress fields from eight years of QuikSCAT scatterometer data.\u202fJournal of Physical Oceanography,\u202f38(11), pp.2379-2413.", "Schulzweida, Uwe. (2022). CDO User Guide (2.1.0). Zenodo. https://doi.org/10.5281/zenodo.7112925.", "Trenberth, K.E., Large, W.G. and Olson, J.G., 1989. The effective drag coefficient for evaluating wind stress over the oceans.\u202fJournal of Climate,\u202f2(12), pp.1507-1516."]} 
    more » « less
  2. This dataset contains monthly average output files from the iCAM6 simulations used in the manuscript "Enhancing understanding of the hydrological cycle via pairing of process-oriented and isotope ratio tracers," in review at the Journal of Advances in Modeling Earth Systems. A file corresponding to each of the tagged and isotopic variables used in this manuscript is included. Files are at 0.9° latitude x 1.25° longitude, and are in NetCDF format. Data from two simulations are included: 1) a simulation where the atmospheric model was "nudged" to ERA5 wind and surface pressure fields, by adding an additional tendency (see section 3.1 of associated manuscript), and 2) a simulation where the atmospheric state was allowed to freely evolve, using only boundary conditions imposed at the surface and top of atmosphere. Specific information about each of the variables provided is located in the "usage notes" section below. Associated article abstract: The hydrologic cycle couples the Earth's energy and carbon budgets through evaporation, moisture transport, and precipitation. Despite a wealth of observations and models, fundamental limitations remain in our capacity to deduce even the most basic properties of the hydrological cycle, including the spatial pattern of the residence time (RT) of water in the atmosphere and the mean distance traveled from evaporation sources to precipitation sinks. Meanwhile, geochemical tracers such as stable water isotope ratios provide a tool to probe hydrological processes, yet their interpretation remains equivocal despite several decades of use. As a result, there is a need for new mechanistic tools that link variations in water isotope ratios to underlying hydrological processes. Here we present a new suite of “process-oriented tags,” which we use to explicitly trace hydrological processes within the isotopically enabled Community Atmosphere Model, version 6 (iCAM6). Using these tags, we test the hypotheses that precipitation isotope ratios respond to parcel rainout, variations in atmospheric RT, and preserve information regarding meteorological conditions during evaporation. We present results for a historical simulation from 1980 to 2004, forced with winds from the ERA5 reanalysis. We find strong evidence that precipitation isotope ratios record information about atmospheric rainout and meteorological conditions during evaporation, but little evidence that precipitation isotope ratios vary with water vapor RT. These new tracer methods will enable more robust linkages between observations of isotope ratios in the modern hydrologic cycle or proxies of past terrestrial environments and the environmental processes underlying these observations.   Details about the simulation setup can be found in section 3 of the associated open-source manuscript, "Enhancing understanding of the hydrological cycle via pairing of process‐oriented and isotope ratio tracers." In brief, we conducted two simulations of the atmosphere from 1980-2004 using the isotope-enabled version of the Community Atmosphere Model 6 (iCAM6) at 0.9x1.25° horizontal resolution, and with 30 vertical hybrid layers spanning from the surface to ~3 hPa. In the first simulation, wind and surface pressure fields were "nudged" toward the ERA5 reanalysis dataset by adding a nudging tendency, preventing the model from diverging from observed/reanalysis wind fields. In the second simulation, no additional nudging tendency was included, and the model was allowed to evolve 'freely' with only boundary conditions provided at the top (e.g., incoming solar radiation) and bottom (e.g., observed sea surface temperatures) of the model. In addition to the isotopic variables, our simulation included a suite of 'process-oriented tracers,' which we describe in section 2 of the manuscript. These variables are meant to track a property of water associated with evaporation, condensation, or atmospheric transport. Metadata are provided about each of the files below; moreover, since the attached files are NetCDF data - this information is also provided with the data files. NetCDF metadata can be accessed using standard tools (e.g., ncdump). Each file has 4 variables: the tagged quantity, and the associated coordinate variables (time, latitude, longitude). The latter three are identical across all files, only the tagged quantity changes. Twelve files are provided for the nudged simulation, and an additional three are provided for the free simulations: Nudged simulation files iCAM6_nudged_1980-2004_mon_RHevap: Mass-weighted mean evaporation source property: RH (%) with respect to surface temperature. iCAM6_nudged_1980-2004_mon_Tevap: Mass-weighted mean evaporation source property: surface temperature in Kelvin iCAM6_nudged_1980-2004_mon_Tcond: Mass-weighted mean condensation property: temperature (K) iCAM6_nudged_1980-2004_mon_columnQ: Total (vertically integrated) precipitable water (kg/m2).  Not a tagged quantity, but necessary to calculate depletion times in section 4.3 (e.g., Fig. 11 and 12). iCAM6_nudged_1980-2004_mon_d18O: Precipitation d18O (‰ VSMOW) iCAM6_nudged_1980-2004_mon_d18Oevap_0: Mass-weighted mean evaporation source property - d18O of the evaporative flux (e.g., the 'initial' isotope ratio prior to condensation), (‰ VSMOW) iCAM6_nudged_1980-2004_mon_dxs: Precipitation deuterium excess (‰ VSMOW) - note that precipitation d2H can be calculated from this file and the precipitation d18O as d2H = d-excess - 8*d18O. iCAM6_nudged_1980-2004_mon_dexevap_0: Mass-weighted mean evaporation source property - deuterium excess of the evaporative flux iCAM6_nudged_1980-2004_mon_lnf: Integrated property - ln(f) calculated from the constant-fractionation d18O tracer (see section 3.2). iCAM6_nudged_1980-2004_mon_precip: Total precipitation rate in m/s. Note there is an error in the metadata in this file - it is total precipitation, not just convective precipitation. iCAM6_nudged_1980-2004_mon_residencetime: Mean atmospheric water residence time (in days). iCAM6_nudged_1980-2004_mon_transportdistance: Mean atmospheric water transport distance (in km). Free simulation files iCAM6_free_1980-2004_mon_d18O: Precipitation d18O (‰ VSMOW) iCAM6_free_1980-2004_mon_dxs: Precipitation deuterium excess (‰ VSMOW) - note that precipitation d2H can be calculated from this file and the precipitation d18O as d2H = d-excess - 8*d18O. iCAM6_free_1980-2004_mon_precip: Total precipitation rate in m/s. Note there is an error in the metadata in this file - it is total precipitation, not just convective precipitation. 
    more » « less
  3. Data Description:

    To improve SOC estimation in the United States, we upscaled site-based SOC measurements to the continental scale using multivariate geographic clustering (MGC) approach coupled with machine learning models. First, we used the MGC approach to segment the United States at 30 arc second resolution based on principal component information from environmental covariates (gNATSGO soil properties, WorldClim bioclimatic variables, MODIS biological variables, and physiographic variables) to 20 SOC regions. We then trained separate random forest model ensembles for each of the SOC regions identified using environmental covariates and soil profile measurements from the International Soil Carbon Network (ISCN) and an Alaska soil profile data. We estimated United States SOC for 0-30 cm and 0-100 cm depths were 52.6 + 3.2 and 108.3 + 8.2 Pg C, respectively.

    Files in collection (32):

    Collection contains 22 soil properties geospatial rasters, 4 soil SOC geospatial rasters, 2 ISCN site SOC observations csv files, and 4 R scripts

    gNATSGO TIF files:

    ├── available_water_storage_30arc_30cm_us.tif                   [30 cm depth soil available water storage]
    ├── available_water_storage_30arc_100cm_us.tif                 [100 cm depth soil available water storage]
    ├── caco3_30arc_30cm_us.tif                                                 [30 cm depth soil CaCO3 content]
    ├── caco3_30arc_100cm_us.tif                                               [100 cm depth soil CaCO3 content]
    ├── cec_30arc_30cm_us.tif                                                     [30 cm depth soil cation exchange capacity]
    ├── cec_30arc_100cm_us.tif                                                   [100 cm depth soil cation exchange capacity]
    ├── clay_30arc_30cm_us.tif                                                     [30 cm depth soil clay content]
    ├── clay_30arc_100cm_us.tif                                                   [100 cm depth soil clay content]
    ├── depthWT_30arc_us.tif                                                        [depth to water table]
    ├── kfactor_30arc_30cm_us.tif                                                 [30 cm depth soil erosion factor]
    ├── kfactor_30arc_100cm_us.tif                                               [100 cm depth soil erosion factor]
    ├── ph_30arc_100cm_us.tif                                                      [100 cm depth soil pH]
    ├── ph_30arc_100cm_us.tif                                                      [30 cm depth soil pH]
    ├── pondingFre_30arc_us.tif                                                     [ponding frequency]
    ├── sand_30arc_30cm_us.tif                                                    [30 cm depth soil sand content]
    ├── sand_30arc_100cm_us.tif                                                  [100 cm depth soil sand content]
    ├── silt_30arc_30cm_us.tif                                                        [30 cm depth soil silt content]
    ├── silt_30arc_100cm_us.tif                                                      [100 cm depth soil silt content]
    ├── water_content_30arc_30cm_us.tif                                      [30 cm depth soil water content]
    └── water_content_30arc_100cm_us.tif                                   [100 cm depth soil water content]

    SOC TIF files:

    ├──30cm SOC mean.tif                             [30 cm depth soil SOC]
    ├──100cm SOC mean.tif                           [100 cm depth soil SOC]
    ├──30cm SOC CV.tif                                 [30 cm depth soil SOC coefficient of variation]
    └──100cm SOC CV.tif                              [100 cm depth soil SOC coefficient of variation]

    site observations csv files:

    ISCN_rmNRCS_addNCSS_30cm.csv       30cm ISCN sites SOC replaced NRCS sites with NCSS centroid removed data

    ISCN_rmNRCS_addNCSS_100cm.csv       100cm ISCN sites SOC replaced NRCS sites with NCSS centroid removed data


    Data format:

    Geospatial files are provided in Geotiff format in Lat/Lon WGS84 EPSG: 4326 projection at 30 arc second resolution.

    Geospatial projection

    GEOGCS["GCS_WGS_1984", DATUM["D_WGS_1984", SPHEROID["WGS_1984",6378137,298.257223563]], PRIMEM["Greenwich",0], UNIT["Degree",0.017453292519943295]] (base) [jbk@theseus ltar_regionalization]$ g.proj -w GEOGCS["wgs84", DATUM["WGS_1984", SPHEROID["WGS_1984",6378137,298.257223563]], PRIMEM["Greenwich",0], UNIT["degree",0.0174532925199433]]

     

     
    more » « less
  4. Included are outputs from an AMIP-style (data-driven ocean and sea ice) simulation of Variable-Resolution CESM2 from 1979-2015. Resolution is 1° globally with a refined 0.25° resolution over the Southern Ocean as well over the Antarctic ice sheet. Outputs are at a monthly timescale, and include those variables relevant for evaluation. Each netcdf file ends with several relevant tags to indicate <source>.<output deisgnator>.<variable>.<time period>.nc. Atmospheric variables are labeled with "cam" whereas ice sheet variables are labeled with "clm2". clm2.h0.SNOW.198901-199812.nc cam.h1.Q.198901-199812.nc Variables are described for CESM2 (see NCAR documentation for clm and cam) Variables included for cam include FLDS, FLNS, CLDICE, CLDLIQ, LHFLX, PRECC, PRECL, PRECSC, PS,Q, SHFLX, U, V, Z3 Variables included for clm2 include FIRA, FIRE, FLDS, FSDS, FSH, QICE, QRUNOFF, QSNOMELT, QSOIL, RAIN, SNOW. Calculation of surface mass balance (SMB) from these fields is explained in Datta et al., 2023: Datta RT; Herrington A; Lenaerts JTM; Schneider DP; Trusel L; Yin Z; Dunmire D (Sep 2023) Evaluating the impact of enhanced horizontal resolution over the Antarctic domain using a variable-resolution Earth system model. The Cryosphere, 17 (9) , 3847-3866. https://doi.org/10.5194/tc-17-3847-2023 
    more » « less
  5. A biodiversity dataset graph: DataONE

    The intended use of this archive is to facilitate meta-analysis of the Data Observation Network for Earth (DataONE, [1]). DataONE is a distributed infrastructure that provides information about earth observation data. 

    This dataset provides versioned snapshots of the DataONE network as tracked by Preston [2] between 17 October 2018 and 7 July 2019.  

    The archive consists of 256 individual parts (e.g., preston-00.tar.gz, preston-01.tar.gz, ...) to allow for parallel file downloads. The archive contains three types of files: index files, provenance files and data files. Only two index and provenance files are included and have been individually included in this dataset publication. Index files provide a way to links provenance files in time to eestablish a versioning mechanism. Provenance files describe how, when and where the DataONE meta-data files were retrieved. For more information, please visit https://preston.guoda.bio or https://doi.org/10.5281/zenodo.1410543).  

    To retrieve and verify the downloaded DataONE biodiversity dataset graph, first concatenate all the downloaded preston-*.tar.gz files (e.g., cat preston-*.tar.gz > preston.tar.gz). Then, extract the archives into a "data" folder. Alternatively, you can use the preston[2] command-line tool to "clone" this dataset using:

    $ java -jar preston.jar clone --remote https://zenodo.org/record/3277312/files

    After that, verify the index of the archive by reproducing the following result:

    $ java -jar preston.jar history
    <0659a54f-b713-4f86-a917-5be166a14110> <http://purl.org/pav/hasVersion> <hash://sha256/8c67e0741d1c90db54740e08d2e39d91dfd73566ea69c1f2da0d9ab9780a9a9f> .
    <hash://sha256/3ed3acaca7ac57f546d0b8877c1927ab5e08c23eccaa8219600c59c77a72c685> <http://purl.org/pav/previousVersion> <hash://sha256/8c67e0741d1c90db54740e08d2e39d91dfd73566ea69c1f2da0d9ab9780a9a9f> .
    <hash://sha256/857753997a7595a1b372b05641b58a25d9408b7ff08d557ce1fe8b73e4bd383f> <http://purl.org/pav/previousVersion> <hash://sha256/3ed3acaca7ac57f546d0b8877c1927ab5e08c23eccaa8219600c59c77a72c685> .
    <hash://sha256/7ee0376f4c3f7aeeda36927a5211395e5da8201e810e8c7e638a0fe23d001e88> <http://purl.org/pav/previousVersion> <hash://sha256/857753997a7595a1b372b05641b58a25d9408b7ff08d557ce1fe8b73e4bd383f> .
    <hash://sha256/68b4974d8ab7c4c7a7a4305065839b60ba460aaa862590b34c67877738feba90> <http://purl.org/pav/previousVersion> <hash://sha256/7ee0376f4c3f7aeeda36927a5211395e5da8201e810e8c7e638a0fe23d001e88> .
    <hash://sha256/060a76d56255bf9482c951748c91291fddeeb20f180632132be1344e081b2372> <http://purl.org/pav/previousVersion> <hash://sha256/68b4974d8ab7c4c7a7a4305065839b60ba460aaa862590b34c67877738feba90> .
    <hash://sha256/29357bdfab4548025f8a5743301f5c3c9146fa436c39e3c9e019fb9409ac9c42> <http://purl.org/pav/previousVersion> <hash://sha256/060a76d56255bf9482c951748c91291fddeeb20f180632132be1344e081b2372> .
    <hash://sha256/3669cd95100d1d533eb8953ff4ec5092cbd8addb8879b3e6262191148a8a3ebb> <http://purl.org/pav/previousVersion> <hash://sha256/29357bdfab4548025f8a5743301f5c3c9146fa436c39e3c9e019fb9409ac9c42> .
    <hash://sha256/8dc1663299359d271cb1b4c14ad521d0f1be67743689dd18016543dc1e097efb> <http://purl.org/pav/previousVersion> <hash://sha256/3669cd95100d1d533eb8953ff4ec5092cbd8addb8879b3e6262191148a8a3ebb> .
    <hash://sha256/dc4903e8afee651db1d9bf509f20503bf9c8e89679c4bcffb46d5b97440cb6de> <http://purl.org/pav/previousVersion> <hash://sha256/8dc1663299359d271cb1b4c14ad521d0f1be67743689dd18016543dc1e097efb> .

    To check the integrity of the extracted archive, confirm that each line produce by the command "preston verify" produces lines as shown below, with each line including "CONTENT_PRESENT_VALID_HASH". Depending on hardware capacity, this may take a while.

    $ java -jar preston.jar verify
    hash://sha256/e55c1034d985740926564e94decd6dc7a70f779a33e7deb931553739cda16945    file:/home/preston/preston-dataone/data/e5/5c/e55c1034d985740926564e94decd6dc7a70f779a33e7deb931553739cda16945    OK    CONTENT_PRESENT_VALID_HASH    21580
    hash://sha256/d0ddcc2111b6134a570bcc7d89375920ef4d754130cecc0727c79d2b05a9f81f    file:/home/preston/preston-dataone/data/d0/dd/d0ddcc2111b6134a570bcc7d89375920ef4d754130cecc0727c79d2b05a9f81f    OK    CONTENT_PRESENT_VALID_HASH    2035
    hash://sha256/472de9d1c9fd7e044aac409abfbfff9f12c6b69359df995d431009580ffb0f53    file:/home/preston/preston-dataone/data/47/2d/472de9d1c9fd7e044aac409abfbfff9f12c6b69359df995d431009580ffb0f53    OK    CONTENT_PRESENT_VALID_HASH    1935
    hash://sha256/b29879462cd43862129c5cf9b149c41ecd33ffef284a4dbea4ac1c0f90108687    file:/home/preston/preston-dataone/data/b2/98/b29879462cd43862129c5cf9b149c41ecd33ffef284a4dbea4ac1c0f90108687    OK    CONTENT_PRESENT_VALID_HASH    1553

    Note that a copy of the java program "preston", preston.jar, is included in this publication. The program runs on java 8+ virtual machine using "java -jar preston.jar", or in short "preston". 

    Files in this data publication:

    README - this file

    preston.jar - executable java jar containing preston[2] v0.1.1.

    preston-[00-ff].tar.gz - preston archives containing DataONE meta-data files, their provenance and a provenance index.

    2a5de79372318317a382ea9a2cef069780b852b01210ef59e06b640a3539cb5a - preston index file
    2aecaf289def0e23a27058bf7715f226ef9189905f0be13228174825633125cf - preston index file
    3d38b70198e448674be6a63d14b9817f3a956f48bba7418fa7baa086a56c05b7 - preston index file
    66ad3e5e904740f1e835ac6718dda4279e0c24b204ea0d1113cda1352a5072ba - preston index file
    8bf062872ce958545d361e9d53a552ffb025ac29ab875caad1157c0995d34f66 - preston index file
    d9378616636be3686bbabd5bf29d50f0ef0e5ceb5ddd7dfce47f7e755b596b7d - preston index file
    da26fa6e7371385ed3f61af9a766221c833060d59dfd4869bbd7110f95f288db - preston index file
    e4103a75627857de3ee2e317429108611c244fc448c01d1d7bf652115c3b8a55 - preston index file
    eb368fedb8f100210dd968edcf80f4d13cab3dd64135a6ab744102cf15e68c94 - preston index file
    ff92b6c06ae5286bd2f1db679e0fcc4da294acb9bc01b2e9522378d99218c2e3 - preston index file

    [1] DataONE, https://www.dataone.org
    [2] https://preston.guoda.bio, https://doi.org/10.5281/zenodo.1410543 . DataONE was crawled via Preston with "preston update -u https://dataone.org".

    This work is funded in part by grant NSF OAC 1839201 from the National Science Foundation

     
    more » « less