skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Dataset: Spatial Variability and Uncertainty of Soil Nitrogen across the Conterminous United States at Different Depths
Soil nitrogen (N) is an important driver of plant productivity and ecosystem functioning; consequently, it is critical to understand its spatial variability from local-to-global scales. Here we provide a quantitative assessment of the three-dimensional spatial distribution of soil N across the conterminous United States (CONUS) using a digital soil mapping (DSM) approach. We used a random forest-regression kriging algorithm to predict soil N concentrations and associated uncertainty across six soil depths (0-5, 5-15, 15-30, 30-60, 60-100, 100-200 cm) at 5 km spatial grids. Across CONUS, there is a strong spatial dependence of soil N, where soil N concentrations decrease but uncertainty increases with soil depth. Soil N was higher in Pacific Northwest, Northeast, and Great Lakes National Ecological Observatory Network (NEON) ecoclimatic domains. Model uncertainty was higher in Atlantic Neotropical, Southern Rockies/Colorado Plateau and Southeast NEON domains. We also compared our soil N predictions with satellite-derived gross primary production (GPP) and forest biomass from the National Biomass and Carbon Dataset. Finally, we used uncertainty information to propose optimized locations for designing future soil surveys and found that the Atlantic Neotropical, Pacific Northwest, Pacific Southwest, and Appalachian/Cumberland Plateau NEON domains may require larger survey efforts. We highlight the need to increase knowledge of biophysical factors regulating soil processes at deeper depths to better characterize the three-dimensional space of soils. Our results provide a national benchmark regarding the spatial variability and uncertainty of soil N and reveal areas in need of a better representation.</p></p>This dataset includes all covariates used for modeling soil Nitrogen, the training data, and the modeling output. The output represents raster files at 5km resolution of soil N at different depths and associated model uncertainty.</p></p>Main reference:</p>Smith EM, Guevara M, Tarin T, Pouyat R, Vargas R. Spatial variability and uncertainty of soil nitrogen across the conterminous United States (in review). Ecosphere.</p>  more » « less
Award ID(s):
2103836
PAR ID:
10395727
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
figshare
Date Published:
Subject(s) / Keyword(s):
50102 Ecosystem Function 50205 Environmental Management Environmental Science Soil Science 80606 Global Information Systems 80110 Simulation and Modelling 170203 Knowledge Representation and Machine Learning
Format(s):
Medium: X Size: 150149467 Bytes
Size(s):
150149467 Bytes
Sponsoring Org:
National Science Foundation
More Like this
  1. Improving our ability to understand and predict the dynamics of the terrestrial carbon cycle remains a pressing challenge despite a rapidly growing volume and diversity of Earth Observation data. State data assimilation represents a path forward via an iterative cycle of making process-based forecasts and then statistically reconciling these forecasts against numerous ground-based and remotely-sensed data constraints into a “reanalysis” data product that provides full spatiotemporal carbon budgets with robust uncertainty accounting. Here we report on an >100x expansion of the PEcAn+SIPNET reanalysis from 500 sites CONUS, 25 ensemble members, and 2 data constraints to 6400 sites across North America, 100 ensemble members, and 5 data constraints: GEDI and Landtrendr AGB, MODIS LAI, SoilGrids Soil C, and SMAP soil moisture. We also report on an ensemble-based machine learning (ML) downscaling to a 1km product that preserves spatial, temporal, and across-variable covariances and demonstrate the impacts of these covariances on uncertainty accounting (Fig. 1). Synergistically, we use the same ML models to assess what climate, vegetation, and soil variables explain the spatiotemporal variability in different C pools and fluxes. In addition, we review a wide range of ongoing validation activities, comparing the outputs of the reanalysis against withheld data from: Ameriflux and NEON NEE and LE; USFS Forest Inventory biomass, biomass increment, tree rings, soil C, and litter; and NEON soil C and soil respiration. Finally, we touch on ML analyses to diagnose and correct systematic biases and emulator-based recalibration efforts. 
    more » « less
  2. Abstract The contributions of oceanic and atmospheric variability to spatially widespread summer droughts in the contiguous United States (hereafter, pan‐CONUS droughts) are investigated using 16‐member ensembles of the Community Climate Model version 3 (CCM3) forced with observed sea surface temperatures (SSTs) from 1856–2012. The employed SST forcing fields are either (i) global or restricted to the (ii) tropical Pacific or (iii) tropical Atlantic to isolate the impacts of these two ocean regions on pan‐CONUS droughts. Model results show that SST forcing of pan‐CONUS droughts originates almost entirely from the tropical Pacific because of atmospheric highs from the northern Pacific to eastern North America established by La Niña conditions, with little contribution from the tropical Atlantic. Notably, in all three model configurations, internal atmospheric variability influences pan‐CONUS drought occurrence by as much or more than the ocean forcing and can alone cause pan‐CONUS droughts by establishing a dominant high centered over the U.S. montane west. Similar results are found for the Community Atmosphere Model version 5 (CAM5). Model results are compared to the observational record, which supports model‐inferred contributions to pan‐CONUS droughts from La Niñas and internal atmospheric variability. While there may be an additional association with warm Atlantic SSTs in the observational record, this association is ambiguous due to the limited number of observed pan‐CONUS droughts. The ambiguity thus opens the possibility that the observational results are limited by sampling over the twentieth century and not at odds with the suggested dominance of Pacific Ocean forcing in the model ensembles. 
    more » « less
  3. Dataset Abstract A spatial variability study conducted across the LTER Main Site area (45 ha) at KBS prior to dividing the site into 1-ha experimental plots. During the 1988 growing season a stratified unaligned sampling scheme was used to collect 400-600 geo-referenced samples across the site (uniformly planted to a single variety of soybeans) for: geomorphological characteristics (microtopography, soil horizon depths, bulk density, texture); soil chemical characteristics (pH, NO3, NH4, total C, total N, moisture, inorganic P, trace metals); soil biological characteristics (N mineralization potentials, microbial biomass C, microbial biomass N, fungal/bacterial ratios, nematodes and other soil invertebrates; seed bank size); plant weed species abundance, weed biomass at peak standing crop); and insect characteristics (major pest and predator species). Most soil samples were taken before crop emergence, plant phenology samples were taken throughout the growing season, biomass samples were taken at physiological maturity, and insect samples were taken continuously. Dried soil and plant samples are archived for potential future analysis. original data source http://lter.kbs.msu.edu/datasets/6 
    more » « less
  4. The primary mechanism driving plant species loss after nitrogen (N) addition has been often hypothesized to be asymmetric competition for light, resulting from increased aboveground biomass. However, it is largely unknown whether plants’ access to soil water at different depths would affect their responses, fate, and community composition under nitrogen addition. In a semiarid grassland exposed to 8-years of N addition, we measured plant aboveground biomass and diversity under four nitrogen addition rates (0, 4, 10, and 16 g m 2 year 1), and evaluated plant use of water across the soil profile using oxygen isotope. Aboveground biomass increased significantly, but diversity and shallow soil-water content decreased, with increasing rate of nitrogen addition. The water isotopic signature for both plant and soil water at the high N rate indicated that Leymus secalinus (a perennial grass) absorbed 7% more water from the subsurface soil layer (20e100 cm) compared to Elymus dahuricus (a perennial grass) and Artemisia annua (an annual forb). L. secalinus thus had a significantly larger biomass and was more abundant than the other two species at the high N rate but did not differ significantly from the other two species under ambient and the low N rate. Species that could use water from deeper soil layers became dominant when water in the shallow layers was insufficient to meet the demands of increased aboveground plant biomass. Our study highlights the importance of water across soil depths as key driver of plant growth and dominance in grasslands under N addition. 
    more » « less
  5. Abstract The National Ecological Observatory Network Terrestrial Observation System (NEON TOS) produces open‐access data products that allow data users to investigate the impact of change drivers on key “sentinel” taxa and soils. The spatial and temporal sampling strategy that coordinates implementation of these protocols enables integration across TOS products and with products generated by NEON aquatic, remote sensing, and terrestrial instrument subsystems. Here, we illustrate the plots and sampling units that make up the physical foundation of a NEON TOS site, and we describe the scales (subplot, plot, airshed, and site) at which sampling is spatially colocated across protocols and subsystems. We also describe how moderate resolution imaging spectroradiometer‐enhanced vegetation index (MODIS‐EVI) phenology data are used to temporally coordinate TOS sampling within and across years at the continental scale of the observatory. Individually, TOS protocols produce data products that provide insight into populations, communities, and ecosystem processes. Within the spatial and temporal framework that guides cross‐protocol implementation, the ability to draw inference across data products is enhanced. To illustrate this point, we develop an example using R software that links two TOS data products collected with different temporal frequencies at both plot and site spatial scales. A thorough understanding of how TOS protocols are integrated with each other in space and time, and with other NEON subsystems, is necessary to leverage NEON data products to maximum effect. For example, a researcher must understand the spatial and temporal scales at which soil biogeochemistry data, soil microbe biomass data, and plant litter production and chemistry data may be combined to quantify soil nutrient stocks and fluxes across NEON sites. We present clear links among TOS protocols and across NEON subsystems that will enhance the utility of NEON TOS data products for the data user community. 
    more » « less