Upscaling soil organic carbon measurements at the continental scale using multivariate clustering analysis and machine learning

wang, zhuonan; Kumar, Jitendra; R., Samantha Weintraub-Leff; Todd-Brown, Katherine; Mishra, Umakant; Sihi, Debjani

doi:10.5281/zenodo.8057232

{"Abstract":["Data Description<\/strong>:<\/p>\n\nTo improve SOC estimation in the United States, we upscaled site-based SOC measurements to the continental scale using multivariate geographic clustering (MGC) approach coupled with machine learning models. First, we used the MGC approach to segment the United States at 30 arc second resolution based on principal component information from environmental covariates (gNATSGO soil properties, WorldClim bioclimatic variables, MODIS biological variables, and physiographic variables) to 20 SOC regions. We then trained separate random forest model ensembles for each of the SOC regions identified using environmental covariates and soil profile measurements from the International Soil Carbon Network (ISCN) and an Alaska soil profile data. We estimated United States SOC for 0-30 cm and 0-100 cm depths were 52.6 + 3.2 and 108.3 + 8.2 Pg C, respectively.<\/p>\n\nFiles in collection (32):<\/p>\n\nCollection contains 22 soil properties geospatial rasters, 4 soil SOC geospatial rasters, 2 ISCN site SOC observations csv files, and 4 R scripts<\/p>\n\ngNATSGO TIF files:<\/p>\n\n├── available_water_storage_30arc_30cm_us.tif [30 cm depth soil available water storage]\n├── available_water_storage_30arc_100cm_us.tif [100 cm depth soil available water storage]\n├── caco3_30arc_30cm_us.tif [30 cm depth soil CaCO3 content]\n├── caco3_30arc_100cm_us.tif [100 cm depth soil CaCO3 content]\n├── cec_30arc_30cm_us.tif [30 cm depth soil cation exchange capacity]\n├── cec_30arc_100cm_us.tif [100 cm depth soil cation exchange capacity]\n├── clay_30arc_30cm_us.tif [30 cm depth soil clay content]\n├── clay_30arc_100cm_us.tif [100 cm depth soil clay content]\n├── depthWT_30arc_us.tif [depth to water table]\n├── kfactor_30arc_30cm_us.tif [30 cm depth soil erosion factor]\n├── kfactor_30arc_100cm_us.tif [100 cm depth soil erosion factor]\n├── ph_30arc_100cm_us.tif [100 cm depth soil pH]\n├── ph_30arc_100cm_us.tif [30 cm depth soil pH]\n├── pondingFre_30arc_us.tif [ponding frequency]\n├── sand_30arc_30cm_us.tif [30 cm depth soil sand content]\n├── sand_30arc_100cm_us.tif [100 cm depth soil sand content]\n├── silt_30arc_30cm_us.tif [30 cm depth soil silt content]\n├── silt_30arc_100cm_us.tif [100 cm depth soil silt content]\n├── water_content_30arc_30cm_us.tif [30 cm depth soil water content]\n└── water_content_30arc_100cm_us.tif [100 cm depth soil water content]<\/p>\n\nSOC TIF files:<\/p>\n\n├──30cm SOC mean.tif [30 cm depth soil SOC]\n├──100cm SOC mean.tif [100 cm depth soil SOC]\n├──30cm SOC CV.tif [30 cm depth soil SOC coefficient of variation]\n└──100cm SOC CV.tif [100 cm depth soil SOC coefficient of variation]<\/p>\n\nsite observations csv files:<\/p>\n\nISCN_rmNRCS_addNCSS_30cm.csv 30cm ISCN sites SOC replaced NRCS sites with NCSS centroid removed data<\/p>\n\nISCN_rmNRCS_addNCSS_100cm.csv 100cm ISCN sites SOC replaced NRCS sites with NCSS centroid removed data<\/p>\n\n\nData format<\/strong>:<\/p>\n\nGeospatial files are provided in Geotiff format in Lat/Lon WGS84 EPSG: 4326 projection at 30 arc second resolution.<\/p>\n\nGeospatial projection<\/strong>: <\/p>\n\nGEOGCS["GCS_WGS_1984",\n DATUM["D_WGS_1984",\n SPHEROID["WGS_1984",6378137,298.257223563]],\n PRIMEM["Greenwich",0],\n UNIT["Degree",0.017453292519943295]]\n(base) [jbk@theseus ltar_regionalization]$ g.proj -w\nGEOGCS["wgs84",\n DATUM["WGS_1984",\n SPHEROID["WGS_1984",6378137,298.257223563]],\n PRIMEM["Greenwich",0],\n UNIT["degree",0.0174532925199433]]\n<\/code>\n\n <\/p>"]}

More Like this