Title: Fine-grained, spatio-temporal datasets measuring 200 years of land development in the United States
The collection, processing, and analysis of remote sensing data since the early 1970s has rapidly improved our understanding of change on the Earth's surface. While satellite-based Earth observation has proven to be of vast scientific value, these data are typically confined to recent decades of observation and often lack important thematic detail. Here, we advance in this arena by constructing new spatially explicit settlement data for the United States that extend back to the early 19th century and are consistently enumerated at fine spatial and temporal granularity (i.e. 250 m spatial and 5-year temporal resolution). We create these time series using a large, novel building-stock database to extract and map retrospective, fine-grained spatial distributions of built-up properties in the conterminous United States from 1810 to 2015. From our data extraction, we analyse and publish a series of gridded geospatial datasets that enable novel retrospective historical analysis of the built environment at an unprecedented spatial and temporal resolution. The datasets are part of the Historical Settlement Data Compilation for the United States (https://dataverse.harvard.edu/dataverse/hisdacus, last access: 25 January 2021) and are available at https://doi.org/10.7910/DVN/YSWMDR (Uhl and Leyk, 2020a), https://doi.org/10.7910/DVN/SJ213V (Uhl and Leyk, 2020b), and https://doi.org/10.7910/DVN/J6CYUJ (Uhl and Leyk, 2020c).
Award ID(s):
1924670
PAR ID:
10290537
Author(s) / Creator(s):
Uhl, J. H.; Leyk, S.; McShane, C. M.; Braswell, A. E.; Connor, D. S.; Balk, D.
Date Published:
Journal Name:
Earth system science data
Volume:
13
ISSN:
1866-3508
Page Range / eLocation ID:
119-153
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
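The abstract above describes mapping built-up property records onto a 250 m grid at 5-year steps from 1810 to 2015. The following is a minimal sketch of that general idea, not the authors' actual pipeline: the record layout `(x_m, y_m, year_built)`, the function names, and the sample coordinates are all hypothetical, and points are assumed to be in a projected coordinate system with metre units.

```python
from collections import defaultdict

CELL_SIZE_M = 250   # spatial resolution stated in the abstract
STEP_YEARS = 5      # temporal resolution stated in the abstract


def cell_index(x_m, y_m, cell_size=CELL_SIZE_M):
    """Return (column, row) of the grid cell containing a projected point."""
    return (int(x_m // cell_size), int(y_m // cell_size))


def cumulative_counts(records, start=1810, end=2015, step=STEP_YEARS):
    """Count, per time step and cell, the records built in or before that year.

    records: iterable of (x_m, y_m, year_built) tuples (hypothetical layout).
    Returns a dict {year: {cell: count}}; empty cells are omitted.
    """
    built = defaultdict(list)
    for x_m, y_m, year_built in records:
        built[cell_index(x_m, y_m)].append(year_built)

    series = {}
    for year in range(start, end + 1, step):
        series[year] = {}
        for cell, years in built.items():
            n = sum(1 for y in years if y <= year)
            if n:
                series[year][cell] = n
    return series


def first_built_up_year(series):
    """Earliest time step at which each cell holds at least one record."""
    first = {}
    for year in sorted(series):
        for cell in series[year]:
            first.setdefault(cell, year)
    return first


# Made-up sample records, for illustration only.
records = [(100, 100, 1850), (200, 50, 1903), (600, 100, 1999)]
series = cumulative_counts(records)
first = first_built_up_year(series)
```

Because the counts are cumulative, a record built in 1903 first appears in the 1905 time step; a real workflow would also need to handle records with missing construction years, which this sketch ignores.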
More Like this
  1. Abstract. The collection, processing, and analysis of remote sensing data since the early 1970s has rapidly improved our understanding of change on the Earth's surface. While satellite-based Earth observation has proven to be of vast scientific value, these data are typically confined to recent decades of observation and often lack important thematic detail. Here, we advance in this arena by constructing new spatially explicit settlement data for the United States that extend back to the early 19th century and are consistently enumerated at fine spatial and temporal granularity (i.e. 250 m spatial and 5-year temporal resolution). We create these time series using a large, novel building-stock database to extract and map retrospective, fine-grained spatial distributions of built-up properties in the conterminous United States from 1810 to 2015. From our data extraction, we analyse and publish a series of gridded geospatial datasets that enable novel retrospective historical analysis of the built environment at an unprecedented spatial and temporal resolution. The datasets are part of the Historical Settlement Data Compilation for the United States (https://dataverse.harvard.edu/dataverse/hisdacus, last access: 25 January 2021) and are available at https://doi.org/10.7910/DVN/YSWMDR (Uhl and Leyk, 2020a), https://doi.org/10.7910/DVN/SJ213V (Uhl and Leyk, 2020b), and https://doi.org/10.7910/DVN/J6CYUJ (Uhl and Leyk, 2020c). 
  2. This CSV file contains geometric and topological road network statistics for the majority of counties in the conterminous U.S. The underlying road network data is the USGS-NTD v2019. These road network data from 2019 were clipped to historical settlement extents obtained from the HISDAC-US dataset. Road network statistics are multi-temporal, calculated in time slices for the years 1810-1900, 1880-1920, 1900-1940, 1920-1960, 1940-1980, 1960-2000, and 1980-2015. The historical built-up areas used to model the historical road networks are derived from historical settlement layers from the Historical settlement data compilation for the U.S. (HISDAC-US, Leyk & Uhl 2018). See Burghardt et al. (2022) for details on the modelling strategy. Spatial coverage: all U.S. counties that are covered by the HISDAC-US historical settlement layers. This dataset includes around 2,700 U.S. counties. In the remaining counties, construction year coverage in the underlying ZTRAX data (Zillow Transaction and Assessment Dataset) is low. See Uhl et al. (2021) for details. All data created by Johannes H. Uhl, University of Colorado Boulder, USA. Code available at https://github.com/johannesuhl/USRoadNetworkEvolution. References: Burghardt, K., Uhl, J., Lerman, K., & Leyk, S. (2022). Road Network Evolution in the Urban and Rural United States Since 1900. Computers, Environment and Urban Systems. Leyk, S., & Uhl, J. H. (2018). HISDAC-US, historical settlement data compilation for the conterminous United States over 200 years. Scientific data, 5(1), 1-14. DOI: https://doi.org/10.1038/sdata.2018.175 Uhl, J. H., Leyk, S., McShane, C. M., Braswell, A. E., Connor, D. S., & Balk, D. (2021). Fine-grained, spatiotemporal datasets measuring 200 years of land development in the United States. Earth system science data, 13(1), 119-153. DOI: https://doi.org/10.5194/essd-13-119-2021
  3. An ESRI shapefile containing spatially generalized built-up areas for each decade from 1900 to 2010, and for 2015, for each core-based statistical area (CBSA, i.e., metropolitan and micropolitan statistical area) in the conterminous United States. These areas are derived from historical settlement layers from the Historical settlement data compilation for the U.S. (HISDAC-US, Leyk & Uhl 2018). See Burghardt et al. (2022) for details on the data processing. Additionally, there is a CSV file (HISDAC-US_patch_statistics.csv) containing the counts of built-up property records (BUPR) and built-up property locations (BUPL), as well as total building indoor area (BUI) and built-up area (BUA) per CBSA, year, and patch, extracted from the HISDAC-US data (Leyk & Uhl 2018, Uhl et al. 2021). This CSV can be joined to the shapefile (column uid2) by concatenating the columns msaid_year_Id. Spatial coverage: all CBSAs that are covered by the HISDAC-US historical settlement layers. This dataset includes around 2,700 U.S. counties. In the remaining counties, construction year coverage in the underlying ZTRAX data (Zillow Transaction and Assessment Dataset) is low. See Uhl et al. (2021) for details. All data created by Johannes H. Uhl, University of Colorado Boulder, USA. Code available at https://github.com/johannesuhl/USRoadNetworkEvolution. References: Burghardt, K., Uhl, J., Lerman, K., & Leyk, S. (2022). Road Network Evolution in the Urban and Rural United States Since 1900. Computers, Environment and Urban Systems. Leyk, S., & Uhl, J. H. (2018). HISDAC-US, historical settlement data compilation for the conterminous United States over 200 years. Scientific data, 5(1), 1-14. DOI: https://doi.org/10.1038/sdata.2018.175 Uhl, J. H., Leyk, S., McShane, C. M., Braswell, A. E., Connor, D. S., & Balk, D. (2021). Fine-grained, spatiotemporal datasets measuring 200 years of land development in the United States. Earth system science data, 13(1), 119-153. DOI: https://doi.org/10.5194/essd-13-119-2021
  4. These GeoTIFF files represent road network statistics for each core-based statistical area (CBSA) in the conterminous U.S., within grid cells of 1 km x 1 km. The road network statistics are based on the National Transportation Dataset (USGS-NTD) v2019. These statistics include:
     gridcell_stats_azimuthvariety_1km_all_cbsas.tif: The number of unique road angles (azimuth / orientation) in bins of 10 degrees per 1 sq km grid cell.
     gridcell_stats_deadendrate_1km_all_cbsas.tif: The proportion of dead ends (nodes of degree 1) among all nodes per 1 sq km grid cell.
     gridcell_stats_kmroad_1km_all_cbsas.tif: The approximate total road network length per 1 sq km grid cell. This is based on the road segment length appended to each road segment centroid and may be biased for very long road segments.
     gridcell_stats_meandegree_1km_all_cbsas.tif: The average nodal degree of all nodes per 1 sq km grid cell.
     gridcell_stats_meangriddedness_1km_all_cbsas.tif: The average griddedness of all nodes per 1 sq km grid cell.
     gridcell_stats_nodedensity_1km_all_cbsas.tif: The number of nodes per 1 sq km grid cell.
     gridcell_stats_nodesperkmroad_1km_all_cbsas.tif: The number of nodes per km of road within each 1 sq km grid cell.
     gridcell_stats_firstbuiltup_1km_all_cbsas.tif: The approximate settlement age per 1 sq km grid cell. This layer is derived from the HISDAC-US First-built-up year (FBUY) layer, which is derived from Zillow's Transaction and Assessment Dataset (ZTRAX). The FBUY data is available here: Leyk, Stefan; Uhl, Johannes H., 2018, "FBUY.tar.gz", Historical settlement composite layer for the U.S. 1810 - 2015, https://doi.org/10.7910/DVN/PKJ90M/BOA5YC, Harvard Dataverse, V2.
     gridcell_stats_1km_all_cbsas_arcmap10.8.mxd: ESRI ArcMap 10.8 MXD file for quick visualization of the gridded surfaces.
     Spatial resolution: 1 km x 1 km. Spatial reference: SR-ORG:7480, USA_Contiguous_Albers_Equal_Area_Conic_USGS_version. Source data: USGS-NTD, HISDAC-US. File format: GeoTIFF. Spatial coverage of the road network metrics: all CBSAs in the conterminous U.S. Spatial coverage of the "first built-up year" surface: all U.S. counties that are covered by the HISDAC-US historical settlement layers. This dataset includes around 2,700 U.S. counties. In the remaining counties, construction year coverage in the underlying ZTRAX data (Zillow Transaction and Assessment Dataset) is low. See Leyk & Uhl (2018) for details. All data created by Johannes H. Uhl, University of Colorado Boulder, USA. Code available at https://github.com/johannesuhl/USRoadNetworkEvolution. References: Burghardt, K., Uhl, J., Lerman, K., & Leyk, S. (2022). Road Network Evolution in the Urban and Rural United States Since 1900. Computers, Environment and Urban Systems. Leyk, S., & Uhl, J. H. (2018). HISDAC-US, historical settlement data compilation for the conterminous United States over 200 years. Scientific data, 5(1), 1-14. DOI: https://doi.org/10.1038/sdata.2018.175
  5. Abstract. Multi-temporal measurements quantifying the changes to the Earth's surface are critical for understanding many natural, anthropogenic, and social processes. Researchers typically use remotely sensed Earth observation data to quantify and characterize such changes in land use and land cover (LULC). However, such data sources are limited in their availability prior to the 1980s. While an observational window of 40 to 50 years is sufficient to study most recent LULC changes, processes such as urbanization, land development, and the evolution of urban and coupled nature–human systems often operate over longer time periods covering several decades or even centuries. Thus, to quantify and better understand such processes, alternative historical–geospatial data sources are required that extend farther back in time. However, such data are rare, and processing is labor-intensive, often involving manual work. To overcome the resulting lack in quantitative knowledge of urban systems and the built environment prior to the 1980s, we leverage cadastral data with rich thematic property attribution, such as building usage and construction year. We scraped, harmonized, and processed over 12 000 000 building footprints including construction years to create a multi-faceted series of gridded surfaces, describing the evolution of human settlements in Spain from 1900 to 2020, at 100 m spatial and 5-year temporal resolution. These surfaces include measures of building density, built-up intensity, and built-up land use. We evaluated our data against a variety of data sources including remotely sensed human settlement data and land cover data, model-based historical land use depictions, and historical maps and historical aerial imagery and find high levels of agreement. 
This new data product, the Historical Settlement Data Compilation for Spain (HISDAC-ES), is publicly available (https://doi.org/10.6084/m9.figshare.22009643, Uhl et al., 2023a) and represents a rich source for quantitative, long-term analyses of the built environment and related processes over large spatial and temporal extents and at fine resolutions. 
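Item 3 in the list above notes that the patch statistics CSV can be joined to the shapefile's uid2 column by concatenating the columns msaid_year_Id. The sketch below illustrates one reading of that instruction and rests on assumptions: that the CSV has three separate columns named `msaid`, `year`, and `Id`, that they are joined with underscores, and that the shapefile attributes are available as plain dictionaries. The sample values are made up and the helper names are hypothetical; check the actual file headers before relying on this.

```python
import csv
import io


def make_uid(row):
    """Build the join key from msaid, year and Id (assumed column names)."""
    return f"{row['msaid']}_{row['year']}_{row['Id']}"


def join_stats(shape_rows, stats_rows):
    """Attach patch statistics to shapefile attribute rows via uid2.

    Rows without a matching key are dropped (an inner join).
    """
    stats_by_uid = {make_uid(r): r for r in stats_rows}
    joined = []
    for shape in shape_rows:
        stats = stats_by_uid.get(shape["uid2"])
        if stats is not None:
            joined.append({**shape, **stats})
    return joined


# Made-up stand-in for HISDAC-US_patch_statistics.csv (illustrative values).
stats_csv = io.StringIO(
    "msaid,year,Id,BUPR\n"
    "10100,1900,0,12\n"
    "10100,1910,0,30\n"
)
stats_rows = list(csv.DictReader(stats_csv))

# Made-up stand-in for the shapefile attribute table.
shape_rows = [{"uid2": "10100_1910_0"}, {"uid2": "99999_1900_0"}]

joined = join_stats(shape_rows, stats_rows)
```

`csv.DictReader` returns every value as a string, so numeric columns such as BUPR would need converting before any arithmetic.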