skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: An Integrated Multi-Source Dataset for Measuring Settlement Evolution in the United States from 1810 to 2020
Abstract Understanding changes in the built environment is vital for sustainable urban development and disaster preparedness. Recent years have seen the emergence of a variety of global, continent-level, and nation-wide datasets related to the current state and the evolution of the built environment, human settlements or building stocks. However, such datasets may face limitations like incomplete coverage, sparse building information, coarse resolution, and limited timeframes. This study addresses these challenges by integrating three spatial datasets to create an extensive, attribute-rich sequence of settlement layers spanning 200 years for the contiguous U.S. This integration process involves complex data processing, merging property-level real estate, parcel, and remote sensing-based building footprint data, and creating gridded multi-temporal settlement layers. This effort unveils the latest edition (Version 2) of the Historical Settlement Data Compilation for the U.S. (HISDAC-US), which includes the latest land use and structural information as of the year 2021. It enables detailed research on urban form and structure, helps assess and map the built environment’s risk to natural hazards, assists in population modeling, supports land use analysis, and aids health studies.  more » « less
Award ID(s):
2121976
PAR ID:
10494537
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Data
Volume:
11
Issue:
1
ISSN:
2052-4463
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This CSV file contains geometric and topological road network statistics for the majority of counties in the conterminous U.S. The underlying road network data is the USGS-NTD v2019. These road network data from 2019 were clipped to historical settlement extents obtained from the HISDAC-US dataset  Road network statistics are multi-temporal, calculated in time slices for the years: 1810-1900, 1880-1920, 1900-1940, 1920-1960, 1940-1980, 1960-2000, 1980-2015 The historical built-up areas used to model the historical road networks are derived from historical settlement layers from the  Historical settlement data compilation for the U.S. (HISDAC-US, Leyk & Uhl 2018). See Burghardt et al. (2022) for details on the modelling strategy. Spatial coverage: all U.S. counties that are covered by the HISDAC-US historical settlement layers. This datasets includes around 2,700 U.S. counties. In the remaining counties, construction year coverage in the underlying ZTRAX data (Zillow Transaction and Assessment Dataset) is low. See Uhl et al. (2021) for details. All data created by Johannes H. Uhl, University of Colorado Boulder, USA. Code available at https://github.com/johannesuhl/USRoadNetworkEvolution. References: Burghardt, K., Uhl, J., Lerman, K.,  & Leyk, S. (2022). Road  Network Evolution in the Urban and Rural  United States Since 1900.  Computers, Environment and Urban Systems. Leyk, S., & Uhl, J. H. (2018). HISDAC-US, historical settlement  data  compilation for the conterminous United States over 200 years. Scientific data, 5(1), 1-14. DOI:  https://doi.org/10.1038/sdata.2018.175  Uhl, J. H., Leyk, S., McShane, C. M., Braswell, A. E., Connor, D.  S.,  & Balk, D. (2021). Fine-grained, spatiotemporal datasets  measuring  200 years of land development in the United States. Earth system science data, 13(1), 119-153. DOI:  https://doi.org/10.5194/essd-13-119-2021  
    more » « less
  2. An ESRI Shapfile containing spatially generalized built-up areas for each decade from 1900 to 2010, and for 2015, for each core-based statistical area (CBSA, i.e., metropolitan and micropolitan statistical area) in the conterminous United States. These areas are derived from historical settlement layers from the Historical settlement data compilation for the U.S. (HISDAC-US, Leyk & Uhl 2018). See Burghardt et al. (2022) for details on the data processing. Additionally, there is a CSV file (HISDAC-US_patch_statistics.csv) containing the counts of built-up property records (BUPR), and -locations (BUPL), as well as total building indoor area (BUI) and built-up area (BUA) per CBSA, year, and patch, extraced from the HISDAC-US data (Uhl & Leyk 2018, Uhl et al. 2021). This CSV can be joined to the shapefile (column uid2) by concatenating the columns msaid_year_Id. Spatial coverage: all CBSAs that are covered by the HISDAC-US historical settlement layers. This dataset includes around 2,700 U.S. counties. In the remaining counties, construction year coverage in the underlying ZTRAX data (Zillow Transaction and Assessment Dataset) is low. See Uhl et al. (2021) for details. All data created by Johannes H. Uhl, University of Colorado Boulder, USA. Code available at https://github.com/johannesuhl/USRoadNetworkEvolution. References: Burghardt, K., Uhl, J., Lerman, K.,  & Leyk, S. (2022). Road Network Evolution in the Urban and Rural  United States Since 1900. Computers, Environment and Urban Systems. Leyk, S., & Uhl, J. H. (2018). HISDAC-US, historical settlement data  compilation for the conterminous United States over 200 years. Scientific data, 5(1), 1-14. DOI:  https://doi.org/10.1038/sdata.2018.175  Uhl, J. H., Leyk, S., McShane, C. M., Braswell, A. E., Connor, D. S.,  & Balk, D. (2021). Fine-grained, spatiotemporal datasets measuring  200 years of land development in the United States. Earth system science data, 13(1), 119-153. DOI:  https://doi.org/10.5194/essd-13-119-2021  
    more » « less
  3. Abstract. Multi-temporal measurements quantifying the changes to the Earth's surface are critical for understanding many natural, anthropogenic, and social processes. Researchers typically use remotely sensed Earth observation data to quantify and characterize such changes in land use and land cover (LULC). However, such data sources are limited in their availability prior to the 1980s. While an observational window of 40 to 50 years is sufficient to study most recent LULC changes, processes such as urbanization, land development, and the evolution of urban and coupled nature–human systems often operate over longer time periods covering several decades or even centuries. Thus, to quantify and better understand such processes, alternative historical–geospatial data sources are required that extend farther back in time. However, such data are rare, and processing is labor-intensive, often involving manual work. To overcome the resulting lack in quantitative knowledge of urban systems and the built environment prior to the 1980s, we leverage cadastral data with rich thematic property attribution, such as building usage and construction year. We scraped, harmonized, and processed over 12 000 000 building footprints including construction years to create a multi-faceted series of gridded surfaces, describing the evolution of human settlements in Spain from 1900 to 2020, at 100 m spatial and 5-year temporal resolution. These surfaces include measures of building density, built-up intensity, and built-up land use. We evaluated our data against a variety of data sources including remotely sensed human settlement data and land cover data, model-based historical land use depictions, and historical maps and historical aerial imagery and find high levels of agreement. This new data product, the Historical Settlement Data Compilation for Spain (HISDAC-ES), is publicly available (https://doi.org/10.6084/m9.figshare.22009643, Uhl et al., 2023a) and represents a rich source for quantitative, long-term analyses of the built environment and related processes over large spatial and temporal extents and at fine resolutions. 
    more » « less
  4. Abstract Multiple aspects of our society are reflected in how we have transformed land through time. However, limited availability of historical-spatial data at fine granularity have hindered our ability to advance our understanding of the ways in which land was developed over the long-term. Using a proprietary, national housing and property database, which is a result of large-scale, industry-fuelled data harmonization efforts, we created publicly available sequences of gridded surfaces that describe built land use progression in the conterminous United States at fine spatial (i.e., 250 m × 250 m) and temporal resolution (i.e., 1 year - 5 years) between the years 1940 and 2015. There are six land use classes represented in the data product: agricultural, commercial, industrial, residential-owned, residential-income, and recreational facilities, as well as complimentary uncertainty layers informing the users about quantifiable components of data uncertainty. The datasets are part of the Historical Settlement Data Compilation for the U.S. (HISDAC-US) and enable the creation of new knowledge of long-term land use dynamics, opening novel avenues of inquiry across multiple fields of study. 
    more » « less
  5. Dias, João Miguel (Ed.)
    Current estimates of U.S. property at risk of coastal hazards and sea level rise (SLR) are staggering—evaluated at over a trillion U.S. dollars. Despite being enormous in the aggregate, potential losses due to SLR depend on mitigation, adaptation, and exposure and are highly uneven in their distribution across coastal cities. We provide the first analysis of how changes in exposure ( how and when ) have unfolded over more than a century of coastal urban development in the United States. We do so by leveraging new historical settlement layers from the Historical Settlement Data Compilation for the U.S. (HISDAC-US) to examine building patterns within and between the SLR zones of the conterminous United States since the early twentieth century. Our analysis reveals that SLR zones developed faster and continue to have higher structure density than non-coastal, urban, and inland areas. These patterns are particularly prominent in locations affected by hurricanes. However, density levels in historically less-developed coastal areas are now quickly converging on early settled SLR zones, many of which have reached building saturation. These “saturation effects” suggest that adaptation polices targeting existing buildings and developed areas are likely to grow in importance relative to the protection of previously undeveloped land. 
    more » « less