skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Mapping migration regions and their evolution from population-scale family trees: what can they tell us about cultural identities and regions today?
Using a population-scale family tree dataset, this paper proposes a study of migration regions and their evolution in the U.S. between 1789 and 1924. To extract migration events, we use the child ladder approach, which traces family moves based on changes in birthplaces of consecutive children in each individual family. We calculate a time series measure of migration rate and partition the time into optimal periods so that each period has a distinct migration network. We apply community detection to derive migration regions from each network of different periods. We map these regions and use a pair-counting measure to statistically compare the similarity of regions in consecutive time periods. Migration regions reveal the extent to which the strong regional identities we see today, and, in the past, which were rooted in migration. The North/South divide was pervasive not only in the early periods but throughout U.S. history. Migration regions are important for understanding the development of regional and national cultural forms such as music, literature, foodways, and dialects, as well as political divisions and events.  more » « less
Award ID(s):
2215568
PAR ID:
10418209
Author(s) / Creator(s):
; ; ;
Editor(s):
Moncla, Ludovic; Martins, Bruno; McDonough, Katherine
Date Published:
Journal Name:
Proceedings of the 6th ACM SIGSPATIAL International Workshop on Geospatial Humanities
Page Range / eLocation ID:
24 to 27
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The quality and representativeness of longitudinal datasets play a central role in historical migration research. In this study, we apply the child-ladder (CL) method to a population-scale family tree dataset to analyze U.S. interstate family migration from 1850 to 1920. The CL method infers moves from changes in birthplaces between successive children, allowing for more precise dating of migration events. However, it is limited to families with at least two children. To evaluate the representativeness and utility of family trees for migration research, we compare the CL data to the IPUMS Multigenerational Longitudinal Panel (MLP), which tracks household moves across census decades and serves as a proxy for broader population migration. The CL data reveal higher migration rates, suggesting a likely closer approximation to migration levels in the overall population. Also, by capturing intercensal and return migrations, the CL method provide a detailed view of migration patterns across space and time. Despite differences in migration rates, both datasets reveal similar regional migration structures, especially in the earlier periods. These findings show that population-scale family trees when analyzed using the CL method, offer a valuable complement to linked census data by enhancing our understanding of long-term U.S. migration patterns and regional divisions. 
    more » « less
  2. We introduce a novel method for summarization of whiteboard lecture videos using key handwritten content regions. A deep neural network is used for detecting bounding boxes that contain semantically meaningful groups of handwritten content. A neural network embedding is learnt, under triplet loss, from the detected regions in order to discriminate between unique handwritten content. The detected regions along with embeddings at every frame of the lecture video are used to extract unique handwritten content across the video which are presented as the video summary. Additionally, a spatiotemporal index is constructed from the video which records the time and location of each individual summary region in the video which can potentially be used for content-based search and navigation. We train and test our methods on the publicly available AccessMath dataset. We use the DetEval scheme to benchmark our summarization by recall of unique ground truth objects (92.09%) and average number of summary regions (128) compared to the ground truth (88). 
    more » « less
  3. Abstract Neuroimaging in the preclinical phase of Alzheimer’s disease provides information crucial to early intervention, particularly in people with a high genetic risk. Metabolic network modularity, recently applied to the study of dementia, is increased in Alzheimer’s disease patients compared with controls, but network modularity in cognitively unimpaired elderly with various risks of developing Alzheimer’s disease needs to be determined. Based on their 5-year cognitive progression, we stratified 117 cognitively normal participants (78.3 ± 4.0 years of age, 52 women) into three age-matched groups, each with a different level of risk for Alzheimer’s disease. From their fluorodeoxyglucose PET we constructed metabolic networks, evaluated their modular structures using the Louvain algorithm, and compared them between risk groups. As the risk for Alzheimer’s disease increased, the metabolic connections among brain regions weakened and became more modular, indicating network fragmentation and functional impairment of the brain. We then set out to determine the correlation between regional brain metabolism, particularly in the modules derived from the previous analysis, and the regional expression of Alzheimer-risk genes in the brain, obtained from the Allen Human Brain Atlas. In all risk groups of this elderly population, the regional brain expression of most Alzheimer-risk genes showed a strong correlation with brain metabolism, particularly in the module that corresponded to regions of the brain that are affected earliest and most severely in Alzheimer’s disease. Among the genes, APOE and CD33 showed the strongest negative correlation and SORL1 showed the strongest positive correlation with brain metabolism. The Pearson correlation coefficients remained significant when contrasted against a null-hypothesis distribution of correlation coefficients across the whole transcriptome of 20 736 genes (SORL1: P = 0.0130; CD33, P = 0.0136; APOE: P = 0.0093). The strong regional correlation between Alzheimer-related gene expression in the brain and brain metabolism in older adults highlights the role of brain metabolism in the genesis of dementia. 
    more » « less
  4. Abstract The recent intensification of floods and droughts in the Fraser River Basin (FRB) of British Columbia has had profound cultural, ecological, and economic impacts that are expected to be exacerbated further by anthropogenic climate change. In part due to short instrumental runoff records, the long-term stationarity of hydroclimatic extremes in this major North American watershed remains poorly understood, highlighting the need to use high-resolution paleoenvironmental proxies to inform on past streamflow. Here we use a network of tree-ring proxy records to develop 11 subbasin-scale, complementary flood- and drought-season reconstructions, the first of their kind. The reconstructions explicitly target management-relevant flood and drought seasons within each basin, and are examined in tandem to provide an expanded assessment of extreme events across the FRB with immediate implications for water management. We find that past high flood-season flows have been of greater magnitude and occurred in more consecutive years than during the observational record alone. Early 20th century low flows in the drought season were especially severe in both duration and magnitude in some subbasins relative to recent dry periods. Our Fraser subbasin-scale reconstructions provide long-term benchmarks for the natural flood and drought variability prior to anthropogenic forcing. These reconstructions demonstrate that the instrumental streamflow records upon which current management is based likely underestimate the full natural magnitude, duration, and frequency of extreme seasonal flows in the FRB, as well as the potential severity of future anthropogenically forced events. 
    more » « less
  5. Entrepreneurial ecosystem researchers generally focus on the few dominant factors affecting entrepreneurship. Insufficient attention has been paid to the interdependencies among regional conditions within an entrepreneurial ecosystem. We focus on the collective effects of factors for regional biomedical entrepreneurship. We use the fuzzy-set qualitative comparative analysis (fsQCA) method to identify sets of regional conditions promoting biomedical entrepreneurship in all 381 U.S. metropolitan areas. The results indicate three configurations contributing to high levels of regional biomedical entrepreneurship: the first one combines public sector biomedical R&D, biomedical patents, and human capital, thus stressing science conditions and related human capital; the second combines public sector biomedical R&D, biomedical patents, clinical trials, and venture capital, thus placing more emphasis on the regional infrastructure sustaining entrepreneurial activity; the third combines private sector biomedical R&D, biomedical patents, human capital, per capita income, population density, and venture capital, thus emphasizing the private sector’s role on boosting regional biomedical entrepreneurship. There is no single recipe for a region to increase its level of biomedical entrepreneurship. 
    more » « less