

Title: Estimating Hourly Population Distribution Patterns at High Spatiotemporal Resolution in Urban Areas Using Geo-Tagged Tweets and Dasymetric Mapping
This paper introduces a spatiotemporal analysis framework for estimating hourly changing population distribution patterns in urban areas using geo-tagged tweets (messages containing users’ geospatial locations), land use data, and dasymetric maps. We collected geo-tagged tweets within San Diego County over one year (2015) using Twitter’s Streaming Application Programming Interfaces (APIs). A semi-manual Twitter content verification procedure was first applied for data cleaning, separating tweets created by humans from those created by non-human users (bots). The next step was to calculate the number of unique Twitter users every hour within census blocks. The final step was to estimate the actual population by transforming the number of unique Twitter users in each census block into estimated population densities, applying spatial and temporal factors using dasymetric maps. The temporal factor was estimated from hourly changes in Twitter messages within San Diego County, CA. The spatial factor was estimated using the dasymetric method with land use maps and 2010 census data. Compared with census data, our methods provide better population estimates for airports, shopping malls, sports stadiums, zoos and parks, and business areas during the daytime.
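Two of the steps described above, counting unique users per census block per hour and redistributing census population with a dasymetric method, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the tweet record layout, the function names `hourly_unique_users` and `dasymetric_allocate`, and the land-use weights are all assumptions made for the example, and the area-times-weight rule is one common dasymetric formulation rather than the one confirmed by the paper. The step that combines the spatial and temporal factors is not sketched because the abstract does not state its formula.

```python
from collections import defaultdict
from datetime import datetime


def hourly_unique_users(tweets):
    """Count distinct users per (census block, clock hour).

    `tweets` is an iterable of (user_id, block_id, iso_timestamp) tuples;
    this record layout is an assumption made for the example.
    """
    users = defaultdict(set)
    for user_id, block_id, iso_timestamp in tweets:
        # Truncate the timestamp to the start of its hour.
        hour = datetime.fromisoformat(iso_timestamp).replace(
            minute=0, second=0, microsecond=0
        )
        users[(block_id, hour)].add(user_id)
    return {key: len(ids) for key, ids in users.items()}


def dasymetric_allocate(zone_population, parcels, class_weights):
    """Split one census zone's population across its land-use parcels.

    Each parcel is (parcel_id, land_use_class, area). Population is shared
    in proportion to area * class weight. The weights are hypothetical,
    not values from the paper.
    """
    scores = {
        parcel_id: area * class_weights.get(land_use, 0.0)
        for parcel_id, land_use, area in parcels
    }
    total = sum(scores.values())
    if total == 0:
        # No habitable land use in this zone: allocate nothing.
        return {parcel_id: 0.0 for parcel_id in scores}
    return {pid: zone_population * s / total for pid, s in scores.items()}


# Illustrative data only.
tweets = [
    ("u1", "b1", "2015-03-01T08:05:00"),
    ("u1", "b1", "2015-03-01T08:40:00"),  # same user, same hour: counted once
    ("u2", "b1", "2015-03-01T08:10:00"),
    ("u2", "b2", "2015-03-01T09:00:00"),
]
counts = hourly_unique_users(tweets)

parcels = [
    ("p1", "residential", 2.0),
    ("p2", "commercial", 1.0),
    ("p3", "water", 5.0),
]
weights = {"residential": 1.0, "commercial": 2.0, "water": 0.0}
allocation = dasymetric_allocate(100, parcels, weights)
```

Using a set per (block, hour) key means a user who tweets several times within the same hour and block is counted once, which matches the "unique Twitter users" wording above. Giving the water class a zero weight keeps population out of uninhabitable parcels however large they are.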
Award ID(s):
1634641
NSF-PAR ID:
10207787
Author(s) / Creator(s):
Date Published:
Journal Name:
11th International Conference on Geographic Information Science (GIScience 2021)
Page Range / eLocation ID:
10:1-10:16
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Seeking spatiotemporal patterns about how citizens interact with the urban space is critical for understanding how cities function. Such interactions were studied in various forms focusing on patterns of people’s presence, action, and transition in the urban environment, which are defined as human-urban interactions in this paper. Using human activity datasets that utilize mobile positioning technology for tracking the locations and movements of individuals, researchers developed stochastic models to uncover preferential return behaviors and recurrent transitional activity structures in human-urban interactions. Ad-hoc heuristics and spatial clustering methods were applied to derive meaningful activity places in those studies. However, the lack of semantic meaning in the recorded locations makes it difficult to examine the details about how people interact with different activity places. In this study, we utilized geographic context-aware Twitter data to investigate the spatiotemporal patterns of people’s interactions with their activity places in different urban settings. To test consistency of our findings, we used geo-located tweets to derive the activity places in Twitter users’ location histories over three major U.S. metropolitan areas: Greater Boston Area, Chicago, and San Diego, where the geographic context of each location was inferred from its closest land use parcel. The results showed striking spatial and temporal similarities in Twitter users’ interactions with their activity places among the three cities. By using entropy-based predictability measures, this study not only confirmed the preferential return behaviors as people tend to revisit a few highly frequented places but also revealed detailed characteristics of those activity places.

     
  2. Situation awareness plays an important role in disaster response and emergency management. Displaying real-time location-based social media messages along with videos, pictures, and hashtags during a disaster event could help first responders improve their situation awareness. A geo-targeted event observation viewer (GeoViewer) was developed for monitoring real-time social media messages in target areas, with four major functions: (1) real-time display of geo-tagged tweets within the target area; (2) interactive mapping functions; (3) spatial, text, and temporal search functions using keywords, spatial boundaries, or dates; and (4) manual labeling and text-tagging of messages. Unlike traditional web GIS maps, the GeoViewer user interface provides an interactive display of multimedia content alongside maps. The front-end user interface for visualizing and querying tweets is built with open source programming libraries, with MongoDB on the server side. GeoViewer is built to assist emergency response and disaster management tasks by tracking disaster event impacts, recovery activities, and residents’ needs in the target region.
  3. Accurate estimation of land use/land cover (LULC) areas is critical, especially over the semi-arid environments of the southwestern United States where water shortage and loss of rangelands and croplands are affecting the food production systems. This study was conducted within the context of providing an improved understanding of New Mexico’s (NM’s) Food–Energy–Water Systems (FEWS) at the county level. The main goal of this analysis was to evaluate the most important LULC classes for NM’s FEWS by implementing standardized protocols of accuracy assessment and providing bias-corrected area estimates of these classes. The LULC data used in the study were based on National Land Cover Database (NLCD) legacy maps of 1992, 2001, 2006, 2011, and 2016. The analysis was conducted using the cloud-based geospatial processing and modeling tools available from the System for Earth Observation Data Access, Processing, and Analysis for Land Monitoring (SEPAL) of the Food and Agriculture Organization. Accuracy assessment, uncertainty analysis, and bias-adjusted area estimates were evaluated by collecting a total of 11,428 reference samples using the Open Foris Collect Earth tool, which provided access to high spatial and temporal resolution images available in Google Earth. The reference samples were allocated using a stratified random sampling approach. The results showed an overall accuracy that ranged from 71% to 100% across the six study counties. The user’s and producer’s accuracies of most LULC classes were about 80% or higher. The obtained bias-adjusted area estimates were higher than those based on pixel counting. The bias-adjusted area estimates showed decreasing trends in grassland and increasing trends in shrubland in four counties (Curry, Roosevelt, Lea, and Eddy) during the 1992–2016 period. Doña Ana county experienced increasing trends in grassland and decreasing trends in shrubland. San Juan county experienced decreasing trends in both grassland and shrubland areas. Cultivated cropland areas showed decreasing trends in the three counties in southeast NM that rely on groundwater resources (Curry, Roosevelt, and Lea). In contrast, cultivated cropland areas showed increasing trends in the other three counties, which rely on surface water or conjunctive use of surface and groundwater resources (San Juan, Doña Ana, and Eddy). The use of SEPAL allowed for efficient assessment and production of more accurate bias-adjusted area estimates compared to using pixel counting. Providing such information can help in understanding the behavior of NM’s food production systems including rangelands and croplands, better monitoring and characterizing NM’s FEWS, and evaluating their behavior under changing environmental and climatic conditions. More effort is needed to evaluate the ability of the NLCD data and other similar products to provide more accurate LULC area estimates at local scales.
  4. Kelp beds provide significant ecosystem services and socioeconomic benefits globally, and prominently in coastal zones of the California Current. Their distributions and abundance, however, vary greatly over space and time. Here, we describe long-term patterns of Giant Kelp (Macrocystis pyrifera) sea surface canopy area off the coast of San Diego County from 1983 through 2019 along with recent patterns of water column nitrate (NO₃⁻) exposure inferred from in situ temperature data in 2014 and 2015 at sites spanning 30 km of the coastline near San Diego, California, USA. Site-specific patterns of kelp persistence and resilience were associated with ocean and climate dynamics, with total sea surface kelp canopy area varying approximately 33-fold over the almost 4 decades (min 0.34 km² in 1984; max 11.25 km² in 2008; median 4.79 km²). Site-normalized canopy areas showed that recent kelp persistence since 2014 was greater at Point Loma and La Jolla, the largest kelp beds off California, than at the much smaller kelp bed off Cardiff. NO₃⁻ exposure was estimated from an 11-month time series of in situ water column temperature collected in 2014 and 2015 at 4 kelp beds, using a relationship between temperature and NO₃⁻ concentration previously established for the region. The vertical position of the 14.5°C isotherm, an indicator of the main thermocline and nutricline, varied across the entire water column at semidiurnal to seasonal frequencies. We use a novel means of quantifying estimated water column NO₃⁻ exposure integrated through time (mol-days m⁻²) adapted from degree days approaches commonly used to characterize thermal exposures. Water column integrated NO₃⁻ exposure binned by quarters of the time series showed strong seasonal differences with highest exposure in Mar - May 2015, lowest exposure in Sep - Dec 2014, with consistently highest exposure off Point Loma. The water column integrated NO₃⁻ signal was filtered to provide estimates of the contribution to total nitrate exposure from high frequency variability (ƒ ≥ 1 cycle 30 hr⁻¹) associated predominantly with internal waves, and low frequency variability driven predominantly by seasonal upwelling. While seasonal upwelling accounted for > 90% of NO₃⁻ exposure across the full year, during warm periods when seasonal upwelling was reduced or absent and NO₃⁻ exposure was low overall, the proportion due to internal waves increased markedly to 84 to 100% of the site-specific total exposure. The high frequency variability associated with internal waves may supply critical nutrient availability during anomalously warm periods. Overall, these analyses support a hypothesis that differences in NO₃⁻ exposure among sites due to seasonal upwelling and higher frequency internal wave forcing contribute to spatial patterns in Giant Kelp persistence in southern California. The study period includes anomalously warm surface conditions and the marine heatwave associated with the “Pacific Warm Blob” superimposed on the seasonal thermal signal and corresponding to the onset of a multi-year decline in kelp canopy area and marked differences in kelp persistence among sites. Our analysis suggests that, particularly during periods of warm surface conditions, variation in NO₃⁻ exposure associated with processes occurring at higher frequencies, including internal waves, can be a significant source of NO₃⁻ exposure to kelp beds in this region. The patterns described here also offer a view of the potential roles of seasonal and higher frequency nutrient dynamics for Giant Kelp persistence in southern California under continuing ocean surface warming and increasing frequency and intensity of marine heatwaves.

     
  5. Recent studies have documented increases in anti-Asian hate throughout the COVID-19 pandemic. Yet relatively little is known about how anti-Asian content on social media, as well as positive messages to combat the hate, have varied over time. In this study, we investigated temporal changes in the frequency of anti-Asian and counter-hate messages on Twitter during the first 16 months of the COVID-19 pandemic. Using the Twitter Data Collection Application Programming Interface, we queried all tweets from January 30, 2020 to April 30, 2021 that contained specific anti-Asian (e.g., #chinavirus, #kungflu) and counter-hate (e.g., #hateisavirus) keywords. From this initial data set, we extracted a random subset of 1,000 Twitter users who had used one or more anti-Asian or counter-hate keywords. For each of these users, we calculated the total number of anti-Asian and counter-hate keywords posted each month. Latent growth curve analysis revealed that the frequency of anti-Asian keywords fluctuated over time in a curvilinear pattern, increasing steadily in the early months and then decreasing in the later months of our data collection. In contrast, the frequency of counter-hate keywords remained low for several months and then increased in a linear manner. Significant between-user variability in both anti-Asian and counter-hate content was observed, highlighting individual differences in the generation of hate and counter-hate messages within our sample. Together, these findings begin to shed light on longitudinal patterns of hate and counter-hate on social media during the COVID-19 pandemic. 