Title: Assessing Trustworthiness of Crowdsourced Flood Incident Reports Using Waze Data: A Norfolk, Virginia Case Study
Climate change and sea-level rise are increasingly leading to higher and prolonged high tides, which, in combination with the growing intensity of rainfall and storm surges, and insufficient drainage infrastructure, result in frequent recurrent flooding in coastal cities. There is a pressing need to understand the occurrence of roadway flooding incidents in order to enact appropriate mitigation measures. Agency data for roadway flooding events are scarce and resource-intensive to collect. Crowdsourced data can provide a low-cost alternative for mapping roadway flood incidents in real time; however, their reliability is questionable. This research demonstrates a framework for assessing the trustworthiness of crowdsourced flood incident data in a case study of Norfolk, Virginia. Publicly available (but spatially limited) flood incident data from the city, in combination with different environmental and topographical factors, are used to create a logistic regression model to predict the probability of roadway flooding at any location on the roadway network. The prediction accuracy of the model was found to be 90.5%. When applying this model to crowdsourced Waze flood incident data, 71.7% of the reports were predicted to be trustworthy. This study demonstrates the potential for using Waze incident report data for roadway flooding detection, providing a framework for cities to identify trustworthy reports in real time to enable rapid situation assessment and mitigation to reduce incident impact.
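As a rough illustration of the scoring step described in the abstract, the sketch below shows how a fitted logistic regression could turn the conditions at a report's location into a flood probability and then into a trustworthy / not trustworthy flag. Everything specific here is an assumption made for illustration: the feature names (`elevation_m`, `rain_mm_hr`, `tide_m`), the coefficient values, and the 0.5 decision threshold are hypothetical and are not taken from the study, which does not list its predictors or fitted coefficients in this record.

```python
import math

# Hypothetical coefficients, for illustration only. The study's actual
# predictors and fitted values are not given in this record.
COEFFS = {
    "intercept": -2.0,
    "elevation_m": -0.8,   # assumed: higher ground lowers flood odds
    "rain_mm_hr": 0.05,    # assumed: heavier rain raises flood odds
    "tide_m": 1.2,         # assumed: higher tide raises flood odds
}


def flood_probability(features, coeffs=COEFFS):
    """Logistic model: p = 1 / (1 + exp(-(b0 + sum(b_i * x_i))))."""
    z = coeffs["intercept"] + sum(coeffs[name] * value
                                  for name, value in features.items())
    return 1.0 / (1.0 + math.exp(-z))


def is_trustworthy(features, threshold=0.5):
    """Flag a crowdsourced report as trustworthy when the modeled flood
    probability at its location meets the (assumed) threshold."""
    return flood_probability(features) >= threshold


# Two made-up reports: a low-lying road in heavy rain at high tide,
# and a road on high ground in dry weather.
low_road = {"elevation_m": 0.5, "rain_mm_hr": 40.0, "tide_m": 1.5}
high_road = {"elevation_m": 6.0, "rain_mm_hr": 0.0, "tide_m": 0.3}

for name, report in [("low_road", low_road), ("high_road", high_road)]:
    print(name, round(flood_probability(report), 3), is_trustworthy(report))
```

In practice the coefficients would come from fitting the model to the city's recorded flood incidents, and the threshold would be chosen from the model's validation results rather than fixed at 0.5.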
Award ID(s):
1735587
PAR ID:
10291591
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Transportation Research Record: Journal of the Transportation Research Board
ISSN:
0361-1981
Page Range / eLocation ID:
036119812110312
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The number of emergencies has increased over the years with the growth in urbanization. This pattern has overwhelmed emergency services that have limited resources and demands the optimization of response processes. It is partly due to the traditional ‘reactive’ approach of emergency services to collecting data about incidents, where a source initiates a call to the emergency number (e.g., 911 in the U.S.), delaying and limiting the potentially optimal response. Crowdsourcing platforms such as Waze provide an opportunity to develop a rapid, ‘proactive’ approach to collecting data about incidents through crowd-generated observational reports. However, the reliability of reporting sources and the spatio-temporal uncertainty of the reported incidents challenge the design of such a proactive approach. Thus, this paper presents a novel method for emergency incident detection using noisy crowdsourced Waze data. We propose a principled computational framework based on Bayesian theory to model the uncertainty in the reliability of crowd-generated reports and their integration across space and time to detect incidents. Extensive experiments using data collected from Waze and the officially reported incidents in Nashville, Tennessee in the U.S. show our method can outperform strong baselines for both F1-score and AUC. The application of this work provides an extensible framework to incorporate different noisy data sources for proactive incident detection to improve and optimize emergency response operations in our communities.
  2. Crowdsourced data have been finding practical use for enhancing situational awareness during disasters. While recent studies have shown promising results regarding the potential of crowdsourced data (such as user-generated flood reports) for flash flood mapping and situational awareness, little attention has been paid to data imbalance issues that could introduce biases in data and assessment. To address this gap, in this study, we examine biases present in crowdsourced reports to identify data imbalance with a goal of improving disaster situational awareness. Three biases are examined: sample bias, spatial bias, and demographic bias. To examine these biases, we analyzed reported flooding from 3-1-1 reports (a citizen hotline that allows the community to report problems such as flooding) and Waze reports (from a GPS navigation app that allows drivers to report flooded roads) with respect to FEMA damage data collected in the aftermath of Tropical Storm Imelda in Harris County, Texas, in 2019 and Hurricane Ida in New York City in 2021. First, sample bias is assessed by expanding the flood-related categories in 3-1-1 reports. Integrating other flooding-related topics into the Global Moran's I and Local Indicator of Spatial Association (LISA) revealed more communities that were impacted by floods. To examine spatial bias, we perform the LISA and BI-LISA tests on the data sets (FEMA damage, 3-1-1 reports, and Waze reports) at the census tract level and census block group level. By looking at two geographical aggregations, we found that the larger spatial aggregations, census tracts, show less data imbalance in the results. Through a regression analysis, we found that 3-1-1 reports and Waze reports have data imbalance limitations in areas where minority populations and single-parent households reside.
The findings of this study advance understanding of data imbalance and biases in crowdsourced datasets that are increasingly used for disaster situational awareness. By addressing data imbalance issues, researchers and practitioners can proactively mitigate biases in crowdsourced data and prevent biased and inequitable decisions and actions.
  3. Urban flooding is an increasing threat to cities and resident well‐being. The Federal Emergency Management Agency (FEMA) typically reports losses attributed to flooding which result from a stream overtopping its banks, discounting impacts of higher frequency, lower impact flooding that occurs when precipitation intensity exceeds the capacity of a drainage system. Despite its importance, the drivers of street flooding can often be difficult to identify, given street flooding data scarcity and the multitude of storm, built environment, and social factors involved. To address this knowledge gap, this study uses 922 street flooding reports made to the city in Denver, Colorado, USA from 2000 to 2019 in coordination with rain gauge network data and Census tract information to improve understanding of spatiotemporal drivers of urban flooding. An initial threshold analysis using rainfall intensity to predict street flooding had performance close to random chance, which led us to investigate other drivers. A logistic regression describing the probability of a storm leading to a flood report showed the strongest predictors of urban flooding were, in descending order, maximum 5‐min rainfall intensity, population density, storm depth, storm duration, median tract income, and stormwater pipe density. The logistic regression also showed that rainfall intensity and population density are nearly equally important in determining the likelihood of a flood report. In addition, topographic wetness index values at locations of flooding reports were higher than at randomly selected points. A linear regression predicting the number of reports per area identified percent impervious as the single most important predictor. Our methodologies can be used to better inform urban flood awareness, response, and mitigation and are applicable to any city with flood reports and spatial precipitation data.
  4. Decision making in utilities, municipalities, and energy companies depends on accurate and trustworthy weather information and predictions. Recently, crowdsourced personal weather stations (PWS) have been increasingly used to provide a higher spatial and temporal resolution of weather measurements. However, tools and methods to ensure the trustworthiness of the crowdsourced data in real time are lacking. In this paper, we present a Reputation System for Crowdsourced Rainfall Networks (RSCRN) to assign trust scores to personal weather stations in a region. Using real PWS data from the Weather Underground service in the high flood risk region of Norfolk, Virginia, we evaluate the performance of the proposed RSCRN. The proposed method is able to converge to a confident trust score for a PWS within 10 to 20 observations after installation. Collectively, the results indicate that the trust score derived from the RSCRN can reflect the collective measure of trustworthiness of the PWS, ensuring both useful and trustworthy data for modeling and decision-making in the future.
  5. Coastal highways along narrow barrier islands are vulnerable to flooding due to ocean and bay-side events, which create hazardous travel conditions and may restrict access to surrounding communities. This study investigates the vulnerability of a segment of highway passing through the Pea Island National Wildlife Refuge in the Outer Banks, North Carolina, USA. Publicly available data, computational modeling, and field observations of shoreline change are synthesized to develop fragility models for roadway flooding and marsh conditions. At 99% significance, peak daily water levels and significant wave heights at nearby monitoring stations are determined as significant predictors of roadway closure due to flooding. Computational investigations of bay-side storms identify peak water levels and the buffer distance between the estuarine shoreline and the roadway as significant predictors of roadway transect flooding. To assess the vulnerability of the marsh in the buffer area, a classification scheme is proposed and used to evaluate marsh conditions due to long-term and episodic (storm) stressors. Marsh vulnerability is found to be predicted by the long-term erosion rate and distance from the shoreline to the 5 m depth contour of the nearby flood tidal channel. The results indicate the importance of erosion mitigation and marsh conservation to enhance the resilience of coastal transportation infrastructure. 