Title: Data-Driven Flood Alert System (FAS) Using Extreme Gradient Boosting (XGBoost) to Forecast Flood Stages
Heavy rainfall leads to severe flooding problems with catastrophic socio-economic impacts worldwide. Hydrologic forecasting models have been applied to provide alerts of extreme flood events and reduce damage, yet they are still subject to many uncertainties due to the complexity of hydrologic processes and errors in the forecasted timing and intensity of the floods. This study demonstrates the efficacy of using eXtreme Gradient Boosting (XGBoost) as a state-of-the-art machine learning (ML) model to forecast gauge stage levels at a 5-min interval with various look-out time windows. A flood alert system (FAS) built upon the XGBoost models is evaluated on two historical flooding events for a flood-prone watershed in Houston, Texas. The predicted stage values from the FAS are compared with observed values and show good performance as measured by two statistical metrics, root-mean-square error (RMSE) and Kling-Gupta efficiency (KGE). This study further compares the performance of two scenarios with different input data settings for the FAS: (1) using the data from the gauges within the study area only and (2) including the data from additional gauges outside of the study area. The results suggest that models that use the gauge information within the study area only (Scenario 1) are sufficient and advantageous in terms of their accuracy in predicting the arrival times of the floods. One of the benefits of the FAS outlined in this study is that it can run in a continuous mode to automatically detect floods, without the external starting trigger that conventional event-based FASs usually require. This paper illustrates a data-driven FAS framework as a prototype that stakeholders can use based solely on their gauging information for local flood warning and mitigation practices.
Award ID(s):
1832065 1940163
NSF-PAR ID:
10347490
Journal Name:
Water
Volume:
14
Issue:
5
ISSN:
2073-4441
Page Range / eLocation ID:
747
Sponsoring Org:
National Science Foundation
More Like this
  1. Flooding during extreme weather events damages critical infrastructure and property and threatens lives. Hurricane María devastated Puerto Rico (PR) on 20 September 2017. Sixty-four deaths were directly attributable to the flooding. This paper describes the development of a hydrologic model using the Gridded Surface Subsurface Hydrologic Analysis (GSSHA), capable of simulating flood depth and extent for the Añasco coastal floodplain in western PR. The purpose of the study was to develop a numerical model to simulate flooding from extreme weather events and to evaluate the impacts on critical infrastructure and communities; Hurricane María is used as a case study. GSSHA was calibrated for Irma, a Category 3 hurricane, which struck the northeastern corner of the island on 7 September 2017, two weeks before Hurricane María. The upper Añasco watershed was calibrated using United States Geological Survey (USGS) stream discharge data. The model was validated using a storm of similar magnitude on 11–13 December 2007. Owing to the damage sustained by PR’s WSR-88D weather radar during Hurricane María, rainfall was estimated in this study using the Weather Research and Forecasting (WRF) model. Flooding in the coastal floodplain during Hurricane María was simulated using three methods: (1) use of the observed discharge hydrograph from the upper watershed as an inflow boundary condition for the coastal floodplain area, along with the WRF rainfall in the coastal floodplain; (2) use of WRF rainfall to simulate runoff in the upper watershed and coastal floodplain; and (3) the same as approach (2), but with bias-corrected WRF rainfall. Flooding results were compared with forty-two values of flood depth obtained during face-to-face interviews with residents of the affected communities. Impacts on critical infrastructure (water, electric, and public schools) were evaluated, assuming any structure exposed to 20 cm or more of flooding would sustain damage. Calibration equations were also used to improve flood depth estimates. Our model included the influence of storm surge, which we found to have a minimal effect on flood depths within the study area. Water infrastructure was more severely impacted by flooding than electrical infrastructure. From these findings, we conclude that the model developed in this study can be used with sufficient accuracy to identify infrastructure affected by future flooding events.
  2. Among the different types of natural disasters, floods are the most devastating, widespread, and frequent. Floods account for approximately 30% of the total loss caused by natural disasters. Accurate flood-risk mapping is critical in reducing such damages by correctly predicting the extent of a flood when coupled with rain and stage gage data, supporting emergency-response planning, developing land use plans and regulations with regard to the construction of structures and infrastructures, and providing damage assessment in both spatial and temporal measurements. The reliability and accuracy of such flood assessment maps depend on the quality of the digital elevation model (DEM) in flood conditions. This study investigates the quality of an Unmanned Aerial Vehicle (UAV)-based DEM for spatial flood assessment mapping and for evaluating the extent of a flood event in Princeville, North Carolina during Hurricane Matthew. The challenges and problems of on-demand DEM production during a flooding event are discussed. An accuracy analysis was performed by comparing the water surface extracted from the UAV-derived DEM with the water surface/stage obtained using the nearby US Geological Survey (USGS) stream gauge station and LiDAR data.
  3. Abstract

    Tide gauge water levels are commonly used as a proxy for flood incidence on land. These proxies are useful for projecting how sea‐level rise (SLR) will increase the frequency of coastal flooding. However, tide gauges do not account for land‐based sources of coastal flooding, and therefore flood thresholds and the proxies derived from them likely underestimate the current and future frequency of coastal flooding. Here we present a new sensor framework for measuring the incidence of coastal floods that captures both subterranean and land‐based contributions to flooding. The low‐cost, open‐source sensor framework consists of a storm drain water level sensor, roadway camera, and wireless gateway that transmit data in real time. During 5 months of deployment in the Town of Beaufort, North Carolina, 24 flood events were recorded. Twenty‐five percent of those events were driven by land‐based sources—rainfall, combined with moderate high tides and reduced capacity in storm drains. Consequently, we find that flood frequency is higher than that suggested by proxies that rely exclusively on tide gauge water levels for determining flood incidence. This finding likely extends to other locations where stormwater networks are at a reduced drainage capacity due to SLR. Our results highlight the benefits of instrumenting stormwater networks directly to capture multiple drivers of coastal flooding. More accurate estimates of the frequency and drivers of floods in low‐lying coastal communities can enable the development of more effective long‐term adaptation strategies.
  4. Abstract

    Changes in the severity and likelihood of flooding events are typically associated with changes in the intensity and frequency of streamflows, but temporal adjustments in a river's conveyance capacity can also contribute to shifts in flood hazard. To assess the relative importance of channel conveyance to flood hazard, we compare variations in channel conveyance to variations in the flow magnitude of moderate (1.2‐year) floods at 50 river gauges in western Washington State between 1930 and 2020. In unregulated rivers, moderate floods have increased across the region, but in regulated rivers this trend is suppressed and in some cases reversed. Variations in channel conveyance are ubiquitous, but the magnitude and timing of adjustments are not regionally uniform. At 40% of gauges, conveyance changes steadily and gradually. More often, however, conveyance variability is nonlinear, consisting of multidecadal oscillations (36% of gauges), rapid changes due to unusually large sediment‐supply events (14% of gauges), and increases or decreases to conveyance following flow regulation (10% of gauges). The relative importance of conveyance variability for flood risk depends on the mode of adjustment; in certain locations with historic landslides, extreme floods, and flow regulation, the influence of conveyance changes on flood risk matches or exceeds that of streamflow at the same site. Flood hazard management would benefit from incorporating historic long‐term and short‐term conveyance changes in predictions of future flood hazard variability.
  5. Abstract

    The Mississippi River basin drains nearly one-half of the contiguous United States, and its rivers serve as economic corridors that facilitate trade and transportation. Flooding remains a perennial hazard on the major tributaries of the Mississippi River basin, and reducing the economic and humanitarian consequences of these events depends on improving their seasonal predictability. Here, we use climate reanalysis and river gauge data to document the evolution of floods on the Missouri and Ohio Rivers—the two largest tributaries of the Mississippi River—and how they are influenced by major modes of climate variability centered in the Pacific and Atlantic Oceans. We show that the largest floods on these tributaries are preceded by the advection and convergence of moisture from the Gulf of Mexico following distinct atmospheric mechanisms, where Missouri River floods are associated with heavy spring and summer precipitation events delivered by the Great Plains low-level jet, whereas Ohio River floods are associated with frontal precipitation events in winter when the North Atlantic subtropical high is anomalously strong. Further, we demonstrate that the El Niño–Southern Oscillation can serve as a precursor for floods on these rivers by mediating antecedent soil moisture, with Missouri River floods often preceded by a warm eastern tropical Pacific (El Niño) and Ohio River floods often preceded by a cool eastern tropical Pacific (La Niña) in the months leading up to peak discharge. We also use recent floods in 2019 and 2021 to demonstrate how linking flood hazard to sea surface temperature anomalies holds potential to improve seasonal predictability of hydrologic extremes on these rivers.