<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Predicting Flash Floods in the Dallas-Fort Worth Metroplex Using Workflows and Cloud Computing</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>09/01/2021</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10301314</idno>
					<idno type="doi">10.1109/eScience51609.2021.00050</idno>
					<title level='j'>2021 IEEE 17th International Conference on eScience (eScience)</title>
<idno></idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Eric Lyons</author><author>Dong-Jun Seo</author><author>Sunghee Kim</author><author>Hamideh Habibi</author><author>George Papadimitriou</author><author>Ryan Tanaka</author><author>Ewa Deelman</author><author>Michael Zink</author><author>Anirban Mandal</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Accurate and timely prediction of flash flooding events can be a very useful tool for stormwater officials and first responders. Having lead time with which to issue evacuation directives, to close flood prone roadways, to deploy rescue gear and personnel, and to fortify areas against flooding is essential to minimize property damage and risk of casualties. In this poster, we are presenting a flash flooding prediction workflow based on the Hydrology Lab-Research Distributed Hydrologic Model (HL-RDHM). This workflow leverages cloud computing and the Pegasus Workflow Management System to provide continuous high resolution flood predictions for the Dallas-Fort Worth Metroplex area in North Texas, and can be easily expanded to other regions.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>The Research Distributed Hydrologic Model (RDHM) <ref type="bibr">[1]</ref>  <ref type="bibr">[2]</ref> was the byproduct of a collaborative effort between several National Oceanic and Atmospheric Administration (NOAA) <ref type="bibr">[3]</ref> and National Weather Service (NWS) <ref type="bibr">[4]</ref> hydrology research laboratories. This model was developed to improve streamflow predictions in streams and rivers and to improve flash flood forecasting by incorporating evolving estimates of relevant parameters, such as soil moisture, soil temperature, surface permeability, and vegetation, in addition to the primary forcing mechanism, rainfall. Since its inception in 2008, local domain knowledge has improved. In the Dallas-Fort Worth (DFW) metroplex in North Texas, high resolution Lidar based surveys have enabled the creation of accurate topology mappings of riverbeds and man-made drainage systems. Additionally, urban surface changes have been well documented thanks to such mappings. Improvements of gridded rainfall estimates <ref type="bibr">[5]</ref> have the added to the model accuracy. The RDHM model has traditionally been run on a single, powerful dedicated server. In an effort to improve scalability as resolutions and domain sizes increase, and to tailor the output for stakeholder decision support, we have optimized the source code, containerized the model for use in public and educational clouds, and adopted the Pegasus Workflow Management System <ref type="bibr">[6]</ref> to manage the workflow executions. The RDHM workflow (depicted in Figure <ref type="figure">1</ref>) was developed using the Pegasus 5.0 Python API. The workflow is executed in an ExoGENI <ref type="bibr">[7]</ref> cloud deployment orchestrated by the Mobius multi-cloud provisioning system <ref type="bibr">[8]</ref> and Pegasus on a continual basis using the latest precipitation data provided by two sources: the Next Generation Weather Radar (NEXRAD) system and a network of Doppler radars, operated by the Collaborative Adaptive Sensing of the Atmosphere (CASA) <ref type="bibr">[9]</ref> partnership. This data is combined and fed into the RDHM flood model, along with a multitude of input parameters derived from high-resolution topography, land cover, land use, and soil data of the DFW region, and other hydrologic and climatological data. To ensure that the RDHM flood model has access to best estimates of the current hydrologic conditions, its output from the previous run of the workflow is also passed in as an input, forming a feedback loop over all consecutive runs of this workflow. Once complete, the RDHM model outputs streamflow, runoff, water depth, and return period estimates valuable for flash flood forecasting, along with the updated parameterizations to be used for the next instantiation of the model. This output data is formatted as XMRG <ref type="bibr">[10]</ref>, a legacy binary format in the Hydrological Rainfall Analysis </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. VISUALIZING RDHM DATA</head><p>As GeoJSON files are generated from each workflow run on a minute by minute basis, they can be made available for viewing on the web. Streamflow, return period, and runoff rates are some of the features which can be visualized and interact with on the map.</p><p>Streamflow. Utilizing the rainfall data provided by the radar systems and layers of hydrologic parameters, the RDHM model can simulate stream flows. This is illustrated in Figure <ref type="figure">2</ref> where the colored lines represent stream flows through river basins. Typically, as seen on the lower right portion, the streams flow within narrow, well defined channels. But in the case of floods, as in the center of the image, the streams exceed its banks and can flow over much larger areas.</p><p>Flooding Return Period. Return period is the time interval that you would expect a flood of some size to occur on Fig. <ref type="figure">4</ref>: RDHM generated runoff rate average at a given location. Return periods are very often used in engineering and zoning codes, dictating the type of events for which infrastructure must be designed to withstand. Indications that an event may exceed a 50 year return period for example, would be an indication that property damage may be likely to occur. The colored regions in Figure <ref type="figure">3</ref> show the estimated return period of flooding in the area, and based on these areas automated alerts can be sent to stormwater managers.</p><p>Runoff rate and Water Depth. Depending on how wet or dry the soil conditions are, a certain amount of rainfall infiltrates into the ground. Runoff rate is the rainfall rate subtracted by the infiltration rate. When the soil is saturated, runoff rate is as high as rainfall rate itself which is more likely to produce flooding than in dryer soil conditions. This is depicted in Figure <ref type="figure">4</ref>. Runoff is a primary indicator for high water, particularly in low lying areas, where the water will accumulate with a lag from the rainfall itself. In conjunction with topography, the model accumulates runoff and calculates water depth explicitly. This is very important for the protection of motorists who may become stuck should they attempt to cross these areas.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>IV. CONCLUSION -COMMUNITY IMPACT</head><p>The results of the RDHM pipeline in the cloud using Pegasus are displayed live to over 1000 emergency managers and first responders in the </p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="259" xml:id="foot_0"><p>2021 IEEE 17th International Conference on eScience (eScience) 978-1-6654-0361-0/21/$31.00 &#169;2021 IEEE DOI 10.1109/eScience51609.2021.00050 2021 IEEE 17th International Conference on eScience (eScience) | 978-1-6654-0361-0/21/$31.00 &#169;2021 IEEE | DOI: 10.1109/eScience51609.2021.00050</p></note>
		</body>
		</text>
</TEI>
