Localizing pipe leaks is a significant challenge for water utilities worldwide. Pipe leaks in water distribution systems (WDSs) can cause the loss of a large amount of treated water, leading to pressure loss, increased energy costs, and contamination risks. What makes localizing pipe leaks challenging is the underground location of the water pipes and the similarity in impact on hydraulic properties (e.g., pressure, flow) due to leaks as compared to the effects of WDS operational changes. Physical methods to locate leaks are expensive, intrusive, and heavily localized. Computational approaches such as data-driven machine learning models provide an economical alternative to physical methods. Machine learning models are readily available and easily customizable to most problems; therefore, there is an increasing trend in their application for leak localization in WDSs. While several studies have applied machine learning models to localize leaks in single pipes and small test networks, these studies have yet to thoroughly test these models against the different complexities of leak localization problems, and hence their applicability to real-world WDSs is still unclear. The simplicity of the WDSs, the oversimplification of leak characteristics, and the lack of consideration of modeling and measuring device uncertainties adopted in most of these studies make the scalability of their proposed approaches questionable to real-world WDSs. Our study addresses this issue by devising four study cases of different complexity that account for realistic leak characteristics and model- and measuring device-related uncertainties. Two established machine learning models—multilayer perceptron (MLP) and convolutional neural network (CNN)—are trained and tested for their ability to localize the leaks and predict their sizes for each of the four study cases using different simulated hydraulic inputs. In addition, the potential benefit of combining different types of hydraulic data as inputs to the machine learning models in localizing leaks is also explored. Pressure and flow, two common hydraulic measurements, are used as inputs to the machine learning models. Further, the impact of single and multiple time point input in leak localization is also investigated. The results for the L-Town network indicate good accuracies for both the models for all study cases, with CNN consistently outperforming MLP.
more »
« less
This content will become publicly available on July 1, 2026
Pressure Sensor Placement in Water Distribution Systems Using Leak Resolution–Specific Surrogates for Leak Localization
The effectiveness of model-based leak localization methods in water distribution systems (WDSs), including optimization-based and machine learning approaches, significantly depends on the quality and quantity of input data. Pressure data, easily accessible due to nonintrusive sensor installation and maintenance, are commonly used. However, economic constraints limit the number of sensors in WDSs, highlighting the need for strategic sensor placement to enhance data quality. This study introduces a novel, method-independent sensor placement strategy that integrates cluster definitions (leak resolution) with intuitive surrogates for localization performance, addressing the limitations of existing methods reliant on complex, nonintuitive metrics. We propose the Euclidean cluster-based optimal placement of sensors (ECOPS) approach, which employs sensitivity and uniqueness as fundamental signal properties to guide sensor placement. Validation tests within a comprehensive real-world WDS demonstrate that ECOPS outperforms existing surrogate-based approaches and improves the performance of current sensors installed for leak characterization. These findings provide compelling evidence of ECOPS’s potential for enhancing pressure sensor placement, thereby improving leak localization in WDS applications.
more »
« less
- Award ID(s):
- 1763028
- PAR ID:
- 10599576
- Publisher / Repository:
- ASCE
- Date Published:
- Journal Name:
- Journal of Water Resources Planning and Management
- Volume:
- 151
- Issue:
- 7
- ISSN:
- 0733-9496
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Water distribution systems (WDSs) face a significant challenge in the form of pipe leaks. Pipe leaks can cause loss of a large amount of treated water, leading to pressure loss, increased energy costs, and contamination risks. Locating pipe leaks has been a constant challenge for water utilities and stakeholders due to the underground location of the pipes. Physical methods to detect leaks are expensive, intrusive, and heavily localized. Computational approaches provide an economical alternative to physical methods. Data-driven machine learning-based computational approaches have garnered growing interest in recent years to address the challenge of detecting pipe leaks in WDSs. While several studies have applied machine learning models for leak detection on single pipes and small test networks, their applicability to the real-world WDSs is unclear. Most of these studies simplify the leak characteristics and ignore modeling and measuring device uncertainties, which makes the scalability of their approaches questionable to real-world WDSs. Our study addresses this issue by devising four study cases that account for the realistic leak characteristics (multiple, multi-size, and randomly located leaks) and incorporating noise in the input data to account for the model- and measuring device- related uncertainties. A machine learning-based approach that uses simulated pressure as input to predict both location and size of leaks is proposed. Two different machine learning models: Multilayer Perceptron (MLP) and Convolutional Neural Network (CNN), are trained and tested for the four study cases, and their performances are compared. The precision and recall results for the L-Town network indicate good accuracies for both the models for all study cases, with CNN generally outperforming MLP.more » « less
-
Leakages in water distribution networks (WDNs) are estimated to globally cost 39 billion USD per year and cause water and revenue losses, infrastructure degradation, and other cascading effects. Their impacts can be prevented and mitigated with prompt identification and accurate leak localization. In this work, we propose the leakage identification and localization algorithm (LILA), a pressure-based algorithm for data-driven leakage identification and model-based localization in WDNs. First, LILA identifies potential leakages via semi-supervised linear regression of pairwise sensor pressure data and provides the location of their nearest sensors. Second, LILA locates leaky pipes relying on an initial set of candidate pipes and a simulation-based optimization framework with iterative linear and mixed-integer linear programming. LILA is tested on data from the L-Town network devised for the Battle of Leakage Detection and Isolation Methods. Results show that LILA can identify all leakages included in the data set and locate them within a maximum distance of 374 m from their real location. Abrupt leakages are identified immediately or within 2 h, while more time is required to raise alarms on incipient leakages.more » « less
-
Pulmonary air leak is the most common complication of lung surgery, with air leaks that persist longer than 5 days representing a major source of post-surgery morbidity. Clinical management of air leaks is challenging due to limited methods to precisely locate and assess leaks. Here, we present a sound-guided methodology that enables rapid quantitative assessment and precise localization of air leaks by analyzing the distinct sounds generated as the air escapes through defective lung tissue. Air leaks often present after lung surgery due to loss of tissue integrity at or near a staple line. Accordingly, we investigated air leak sounds from a focal pleural defect in a rat model and from a staple line failure in a clinically relevant swine model to demonstrate the high sensitivity and translational potential of this approach. In rat and swine models of free-flowing air leak under positive pressure ventilation with intrapleural microphone 1 cm from the lung surface, we identified that: (a) pulmonary air leaks generate sounds that contain distinct harmonic series, (b) acoustic characteristics of air leak sounds can be used to classify leak severity, and (c) precise location of the air leak can be determined with high resolution (within 1 cm) by mapping the sound loudness level across the lung surface. Our findings suggest that sound-guided assessment and localization of pulmonary air leaks could serve as a diagnostic tool to inform air leak detection and treatment strategies during video-assisted thoracoscopic surgery (VATS) or thoracotomy procedures.more » « less
-
CIM: A Novel Clustering-based Energy-Efficient Data Imputation Method for Human Activity RecognitionHuman activity recognition (HAR) is an important component in a number of health applications, including rehabilitation, Parkinson’s disease, daily activity monitoring, and fitness monitoring. State-of-the-art HAR approaches use multiple sensors on the body to accurately identify activities at runtime. These approaches typically assume that data from all sensors are available for runtime activity recognition. However, data from one or more sensors may be unavailable due to malfunction, energy constraints, or communication challenges between the sensors. Missing data can lead to significant degradation in the accuracy, thus affecting quality of service to users. A common approach for handling missing data is to train classifiers or sensor data recovery algorithms for each combination of missing sensors. However, this results in significant memory and energy overhead on resource-constrained wearable devices. In strong contrast to prior approaches, this paper presents a clustering-based approach (CIM) to impute missing data at runtime. We first define a set of possible clusters and representative data patterns for each sensor in HAR. Then, we create and store a mapping between clusters across sensors. At runtime, when data from a sensor are missing, we utilize the stored mapping table to obtain most likely cluster for the missing sensor. The representative window for the identified cluster is then used as imputation to perform activity classification. We also provide a method to obtain imputation-aware activity prediction sets to handle uncertainty in data when using imputation. Experiments on three HAR datasets show that CIM achieves accuracy within 10% of a baseline without missing data for one missing sensor when providing single activity labels. The accuracy gap drops to less than 1% with imputation-aware classification. Measurements on a low-power processor show that CIM achieves close to 100% energy savings compared to state-of-the-art generative approaches.more » « less
An official website of the United States government
