Online monitoring is increasingly essential for the effective management and operation of urban sewer systems, yet resource limitations necessitate careful planning of sensor deployment. This study aims to address the impact of time lags on monitoring point selection in urban drainage systems using unsupervised machine learning techniques. A novel method is introduced to determine the optimal number and placement of sensors in manholes, using cluster analysis informed by simulated time-series data. The proposed methodology involves two sequential stages: the first stage clusters time-series data based on morphology similarity using the time-lagged cross-correlation (TLCC) coefficient, which measures the temporal alignment between datasets. The second stage further refines these clusters by considering magnitude similarity, employing dynamic time warping distance to quantify shape-based similarities and improve clustering accuracy. The proposed approach allows for flexible threshold adjustments to accommodate specific engineering requirements, enabling the design of monitoring strategies tailored to a predetermined number of locations. Furthermore, the study explores the impact of rainfall intensity on sensor placement, providing actionable guidance for sewer managers to improve monitoring efficiency and address urban water management challenges.

  • A novel bilayer iterative clustering approach is proposed to optimize urban drainage monitoring.

  • Time-lagged cross-correlation and dynamic time warping improve clustering precision.

  • The methodology is validated using both a hypothetical network and a real-world case study.

  • Monitoring strategies address resource limitations and adapt to varying rainfall intensities.

The Internet of Things (IoT) has become a vital tool for managing urban drainage systems (UDS), providing real-time monitoring that enables the early detection of potential risks and a comprehensive understanding of sewer conditions (Lee et al. 2016; Edmondson et al. 2018). IoT systems also facilitate the accumulation of long-term dynamic data, which support the evaluation and optimization of sewer network operations (Kang et al. 2012; Bai et al. 2021). To obtain reliable and accurate monitoring data, it is crucial to optimize the placement of monitoring points a key area of focus (Schilperoort et al. 2008; Banik et al. 2015). However, limited human and financial resources often constrain the deployment of monitoring systems (Meijer et al. 2018; Vonach et al. 2018). Thus, the challenge lies in achieving effective monitoring with the fewest possible monitoring points (Banik et al. 2017; Sambito & Freni 2021).

Optimal sensor placement has been extensively studied in related fields, including contamination detection in water distribution networks, surface water monitoring, and flood forecasting (Rathi & Gupta 2014a; Fattoruso et al. 2015; Rathi et al. 2016; Adedoja et al. 2018; Jiang et al. 2020). Meta-heuristic algorithms, such as ant colony optimization, genetic algorithms, and particle swarm optimization, are frequently employed to identify optimal sensor locations (Afshar et al. 2015; Gong et al. 2022; Harif et al. 2023). While these algorithms can navigate complex search spaces to identify near-optimal solutions, they often face challenges related to the computational burden, subjectivity, and reduced robustness in large-scale systems (Rathi & Gupta 2014b; Sharma & Kumar 2022).

Clustering techniques have emerged as an efficient alternative to reduce computational complexity. For example, Perelman and Ostfeld (2012) proposed clustering network structures to identify pollution sources and place monitoring points at cluster centers, followed by optimization within each cluster. This method improves efficiency and provides valuable insights into network topology (Guo et al. 2018; Cardoso et al. 2021). Similarly, information theory has been applied to evaluate the information value and redundancy of potential monitoring points, enhancing decision-making and reducing computational costs (Khorshidi et al. 2018; Brentan et al. 2021). These approaches demonstrate the value of predetermining potential sensor locations before applying optimization algorithms.

Despite advancements in related fields, optimizing monitoring locations in UDS remains underexplored. Existing studies often leverage data-driven machine learning techniques, such as cluster analysis, to analyze time-series data generated by monitoring systems (Li et al. 2020; Li 2021). For instance, Guo et al. (2018) used fuzzy clustering to select monitoring points based on simulated scenarios, while Wang et al. (2023) integrated hydraulic and water quality attributes with decision-making factors to refine clustering methods. However, in UDS, rainfall can experience delays as it flows through pipes and nodes (Talei & Chua 2012; Zhang et al. 2023). Traditional monitoring strategies often overlook these time lags, but accounting for them is vital to accurately tracking the system's dynamic changes and enabling effective responses. Moreover, existing studies on clustering time-series data lack controllability and cannot achieve fine-grained control over the clustering degree in line with the actual engineering needs.

Cluster analysis relies on similarity measures to group objects, making the choice of similarity metric critical (Basaran & Günes 2016). Traditional measures, such as Euclidean distance and Pearson's correlation coefficient, are commonly used but are unsuitable for time-series data with time lags or gaps (Popivanov & Miller 2002; Gontijo et al. 2020; Li et al. 2022). The dynamic time warping (DTW) algorithm addresses these limitations by aligning time-series data flexibly, accounting for temporal delays between measurement points (Dai et al. 2014; Lee et al. 2020). Additionally, time-lagged cross-correlation (TLCC) offers a robust method to quantify alignment and similarity between sequences (Tal et al. 2011; Tóth & Balogh 2012). Both TLCC and DTW distances allow for a more accurate identification of time differences between monitoring points, thus optimizing sensor placement.

This study aims to address the impact of time lags on monitoring point selection in UDS using unsupervised machine learning techniques. A novel bilayer iterative clustering method was developed to analyze the similarity relationships among monitoring points while optimizing their number and placement. The methodology was validated on a virtual sewer network and applied to a real-world case study in Ningbo, China, demonstrating its effectiveness in enhancing monitoring strategies for UDS.

Overview of the proposed framework

This study employs an iterative clustering approach based on the similarity of morphology and magnitude in time-series data to optimize monitoring schemes by adjusting the thresholds of monitoring indicators. The methodology is summarized in the flowchart, as shown in Figure 1, and outlined as follows:
  • (1) Data collection and model development: Data on the pipe network, topography, rainfall, and other relevant factors are collected to develop a drainage network model of the study area using simulation software (InfoWorks ICM).

  • (2) Model calibration and simulation: The model is calibrated and validated using limited existing monitoring data, and simulations are conducted to generate time-series data at all nodes under various rainfall conditions.

  • (3) Threshold customization: Setting appropriate thresholds is crucial for refining the clustering process. The outer threshold, based on the Pearson correlation coefficient, determines the similarity of time series in terms of morphology. The inner threshold, derived from the Euclidean distance formula, further refines the clusters by accounting for differences in magnitude. Both thresholds are adjustable, allowing them to be tailored to the specific characteristics of the data and the goals of the analysis.

  • (4) Morphology similarity clustering: Time series are clustered based on morphology similarity using the TLCC coefficient as the similarity measure. TLCC quantifies the alignment of temporal patterns between two time series while accounting for time delays caused by flow dynamics. The first-layer threshold is applied to group nodes into clusters that exhibit similar temporal patterns, ensuring an appropriate level of clustering resolution.

  • (5) Magnitude similarity clustering: Each morphology-based cluster is further refined by clustering for magnitude similarity, employing DTW distance as the similarity measure. DTW aligns time series by stretching or compressing segments to minimize differences, enabling the identification of nodes with similar magnitude profiles, even if timing variations exist. The second-layer threshold is applied during this stage to further refine the clustering process.

  • (6) Selection of monitoring points: Representative nodes from each refined cluster are selected as optimal monitoring locations based on their ability to best represent the cluster's overall characteristics. The selection process prioritizes nodes with the smallest average distance to all other points within the cluster, ensuring that the chosen locations are centrally located. Additionally, the quality of the clustering is assessed using the silhouette coefficient (SC), with nodes from clusters that exhibit higher silhouette scores being favored.

Figure 1

The flowchart of the proposed research methodology.

Figure 1

The flowchart of the proposed research methodology.

Close modal

One should be aware that the order of conducting morphology and magnitude similarity clustering hinges on the particular data set and monitoring goal, and different orders will result in different clustering outcomes. This study exemplifies the case of first conducting morphology similarity clustering and then conducting magnitude similarity clustering. In this case, the clustering results will emphasize the morphology characteristics of the samples, and the magnitude similarity will contribute to further refining the clustering outcomes.

Development of the clustering model

The clustering methodology serves as a powerful tool for analyzing multivariate data by grouping objects based on their similarity or distance. Objects within the same cluster exhibit high similarity, while significant differences exist between clusters. Hierarchical clustering, a classic partitioning method, constructs a hierarchical structure of class similarities, enabling the clear visual identification of divisions. Compared to other clustering techniques, hierarchical clustering offers distinct advantages, such as not requiring a predefined number of clusters, flexible definitions for distance and similarity, and fewer constraints.

In this study, a bilayer iterative clustering approach is proposed to identify monitoring locations with precision and flexibility. The methodology consists of two successive stages: the first stage clusters time-series data based on morphological similarity, while the second stage refines these clusters by considering magnitude similarity. In the first stage, the TLCC coefficient is used to evaluate performance similarity, grouping time series into initial clusters. The second stage further subdivides these clusters using the DTW distance as a measure of similarity. Agglomerative hierarchical clustering is employed throughout, relying on the maximum distance between elements within clusters to ensure cohesiveness.

To ensure stable and reliable clustering, the process incorporates iterative analysis under specific threshold constraints. The workflow begins by calculating the TLCC coefficient between all time series in the dataset, forming a similarity matrix. Using hierarchical clustering, the time series are grouped into clusters (Mi, where ) based on an outer threshold δ1. Within each cluster, the similarity of time series falls within the threshold δ1, while the similarity between clusters exceeds this threshold. These clusters are then further refined in the inner layer, where the DTW distance is used as the similarity measure. Applying an inner threshold δ2, each outer cluster is subdivided into smaller clusters (Ni, where ). The total number of clusters is determined as .

The thresholds δ1 and δ2 are adjustable to meet specific engineering requirements, allowing for tailored clustering refinement. Smaller thresholds result in more detailed clusters but increase the total number of clusters. These thresholds are selected based on the following principles:

  • (1) Outer threshold (δ1): This is determined by referencing the Pearson correlation coefficient's rank order (e.g., Table 1). Strong positive correlations are typically used to select δ1, ensuring finer clustering of morphology similarity.

  • (2) Inner threshold (δ2): This is derived from a maximum allowable amplitude difference (d) among monitoring indices with similar morphology but differing magnitudes. Using the Euclidean distance formula, δ2 is calculated as , where n is the time-series dimension. Smaller values of d result in finer clustering by emphasizing magnitude similarity.

Table 1

Reference for the threshold δ1 selection

DegreePositive correlationNegative correlation
Very weak or no correlation 0.8–1.0 1.0–1.2 
Weak correlation 0.6–0.8 1.2–1.4 
Moderate correlation 0.4–0.6 1.4–1.6 
Strong correlation 0.2–0.4 1.6–1.8 
Extremely strong correlation 0.0–0.2 1.8–2.0 
DegreePositive correlationNegative correlation
Very weak or no correlation 0.8–1.0 1.0–1.2 
Weak correlation 0.6–0.8 1.2–1.4 
Moderate correlation 0.4–0.6 1.4–1.6 
Strong correlation 0.2–0.4 1.6–1.8 
Extremely strong correlation 0.0–0.2 1.8–2.0 

Time-lagged cross-correlation analysis

Selecting an appropriate similarity measure is essential for accurately clustering time-series data. Metrics such as the correlation coefficient are commonly used to quantify the degree of similarity, with higher values indicating greater similarity. Pearson's correlation coefficient, a widely adopted measure of morphology similarity, is particularly effective for capturing the direction of change in time-series patterns. It accounts for minor local variations in morphology and eliminates the need for prior data normalization. However, Pearson's correlation coefficient is insufficient for monitoring indicators in UDS due to the presence of time lags inherent in water transport processes.

To address this limitation, the cross-correlation function is used in this study to calculate morphology similarity while accounting for time lags. The time lag between two time series is identified at the point where the correlation reaches its maximum value. This maximum correlation value, considered a lag-dependent Pearson correlation coefficient, reflects the degree of similarity between the two time series. Specifically, one series is shifted by a time lag p relative to the other.

The cross-correlation functions of the two time series, xt and yt, are defined as follows:
(1)
(2)
where xt and yt are the time series of two monitoring indicators; p is the time lag; L is the truncation point; n is the length of the time series; and are the means of the time series; σx and σy are the standard deviations of the time series; and Cxy(p) and rxy(p) is the cross-covariance and cross-correlation coefficient at lag p, respectively.

DTW for similarity measurement

Assessing the similarity between time-series data often involves measuring distances, where smaller distances indicate greater similarity. While time series from different nodes may share similar patterns, direct comparisons at the same time points are often ineffective due to flow travel time delays. Synchronizing the time series before comparison is essential. For instance, as shown in Figure 2(a), two time series may exhibit similar overall shapes but remain misaligned along the time axis. Using traditional Euclidean distance with a one-to-one mapping (Figure 2(b)) in such cases is inadequate, as it is highly sensitive to even minor temporal distortions.
Figure 2

Comparison of Euclidean distance and DTW. (a) Misaligned time series, (b) Euclidean distance mapping, (c) DTW alignment, and (d) DTW warping path.

Figure 2

Comparison of Euclidean distance and DTW. (a) Misaligned time series, (b) Euclidean distance mapping, (c) DTW alignment, and (d) DTW warping path.

Close modal

To address this limitation, DTW offers a more flexible approach by allowing elastic shifts along the time axis. DTW accommodates both global phase corrections and local distortions, such as stretched or compressed segments. This algorithm aligns two time series optimally and provides a distance measure to quantify their similarity. As shown in Figure 2(c), DTW achieves an alignment where the distance between corresponding points accurately reflects the true similarity of the time series.

The DTW algorithm begins with two sequences, X = {x,x2, … xi, … ,xn} of length n and Y = {y1,y2, … yj, … ,ym,} of length m. An n × m matrix is constructed, where each element (i, j) represents the local distance between points xi and yj. The Euclidean distance is typically used to calculate these local distances, defined as: d(xi, yj) = (xiyj)2. A warping path, W = {w1,w2, … ,wq}, defines the alignment between X and Y. The warping path must satisfy the following constraints: (1) boundary condition: the first and last points of X and Y must align. (2) Continuity condition: each point in one sequence must align with at least one point in the other. (3) Monotonicity condition: The alignment must preserve the temporal order of the sequences. The optimal warping path minimizes the total warping cost, resulting in the DTW distance:
(3)
Here, Q is the length of the warping path, and dividing by Q ensures consistency across paths of varying lengths. The optimal path is identified by constructing a cumulative cost matrix γ(i, j) and solving the following recursive equation using dynamic programming (Figure 2(d)):
(4)

While the classical DTW algorithm effectively handles local time shifts, it can occasionally produce unnatural alignments, known as pathological warping. To address this, a modified DTW algorithm is used in this study. This version introduces constraints on the warping path by limiting the total number of connections during optimization. These constraints reduce the likelihood of incorrect alignments while maintaining flexibility in similarity measurement. The maximum allowable warping path length is used as the time lag between the two time series.

For further details on this modified DTW algorithm, refer to Zhang's work (2017). This enhancement ensures that DTW remains a reliable tool for analyzing time-series data in urban drainage monitoring strategies.

Optimization of monitoring sites

Clustering analysis provides a systematic approach to determine monitoring locations by grouping nodes based on their time series similarity. Varying the similarity threshold δ produces different clustering outcomes. For each threshold δ, monitoring locations are identified as the nodes within each cluster that have the smallest average distance to all other nodes in the cluster.

The selection of an optimal number of clusters is critical to devising an effective monitoring strategy. The goal is to maximize clustering quality by ensuring that nodes within a cluster exhibit high similarity while minimizing similarity between clusters. To validate clustering results, various techniques, such as evaluation metrics and illustrative examples, can be employed. In this study, the SC is used to assess the effectiveness of clustering. This metric evaluates both the compactness within clusters and the separation between clusters without requiring prior knowledge of the true labels (Rousseeuw 1987). For each sample i, the SC is defined as:
(5)
where ai is the average distance of sample i to other samples within the same cluster, and bi is the average distance of sample iii to the nearest neighboring cluster. The si values range from −1 to 1, where values close to 1 indicate well-clustered samples, values near −1 suggest misclassification, and values close to 0 indicate boundary samples between clusters.
To evaluate overall clustering validity, the average SCk for a dataset with n samples and k clusters is calculated as follows:
(6)

The SCk value ranges from −1 to 1, with higher values indicating better clustering quality. This metric is used to determine the most suitable number of clusters.

Although increasing the number of clusters may reduce the average distance among time series within each cluster, excessive clustering often leads to diminishing returns due to marginal effects. To balance clustering quality and practicality, a maximum threshold for the number of clusters (kmax) is predefined. The SCk is then calculated for candidate cluster numbers ranging from 2 to kmax. The k value corresponding to the highest SCk is selected as the optimal number of clusters.

In real-world datasets, anomalous time series, such as those caused by sewer surcharges in specific regions, can complicate clustering. When the chosen k is inappropriate, these anomalies may be misclassified with regular time series, leading to a significant drop in SCk. Properly isolating anomalous time series into separate clusters or grouping them together enhances clustering quality and ensures meaningful results. By using SC as a quantitative metric, this approach effectively identifies and addresses such anomalies, improving the reliability of monitoring strategies.

Hypothetical network configuration and simulation

The sewer network shown in Figure 3 represents a simplified, hypothetical drainage system designed to illustrate the application of the proposed methodology. This network includes 42 manholes and serves a drainage area of 23.95 hectares, divided into 48 sub-catchments. A dynamic wave routing model was developed using InfoWorks ICM (Wallingford, UK), with parameters appropriately configured.
Figure 3

Simplified sewer network created for illustrative purposes.

Figure 3

Simplified sewer network created for illustrative purposes.

Close modal
Rainfall data for a 0.5-year recurrence interval were used for the simulation. The event recorded a cumulative precipitation of 31.9 mm over 120 min (Figure 4). Using this rainfall scenario, flow rate and water level time-series data for each node were generated through an 8-h simulation of the network. Outputs were recorded at 1-min intervals, providing comprehensive data for subsequent analysis.
Figure 4

Temporal pattern of design rainfall (P = 0.5 a).

Figure 4

Temporal pattern of design rainfall (P = 0.5 a).

Close modal

Clustering results and analysis

The proposed methodology was applied to a simplified virtual sewer network to evaluate its effectiveness. For the flow rate index, a threshold value of δ1 = 0.2 was set. The maximum morphology variance of the original flow rate time series was calculated as 0.77, exceeding the threshold δ1, necessitating clustering to ensure morphological similarity. Through iterative outer clustering, six clusters (M1, M2, … , M6) were formed (Figure 5(a)), each with a maximum morphology variance below δ1.
Figure 5

.Flow rate time-series clustering results. (a) Morphology similarity clustering results (Clustering Step 1). (b–g) Magnitude similarity clustering results (Clustering Step 2) based on results from Clustering Step 1: M1–M6. (h) Bilayer clustering results.

Figure 5

.Flow rate time-series clustering results. (a) Morphology similarity clustering results (Clustering Step 1). (b–g) Magnitude similarity clustering results (Clustering Step 2) based on results from Clustering Step 1: M1–M6. (h) Bilayer clustering results.

Close modal

These six clusters, each representing distinct morphologies, were then used as the basis for inner-layer clustering based on magnitude similarity. Setting the variation threshold δ2 = 0.4386, the corresponding maximum allowable difference in flow rate amplitude (df) = 0.02 m3/s was determined. Clusters, such as M1, M3, M4, and M6, exceeded this threshold, leading to further subdivision into two clusters each. This bilayer clustering approach ultimately produced 10 clusters (Figure 5(b)–5(g)), with maximum magnitude differences within each cluster below δ2.

For the water level index, thresholds of δ1 = 0.2 and δ2 = 6.5795 (maximum allowable difference in water level amplitude (dw) = 0.3 m) were applied, resulting in 10 clusters after bilayer iterative clustering (Figure 6(a)–6(d)). The clustering results for flow rate and water level time series (Figure 5(h) and Figure 6(e)) clearly distinguish clusters by their patterns and magnitudes, with no misclassification or omissions. However, it is worth noting that despite the hydrological relationship between flow rate and water level, the clustering results differ significantly. This can be attributed to factors such as hydraulic decoupling at certain nodes in the network, the presence of flow-regulation mechanisms, and temporal variability in the network dynamics (Gibbs et al. 1972; Cao et al. 2002; Cristiano et al. 2017; Liu et al. 2022). These factors can lead to weak correlation between the flow rate and water level at some drainage pipeline nodes, causing the clustering of these two variables to be distinct.
Figure 6

Water level time series clustering results. (a) Morphology similarity clustering results (Clustering Step 1); (b–d) Magnitude similarity clustering results (Clustering Step 2) based on results from Clustering Step 1: M1-M3. (e) Bilayer clustering results.

Figure 6

Water level time series clustering results. (a) Morphology similarity clustering results (Clustering Step 1); (b–d) Magnitude similarity clustering results (Clustering Step 2) based on results from Clustering Step 1: M1-M3. (e) Bilayer clustering results.

Close modal

This example demonstrates the effectiveness of the proposed method in distinguishing between the complex behaviors of flow rate and water level indices in sewer networks.

Identification of key monitoring points

The clustering outcomes enable the development of tailored monitoring schemes. For each cluster, the node with the lowest average distance to others is selected as the representative monitoring location. Figure 7 illustrates the geographic distribution of the selected monitoring locations. The different colors correspond to the clustering results, while the triangles mark the chosen monitoring points.
Figure 7

The location of monitoring sensors. (a) Flow rate and (b) water level.

Figure 7

The location of monitoring sensors. (a) Flow rate and (b) water level.

Close modal

For flow rate monitoring, using thresholds δ1 = 0.2 and δ2 = 0.4386 (df = 0.02 m3/s), 10 monitoring locations were identified, covering both main and branch pipes (Figure 7(a)). These locations effectively capture flow rate conditions across the study area at this threshold level while avoiding redundancy.

For water level monitoring, thresholds δ1 = 0.2 and δ2 = 6.5795 (dw = 0.3 m) also resulted in 10 monitoring locations (Figure 7(b)). However, only 60% of these locations overlapped with those for flow rate monitoring. Water level monitoring points were more evenly distributed and concentrated in the main pipe.

The geographic distribution of representative nodes revealed that clustering is influenced by upstream–downstream relationships and pipeline structures. This highlights the need to consider specific monitoring objectives, such as overflow prediction, flood risk assessment, or system optimization, when determining monitoring locations. Different monitoring indicators may require separate or combined analyses to ensure optimal placement.

Evaluation of controllable clustering performance

Adjusting the inner and outer thresholds allows for control over the granularity of clustering, enabling the selection of an optimal balance between precision and practical constraints. The number of clusters is influenced by monitoring requirements and economic limitations.

The final number of clusters and the evaluation indicator SCk under varying thresholds are shown in Figure 8. A smaller threshold results in more clusters, reflecting greater refinement. Flow rate time series exhibit wide morphological variations, while water level time series appear more consistent. When δ2 is constant and df ≥ 0.025 m3/s, the cluster count fluctuates with δ1 (Figure 8(a)). In contrast, for water level data, δ1 has minimal impact on cluster numbers, even with the same δ2 (Figure 8(b)). Both indices show fewer clusters as δ2 increases when δ1 remains unchanged, highlighting significant magnitude disparities among data points.
Figure 8

Final number of clusters and evaluation index SCk adjusted by outer and inner thresholds. (a) Flow rate and (b) water level.

Figure 8

Final number of clusters and evaluation index SCk adjusted by outer and inner thresholds. (a) Flow rate and (b) water level.

Close modal

The monitoring scheme's validity was assessed using SCk. Figure 10 illustrates that increasing cluster numbers does not always improve SCk; instead, it follows a decreasing trend if clustering becomes unreasonable. For flow rate data, SCk is highly sensitive to δ1, declining when df < 0.03 m3/s before rising. The maximum SCk of 0.793 occurs at δ1 = 0.2 and df = 0.04 m3/s, with eight clusters identified. For water level data, SCk is less influenced by δ1. The maximum SCk of 0.575 is achieved with seven clusters at δ1 = 0.1 and dw = 0.4 m. These results underscore the importance of adjusting thresholds to balance precision with practical requirements, ensuring optimal clustering and monitoring configurations.

This evaluation ensures that the selected clustering configuration maximizes SCk while meeting project-specific monitoring needs and device limitations. The number of clusters corresponds to the number of required sensors, enabling the identification of optimal monitoring locations through rigorous clustering analysis.

Study area characteristics and data sources

The study was conducted in the BL community, an older neighborhood in Zhenhai District, Ningbo, China (Figure 9(a)). Spanning 7.13 hectares, this area is characterized by high population density. Land use within the catchment is divided into residential areas (67.2%), green space (25.3%), and roadways and open spaces (7.5%) (Figure 9(b)). Runoff surfaces in the region are primarily composed of roofs, green surfaces, and road surfaces. The area experiences an average annual temperature of 16.4 °C and receives a mean annual precipitation of 1,480 mm, mostly occurring between May and September.
Figure 9

Study area information. (a) Geographical location and (b) land-use information.

Figure 9

Study area information. (a) Geographical location and (b) land-use information.

Close modal
Figure 10

Drainage network model.

Figure 10

Drainage network model.

Close modal

To establish a hydraulic model for the drainage system, data on rainfall, sewer network specifications, and hydrological and hydraulic parameters were integrated. Rainfall data were collected using an L3 tipping bucket rain gauge, recording at one-min intervals. During the monitoring period (July 30 to December 31, 2020), rainfall occurred for 35 days, totaling 408 mm. Rainfall return periods (P = 0.5, 5, and 50 years) were calculated using the Ningbo rainstorm intensity formula (Zhejiang Provincial Department of Housing and Urban-Rural Development 2020). Rainfall event characteristics and statistics are summarized in Table 2.

Table 2

Part of rainfall events

Rainfall eventsAug. 27 18:30–20:30Sept. 11 9:20–10:20Sept. 14 11:00–18:00P = 0.5
2 h
P = 5
2 h
P = 50
2 h
Duration of rainfall (h) 10 
Accumulated precipitation (mm) 15 21.6 31 31.9 74.8 117.7 
Peak rainfall intensity (mm/min) 0.8 0.6 0.4 1.1 2.6 4.1 
Rainfall eventsAug. 27 18:30–20:30Sept. 11 9:20–10:20Sept. 14 11:00–18:00P = 0.5
2 h
P = 5
2 h
P = 50
2 h
Duration of rainfall (h) 10 
Accumulated precipitation (mm) 15 21.6 31 31.9 74.8 117.7 
Peak rainfall intensity (mm/min) 0.8 0.6 0.4 1.1 2.6 4.1 

Flow rate and water level data were gathered during rainfall events via on-site monitoring. Network data, including manhole elevations, pipe dimensions, and invert elevations, were obtained in CAD format from the 2009 pipeline census by the Zhenhai Planning, Surveying, and Design Institute. Hydrological and hydraulic parameters such as land use, vegetation coverage, and surface slopes were also provided by the institute. Manning's roughness coefficient for the drainage pipes was determined based on regional pipe characteristics.

Development of the hydraulic model

The case study hydraulic model was constructed in InfoWorks ICM, incorporating pipeline and manhole data from CAD files. The pipeline network was simplified to represent main and branch pipelines, excluding manholes on road edges. The pipeline topology was carefully reviewed to ensure accuracy.

The model covered a 7.56-hectare area, including 316 sub-catchments, 282 manholes, 279 drainage pipes, one pumping station, and six outlets. The network spans approximately 5,587.8 m, with pipe diameters ranging from 200 to 400 mm (Figure 10). Monitoring points were strategically arranged, focusing on manholes with branch access. These points, marked as blue nodes, include 43 nodes and 44 pipes, with flow monitoring directions indicated by blue pipes. Surface runoff parameters were assigned based on actual land-use conditions, and a fixed percentage runoff model was employed. Confluence dynamics were analyzed using the SWMM model, leveraging detailed land-use data.

Initial model parameters were set based on site-specific conditions and prior studies. The fixed coefficients for runoff, confluence, initial loss, and Manning's roughness were subjected to iterative adjustments to achieve precise model calibration. To accomplish this, calibration and validation were performed using rainfall events from August 27 and September 11, 2020, which provided flow data for two representative nodes (Node 1 and Node 2). These events, with rainfall depths of 15.0 and 21.6 mm over 2 and 10 h, respectively, were used to fine-tune parameters such as runoff coefficients, confluence factors, initial loss, and Manning's roughness. Situated in close proximity to the midstream and downstream regions of the pipeline system, the two designated nodes served as representative locations, effectively representing the overall characteristics of the model.

The calibrated model demonstrated strong agreement between observed and simulated hydrographs at the selected nodes (Figure 11). Performance metrics, including the squared deterministic coefficient (R2) and Nash–Sutcliffe efficiency (NSE), confirmed the model's accuracy (Table 3) (Cheng et al. 2020). Both calibration and validation stages achieved NSE values above 0.77 and R2 values exceeding 0.78, indicating the model's reliability in evaluating monitoring point identification within the drainage network.
Table 3

Results of model calibration and verification

Node numberCalibration (Aug. 27)
Verification (Sept. 11)
R2NSER2NSE
Node 1 0.88 0.87 0.78 0.77 
Node 2 0.92 0.90 0.88 0.87 
Node numberCalibration (Aug. 27)
Verification (Sept. 11)
R2NSER2NSE
Node 1 0.88 0.87 0.78 0.77 
Node 2 0.92 0.90 0.88 0.87 
Figure 11

Calibration results from data collected on August 27, 2020: (a) Node 1 and (b) Node 2. Validation results from data collected on September 11, 2020: (c) Node 1 and (d) Node 2.

Figure 11

Calibration results from data collected on August 27, 2020: (a) Node 1 and (b) Node 2. Validation results from data collected on September 11, 2020: (c) Node 1 and (d) Node 2.

Close modal

Optimization of monitoring point allocation

On September 14th, a light drizzle accumulated 31 mm of rainfall over 7 h throughout the study area, with a peak intensity of 0.4 mm/min. A bilayer iterative clustering analysis was applied to simulated time series of the flow rate and the water level, yielding optimal results (Figure 12(a) and 12(b)).
Figure 12

Final cluster analysis results and evaluation metrics (based on data from September 14th). (a) Flow rate clusters with the SCk index. (b) Water level clusters with the SCk index. Optimal monitoring locations. (c) Flow rate (δ1 = 0.65, df = 0.005 m3/s). (d) Water level (δ1 = 0.2, dw = 0.1 m).

Figure 12

Final cluster analysis results and evaluation metrics (based on data from September 14th). (a) Flow rate clusters with the SCk index. (b) Water level clusters with the SCk index. Optimal monitoring locations. (c) Flow rate (δ1 = 0.65, df = 0.005 m3/s). (d) Water level (δ1 = 0.2, dw = 0.1 m).

Close modal

For the flow rate indicator, the optimal clustering quality was achieved at δ1 = 0.65 and df = 0.005 m3/s, yielding an impressive clustering coefficient (SCk) of 0.922 with seven monitoring points. For the water level indicator, the best results were obtained at δ1 = 0.2 and dw = 0.1 m, with SCk = 0.666 and 14 monitoring points. These outcomes highlight that different thresholds influence clustering refinement but do not imply correctness or error, allowing threshold selection to align with specific engineering requirements.

The optimal monitoring point arrangements are depicted in Figure 12(c) and 12(d). The pipes marked in different colors represent those belonging to the same flow rate cluster, and triangles mark monitoring locations. The flow rate monitoring scheme reveals that nodes within the same cluster are spatially dispersed yet maintain similar positions relative to upstream and downstream pipe network sections. Conversely, the water level monitoring scheme requires twice as many points, with locations differing significantly due to the strong influence of topography.

Monitoring points are selected to represent the overall pipe network performance while allowing for targeted investigations. Clusters demonstrate process line similarities with slight variations within thresholds, making it sufficient to monitor one point per cluster.

However, financial constraints often limit monitoring at all optimal points. Decision-makers must adjust thresholds to balance clustering outcomes and SCk values, tailoring monitoring schemes to practical limitations. The methodology proposed in this study provides a structured approach to optimizing monitoring point selection under various constraints, aiding decision-making in complex scenarios.

Comparative analysis of rainfall scenarios

Monitoring stormwater runoff during rainfall provides critical insights into the overall runoff volume and water quality in UDS. However, runoff characteristics, such as quantity and quality, can vary significantly across drainage nodes due to factors like rainfall intensity and the intervals between events. Therefore, rainfall variability must be carefully considered when selecting monitoring locations.

Using the ICM model, data for various rainfall return periods were simulated to optimize the placement of monitoring points under different rainfall scenarios, including extreme events. In addition to analyzing the rainfall event on September 14th, the Chicago rain pattern was applied to generate design rainfall events with return periods (P) of 0.5, 5, and 50 years. These events, characterized by a rain peak location parameter of 0.5 and a duration of 2 h, allowed for a systematic evaluation of monitoring strategies. Flow monitoring points were identified automatically for each scenario using consistent thresholds (δ1 = 0.25 and df = 0.01 m3/s), as shown in Figure 13.
Figure 13

Optimal monitoring scheme of different rainfall levels (δ1 = 0.25, df = 0.01 m3/s). (a) September 14th, (b) P = 0.5, (c) P = 5 and (d) P = 50.

Figure 13

Optimal monitoring scheme of different rainfall levels (δ1 = 0.25, df = 0.01 m3/s). (a) September 14th, (b) P = 0.5, (c) P = 5 and (d) P = 50.

Close modal

The results indicate that higher rainfall return periods require more monitoring points, with adjustments or additions to the configuration used for lower return periods. This relationship can be explained by the greater variability in runoff characteristics, such as higher flow volumes, more intense pollutant transport, and spatially heterogeneous runoff patterns, associated with heavier rainfall events (Saharia et al. 2021; Zhou et al. 2021). During such extreme events, additional monitoring points are necessary to capture the variability in both flow and water quality across the drainage network. Importantly, monitoring points optimized for smaller rainfall events can still provide valuable data during more intense rainfall, ensuring flexibility and reliability in the monitoring strategy.

Sensor allocation is often constrained by budget limitations, requiring careful optimization of monitoring strategies. By adjusting the threshold values, it is possible to identify flow monitoring points with the highest clustering quality, even when the total number of monitoring points is fixed at 10. As illustrated in Figure 14, the optimal locations for monitoring can vary significantly depending on the predefined number of monitoring points.
Figure 14

The location of flow rate monitoring points under different rainfall events (total of 10 sensors). (a) September 14th (δ1 = 0.15, df = 0.01 m3/s, SCk = 0.877). (b) P = 0.5 (δ1 = 0.15, df = 0.02 m3/s, SCk = 0.814). (c) P = 5 (δ1 = 0.45, df = 0.01 m3/s, SCk = 0.898). (d) P = 50 (δ1 = 0.45, df = 0.015 m3/s, SCk = 0.836).

Figure 14

The location of flow rate monitoring points under different rainfall events (total of 10 sensors). (a) September 14th (δ1 = 0.15, df = 0.01 m3/s, SCk = 0.877). (b) P = 0.5 (δ1 = 0.15, df = 0.02 m3/s, SCk = 0.814). (c) P = 5 (δ1 = 0.45, df = 0.01 m3/s, SCk = 0.898). (d) P = 50 (δ1 = 0.45, df = 0.015 m3/s, SCk = 0.836).

Close modal

Table 4 highlights the consistency of flow monitoring points across different rainfall events. A high degree of overlap (50%) is observed among the monitoring points identified for the September 14th rainfall event and those for events with return periods of P = 0.5 and P = 5. Similarly, strong consistency exists between events with return periods of P = 5 and P = 50. However, the agreement drops to 30% when considering all three events (P = 0.5, 5, and 50) collectively.

Table 4

The percentage of coincident flow rate monitoring points for the different rainfall events

Rainfall eventsSept. 14P = 0.5P = 5P = 50
Sept. 14 100 50 50 40 
P = 0.5 50 100 30 30 
P = 5 50 30 100 50 
P = 50 40 30 50 100 
Rainfall eventsSept. 14P = 0.5P = 5P = 50
Sept. 14 100 50 50 40 
P = 0.5 50 100 30 30 
P = 5 50 30 100 50 
P = 50 40 30 50 100 

This variability underscores the limitations of relying on a single rainfall event to define a monitoring scheme. To address this, composite similarity measures can be calculated using Equations (7) and (8):
(7)
(8)
Here, ri and δi represent the cross-correlation coefficient and DTW distance for a specific rainfall event, respectively, while αi is the weight assigned to each event. The weights (αi) should reflect the monitoring objectives; for instance, larger rainfall events may receive higher weights when studying nodal overflows. By incorporating these composite measures into the bilayer iterative clustering analysis, monitoring points can be optimally configured to balance the variability across different rainfall scenarios.

While additional experiments would offer valuable insights, the proposed method provides a strong foundation for optimizing the placement of monitoring points under various rainfall scenarios. We recommend that future research, when feasible, incorporates real-world rainfall data to validate and refine the monitoring network design, enhancing the practical applicability of the strategy.

This paper introduces a novel approach based on iterative clustering to determine optimal locations for monitoring sites in UDS. The method utilizes model simulations to analyze the temporal dynamics of all nodes under varying rainfall conditions. To address the influence of time delays on node-specific time-series data, the analysis incorporates TLCC coefficients and DTW distances within a bilayer clustering framework. By adjusting the threshold values, the method achieves a balance between clustering refinement and engineering requirements, enabling a practical and accurate determination of monitoring points.

The application of this approach to both an illustrative scenario and a case study demonstrates that the placement of monitoring points is influenced by the chosen indicators. Clusters of nodes tend to form in similar upstream or downstream segments of the pipe network, reflecting the spatial organization of the system. For water level observations, clusters are confined to distinct regions, requiring a greater number of monitoring points to achieve optimal clustering accuracy. Notably, monitoring points identified under normal rainfall conditions retain their effectiveness during more intense rainfall events, highlighting their adaptability.

However, practical constraints, such as equipment limitations and rainfall variability, necessitate a careful weighting of different rainfall events to achieve a judicious monitoring arrangement. These findings underscore the utility of the proposed approach in developing efficient systems for managing pipe networks and monitoring urban drainage, offering valuable insights for improved urban water infrastructure planning.

The authors would like to acknowledge the staff from the Zhenhai Planning, Surveying, and Design Institute for their assistance in the field sensor location survey and installation.

This study was supported by the National Key Research and Development Program of China (2022YFC3203200) and the Key Research and Development Program of Ningbo (2023Z216).

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Adedoja
O. S.
,
Hamam
Y.
,
Khalaf
B.
&
Sadiku
R.
(
2018
)
A state-of-the-art review of an optimal sensor placement for contaminant warning system in a water distribution network
,
Urban Water Journal
,
15
(
10
),
985
1000
.
https://doi.org/10.1080/1573062X.2019.1597378
.
Afshar
A.
,
Massoumi
F.
,
Afshar
A.
&
Mariño
M. A.
(
2015
)
State of the art review of ant colony optimization applications in water resource management
,
Water Resources Management
,
29
(
11
),
3891
3904
.
https://doi.org/10.1007/s11269-015-1016-9
.
Bai
G.
,
Hou
J.
,
Zhang
Y.
,
Han
H.
,
Yang
D.
,
Shi
B.
,
Ma
Y.
&
Ji
G.
(
2021
)
High-resolution simulation and monitoring of urban flood processes at the campus scale
,
Journal of Hydrologic Engineering
,
26
(
8
),
05021018
.
Banik
B. K.
,
Alfonso
L.
,
Torres
A. S.
,
Mynett
A.
,
Di Cristo
C.
&
Leopardi
A.
(
2015
)
Optimal placement of water quality monitoring stations in sewer systems: an information theory approach
,
Procedia Engineering
,
119
,
1308
1317
.
https://doi.org/10.1016/j.proeng.2015.08.956
.
Banik
B. K.
,
Alfonso
L.
,
Di Cristo
C.
,
Leopardi
A.
&
Mynett
A.
(
2017
)
Evaluation of different formulations to optimally locate sensors in sewer systems
,
Journal of Water Resources Planning and Management
,
143
(
7
),
1
12
.
https://doi.org/10.1061/(asce)wr.1943-5452.0000778
.
Basaran
B.
&
Günes
F.
(
2016
)
Data clustering
,
Intelligent Multidimensional Data Clustering and Analysis
,
31
(
3
),
28
72
.
https://doi.org/10.4018/978-1-5225-1776-4.ch002
.
Brentan
B.
,
Carpitella
S.
,
Barros
D.
,
Meirelles
G.
,
Certa
A.
&
Izquierdo
J.
(
2021
)
Water quality sensor placement: a multi-objective and multi-criteria approach
,
Water Resource Management
,
35
(
1
),
225
241
.
https://doi.org/10.1007/s11269-020-02720-3
.
Cao
Z.
,
Day
R.
&
Egashira
S.
(
2002
)
Coupled and decoupled numerical modeling of flow and morphological evolution in alluvial rivers
,
Journal of Hydraulic Engineering
,
128
,
306
321
.
https://doi.org/10.1061/(ASCE)0733-9429(2002)128:3(306)
.
Cardoso
S. M.
,
Barros
D. B.
,
Oliveira
E.
,
Brentan
B.
&
Ribeiro
L.
(
2021
)
Optimal sensor placement for contamination detection: a multi-objective and probabilistic approach
,
Environmental Modelling & Software
,
135
,
104896
.
https://doi.org/10.1016/j.envsoft.2020.104896
.
Cheng
M.
,
Qin
H.
,
Fu
G.
&
He
K.
(
2020
)
Performance evaluation of time-sharing utilization of multi-function sponge space to reduce waterlogging in a highly urbanizing area
,
Journal of Environmental Management
,
269
,
110760
.
https://doi.org/10.1016/j.jenvman.2020.110760
.
Cristiano
E.
,
ten Veldhuis
M.-C.
&
van de Giesen
N.
(
2017
)
Spatial and temporal variability of rainfall and their effects on hydrological response in urban areas – a review
,
Hydrology and Earth System Sciences
,
21
,
3859
3878
.
https://doi.org/10.5194/hess-21-3859-2017
.
Dai
C.
,
Wang
K.
&
Jin
R.
(
2014
)
Monitoring profile trajectories with dynamic time warping alignment
,
Quality and Reliability Engineering International
,
30
(
6
),
815
827
.
https://doi.org/10.1002/qre.1667
.
Edmondson
V.
,
Cerny
M.
,
Lim
M.
,
Gledson
B.
,
Lockley
S.
&
Woodward
J.
(
2018
)
A smart sewer asset information model to enable an ‘Internet of Things’ for operational wastewater management
,
Automation in Construction
,
91
(
February
),
193
205
.
https://doi.org/10.1016/j.autcon.2018.03.003
.
Fattoruso
G.
,
Agresta
A.
,
Guarnieri
G.
,
Lanza
B.
,
Buonanno
A.
,
Molinara
M.
,
Marrocco
C.
,
De Vito
S.
,
Tortorella
F.
&
Francia
G. D.
(
2015
)
Optimal sensors placement for flood forecasting modelling
,
Procedia Engineering
,
119
,
927
936
.
https://doi.org/10.1016/j.proeng.2015.08.971
.
Gibbs
C. V.
,
Alexander
S. M.
&
Leiser
C. P.
(
1972
)
System for regulation of combined sewage flows
,
Journal of the Sanitary Engineering Division
,
98
,
951
972
.
https://doi.org/10.1061/JSEDAI.0001509
.
Gong
J.
,
Yan
X.
&
Hu
C.
(
2022
)
An ensemble-surrogate assisted cooperative particle swarm optimisation algorithm for water contamination source identification
,
International Journal of Bio-Inspired Computation
,
19
(
3
),
169
177
.
https://doi.org/10.1504/IJBIC.2022.123129
.
Gontijo
T. S.
,
Costa
M. A.
&
De Santis
R. B.
(
2020
)
Similarity search in electricity prices: an ultra-fast method for finding analogs
,
Journal of Renewable and Sustainable Energy
,
12
(
5
),
58103
.
https://doi.org/10.1063/5.0021557
.
Guo
X.
,
Zhao
D.
,
Du
P.
&
Li
M.
(
2018
)
Automatic setting of urban drainage pipe monitoring points based on scenario simulation and fuzzy clustering
,
Urban Water Journal
,
15
(
7
),
700
712
.
https://doi.org/10.1080/1573062X.2018.1539504
.
Harif
S.
,
Azizyan
G.
,
Dehghani Darmian
M.
&
Givehchi
M.
(
2023
)
Selecting the best location of water quality sensors in water distribution networks by considering the importance of nodes and contaminations using NSGA-III (case study: Zahedan water distribution network, Iran)
,
Environmental Science and Pollution Research
,
30
(
18
),
53229
53252
.
https://doi.org/10.1007/s11356-023-26075-5
.
Jiang
J.
,
Tang
S.
,
Han
D.
,
Fu
G.
,
Solomatine
D.
&
Zheng
Y.
(
2020
)
A comprehensive review on the design and optimization of surface water quality monitoring networks
,
Environmental Modelling & Software
,
132
,
104792
.
https://doi.org/10.1016/j.envsoft.2020.104792
.
Kang
O.
,
Lee
S.
,
Kim
M.
,
Yoo
S.
&
Yoo
C.
(
2012
)
Sewage pipe network contamination mechanism and pipe network monitoring and abnormal diagnosis based on system analysis
,
Korean Chemical Engineering Research
,
50
(
6
),
980
987
.
Khorshidi
M. S.
,
Nikoo
M. R.
&
Sadegh
M.
(
2018
)
Optimal and objective placement of sensors in water distribution systems using information theory
,
Water Research
,
143
,
218
228
.
https://doi.org/10.1016/j.watres.2018.06.050
.
Lee
E. H.
,
Lee
Y. S.
,
Joo
J. G.
,
Jung
D.
&
Kim
J. H.
(
2016
)
Flood reduction in urban drainage systems: cooperative operation of centralized and decentralized reservoirs
,
Water (Switzerland)
,
8
(
10
),
469
.
https://doi.org/10.3390/w8100469
.
Lee
S.
,
Kim
J.
,
Hwang
J.
,
Lee
E. J.
,
Lee
K. J.
,
Oh
J.
,
Park
J.
&
Heo
T. Y.
(
2020
)
Clustering of time series water quality data using dynamic time warping: a case study from the Bukhan River water quality monitoring network
,
Water (Switzerland)
,
12
(
9
),
2411
.
https://doi.org/10.3390/w12092411
.
Li
J.
,
Hassan
D.
,
Brewer
S.
&
Sitzenfrei
R.
(
2020
)
Is clustering time-series water depth useful? An exploratory study for flooding detection in urban drainage systems
,
Water
,
12
(
9
),
2433
.
https://doi.org/10.3390/w12092433
.
Li
A.
,
Fan
C.
,
Xiao
F.
&
Chen
Z.
(
2022
)
Distance measures in building informatics: an in-depth assessment through typical tasks in building energy management
,
Energy and Buildings
,
258
,
111817
.
https://doi.org/10.1016/j.enbuild.2021.111817
.
Liu
J.
,
Cao
Z.
&
Li
X.
(
2022
)
Coupled modelling of flow and non-capacity sediment transport in sewer flushing channel
,
Water Research
,
219
,
118557
.
https://doi.org/10.1016/j.watres.2022.118557
.
Meijer
D.
,
van Bijnen
M.
,
Langeveld
J.
,
Korving
H.
,
Post
J.
&
Clemens
F.
(
2018
)
Identifying critical elements in sewer networks using graph-theory
,
Water (Switzerland)
,
10
(
2
),
136
.
https://doi.org/10.3390/w10020136
.
Perelman
L.
&
Ostfeld
A.
(
2012
)
Water-distribution systems simplifications through clustering
,
Journal of Water Resources Planning and Management
,
138
(
3
),
218
229
.
https://doi.org/10.1061/(ASCE)WR.1943-5452.0000173
.
Popivanov
I.
&
Miller
R. J.
(
2002
) ‘
Similarity search over time-series data using wavelets
’,
Proceedings – International Conference on Data Engineering
,
212
221
.
https://doi.org/10.1109/ICDE.2002.994711
.
Rathi
S.
&
Gupta
R.
(
2014a
)
A critical review of sensor location methods for contamination detection in water distribution networks
,
Water Quality Research Journal
,
50
(
2
),
95
108
.
https://doi.org/10.2166/wqrjc.2014.011
.
Rathi
S.
&
Gupta
R.
(
2014b
) ‘
Sensor placement methods for contamination detection in water distribution networks: a review
’,
Procedia Engineering, 16th water Distribution System Analysis Conference, WDSA2014
, Vol.
89
,
181
188
.
https://doi.org/10.1016/j.proeng.2014.11.175
.
Rathi
S.
,
Gupta
R.
,
Kamble
S.
&
Sargaonkar
A.
(
2016
)
Risk based analysis for contamination event selection and optimal sensor placement for intermittent water distribution network security
,
Water Resource Management
,
30
(
8
),
2671
2685
.
https://doi.org/10.1007/s11269-016-1309-7
.
Rousseeuw
P. J.
(
1987
)
Silhouettes: a graphical aid to the interpretation and validation of cluster analysis
,
Journal of Computational and Applied Mathematics
,
20
,
53
65
.
https://doi.org/10.1016/0377-0427(87)90125-7
.
Saharia
M.
,
Kirstetter
P.-E.
,
Vergara
H.
,
Gourley
J. J.
,
Emmanuel
I.
&
Andrieu
H.
(
2021
)
On the impact of rainfall spatial variability, geomorphology, and climatology on flash floods
,
Water Resources Research
,
57
,
e2020WR029124
.
https://doi.org/10.1029/2020WR029124
.
Sambito
M.
&
Freni
G.
(
2021
)
Strategies for improving optimal positioning of quality sensors in urban drainage systems for non-conservative contaminants
,
Water (Switzerland)
,
13
(
7
),
934
.
https://doi.org/10.3390/w13070934
.
Schilperoort
R. P. S.
,
Dirksen
J.
&
Clemens
F. H. L. R.
(
2008
) ‘
Practical aspects for long-term monitoring campaigns: pitfalls to avoid to maximize data yield
’,
Presented at the 11th International Conference on Urban Drainage
, Vol.
31
,
Edinburgh, UK
.
1–10
Sharma
S.
&
Kumar
V.
(
2022
)
A comprehensive review on multi-objective optimization techniques: past, present and future
,
Archives of Computational Methods in Engineering
,
29
(
7
),
5605
5633
.
https://doi.org/10.1007/s11831-022-09778-9
.
Tal
B.
,
Bencze
A.
,
Zoletnik
S.
,
Veres
G.
&
Por
G.
(
2011
)
Cross-correlation based time delay estimation for turbulent flow velocity measurements: statistical considerations
,
Physics of Plasmas
,
18
(
12
),
122304
.
https://doi.org/10.1063/1.3662432
.
Talei
A.
&
Chua
L. H. C.
(
2012
)
Influence of lag time on event-based rainfall–runoff modeling using the data driven approach
,
Journal of Hydrology
,
438–439
,
223
233
.
https://doi.org/10.1016/j.jhydrol.2012.03.027
.
Tóth
G.
&
Balogh
B.
(
2012
)
How to identify cross correlations: a statistical test with time lag and its application on air-pollutant time series
,
Journal of Chemometrics
,
26
(
3–4
),
125
133
.
https://doi.org/10.1002/cem.2414
.
Vonach
T.
,
Tscheikner-Gratl
F.
,
Rauch
W.
&
Kleidorfer
M.
(
2018
)
A heuristic method for measurement site selection in sewer systems
,
Water (Switzerland)
,
10
(
2
),
1
16
.
https://doi.org/10.3390/w10020122
.
Wang
S.
,
Zhang
X.
,
Wang
J.
,
Tao
T.
,
Xin
K.
,
Yan
H.
&
Li
S.
(
2023
)
Optimal sensor placement for the routine monitoring of urban drainage systems: a re-clustering method
,
Journal of Environmental Management
,
335
,
117579
.
https://doi.org/10.1016/j.jenvman.2023.117579
.
Zhang
Z.
,
Tavenard
R.
,
Bailly
A.
,
Tang
X.
,
Tang
P.
&
Corpetti
T.
(
2017
)
Dynamic time warping under limited warping path length
,
Information Sciences
,
393
,
91
107
.
https://doi.org/10.1016/j.ins.2017.02.018
.
Zhejiang Provincial Department of Housing and Urban-Rural Development
. (
2020
)
Standards for Rainfall Intensity Computation (DB33/T1191–2020)
.
Hangzhou, China
:
Zhejiang Provincial Department of Housing and Urban-Rural Development
.
Zhou
W.
,
Zhu
Z.
,
Xie
Y.
&
Cai
Y.
(
2021
)
Impacts of rainfall spatial and temporal variabilities on runoff quality and quantity at the watershed scale
,
Journal of Hydrology
,
603
,
127057
.
https://doi.org/10.1016/j.jhydrol.2021.127057
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC 4.0), which permits copying, adaptation and redistribution for non-commercial purposes, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc/4.0/).