Regular and continuous monitoring of urban runoff in both quality and quantity aspects is of great importance for controlling and managing surface runoff. Due to the considerable costs of establishing new gauges, optimization of the monitoring network is essential. This research proposes an approach for site selection of new discharge stations in urban areas, based on entropy theory in conjunction with multi-objective optimization tools and numerical models. The modeling framework provides an optimal trade-off between the maximum possible information content and the minimum shared information among stations. This approach was applied to the main surface-water collection system in Tehran to determine new optimal monitoring points under the cost considerations. Experimental results on this drainage network show that the obtained cost-effective designs noticeably outperform the consulting engineers’ proposal in terms of both information contents and shared information. The research also determined the highly frequent sites at the Pareto front which might be important for decision makers to give a priority for gauge installation on those locations of the network.
INTRODUCTION
Entropy theory is usually applied in hydrology for optimality evaluation of information resulting from the measurements in monitoring stations of water resources. In these applications, economic efficiency of the monitoring system can be evaluated by comparison of the costs related to the monitoring system and the information obtained. In other words, entropy theory is one of the methods for evaluation and quantification of redundant information in water resources and environmental systems using limited available data. Entropy theory, unlike statistical methods, does not need assumptions such as system linearity or normality of assumed probability distribution for uncertain variables. This method is also beneficial in situations with non-homogeneous data and spatial–temporal multivariate features. This theory has been extensively applied to design of monitoring networks in water resources and environmental engineering systems. Some examples are briefly reviewed in the rest of this section.
Alfonso (2010) optimized a monitoring network of polders in the Netherlands using information theory and obtained the best choice of gauge locations by simultaneously considering the joint entropy and total correlation. Lee et al. (2014) determined the optimal places for monitoring stations in the lake of Yang Dam, South Korea, based on information theory and weighting method taking into account several water quality variables. The results showed that the optimal network provides all the information content of existing stations in the lake with fewer monitoring stations. However, the method had high computational burdens. Hirshleifer & Riley (1979) studied whether more investments to receive more information is economic or not and they proposed the value of information theory. Their method gives the opportunity to decision makers to measure the value of information under uncertain conditions and to make the correct decision. Masoumi & Kerachian (2008) developed a new entropy-based approach for assessing the location of salinity monitoring stations in Tehran Aquifer, Iran. They proposed a transinformation–distance (T–D) curve and used it in a multi-objective genetic algorithm-based optimization model, which provided the best locations for monitoring stations. The results showed the applicability and the efficiency of the model in assessing the groundwater monitoring system. Karamouz et al. (2009) presented an entropy-based approach for design of an on-line water quality monitoring network in rivers, in which the number and location of sampling sites as well as the sampling frequencies were determined by minimizing the redundant information. Bashi-Azghadi & Kerachian (2010) developed a new methodology based on information theory for optimally locating monitoring wells in groundwater systems, identifying an unknown pollution source using monitoring data. Mahjouri & Kerachian (2011) evaluated and revised the spatial and temporal sampling frequencies of the water quality monitoring system of the Jajrood River in the northern part of Tehran by developing a discrete entropy-based optimization approach. The results showed that the existing monitoring system of the river should be partially strengthened and in some cases the sampling frequencies should be increased. Li et al. (2012) introduced the maximum information minimum redundancy (MIMR) criterion for the design (or evaluation) of hydrometric networks. The MIMR selection was compared with another entropy-based approach and showed better performance at finding stations with high information content, and locating independent stations. Yoo et al. (2012) applied the entropy theory with the EPANET hydraulic solver to determine the best sites to install the pressure loggers in water distribution systems. Samuel et al. (2013) proposed a combined regionalization and dual entropy-multi-objective optimization (CRDEMO) method for determining the minimum river monitoring network that meets the World Meteorological Organization (WMO) standards. The credibility of the method was investigated on the monitoring network of two watersheds in Canada. Su & You (2014) proposed a spatial information estimation model for the analysis of precipitation gauge networks to improve previous methods based on information theory. The proposed model employed a two-dimensional T–D relationship in conjunction with multivariate information approximation to estimate transinformation to ungauged locations from existing stations, while taking into consideration the influence of multiple stations and anisotropy. Xu et al. (2015) used an entropy theory-based multi-criteria method which simultaneously considered the information derived from rainfall series, minimized the bias of areal mean rainfall and minimized the information overlapped by different gauges to resample the rain gauge networks with different gauge densities. The results revealed that the method provides an optimal design of rain gauge networks which is of vital importance in regional hydrological studies and water resources management.
As mentioned above, entropy theory has been widely used in the design of various monitoring networks of water resources system. While several studies have been done at national and international level to determine the optimal monitoring stations in water bodies such as rivers and groundwater resources, in urban systems studies are very limited. This paper proposes a methodology for finding the optimal layout of monitoring hydrometric gauges on urban drainage systems based on developing an entropy-based multi-objective optimization technique integrated with rainfall-runoff simulation models. Details of this modeling approach are also described.
METHODS AND MATERIALS
Entropy theory
The main difficulty in using the above equations, especially when the number of variables is considerably high, is estimating the joint entropy of multiple variables , which in fact requires the estimation of the joint distribution of
. One solution for this task is using the grouping property of mutual information (Kraskov et al. 2005; Alfonso 2010). In this method, one variable is agglomerated by another in such a way that the entropy of the new variable is the equivalent of the joint entropy of the original pair. This agglomeration can be done by putting the records of two original variables together, constituting a set of new records for the new variable. Then, the new variable is agglomerated by the next variable and so on. The last step involves summing up the partial total correlations for each built variable. To clarify this approach, a numerical example taken from Alfonso (2010) is illustrated here.
Process of calculating the entropy (Equation (1)) for separate variables (Alfonso 2010)
| x1 . | x2 . | X1 . | X2 . | p(X1) . | p(X2) . | log[p(X1)] . | −p(X1)*log[p(X1)] . | log[p(X2)] . | −p(X2)*log[p(X2)] . |
|---|---|---|---|---|---|---|---|---|---|
| 3.24 | 2.2 | 3 | 2 | 0.1 | 0.3 | −1 | 0.1 | −0.52288 | 0.157 |
| 4.25 | 2.08 | 4 | 2 | 0.1 | – | −1 | 0.1 | – | 0 |
| 5.3 | 1.15 | 5 | 1 | 0.7 | 0.1 | −0.155 | 0.108 | −1 | 0.1 |
| 5.33 | 4.81 | 5 | 4 | – | 0.3 | – | 0 | −0.523 | 0.157 |
| 5.45 | 5.4 | 5 | 5 | – | 0.2 | – | 0 | −0.699 | 0.140 |
| 5.7 | 4.36 | 5 | 4 | – | – | – | 0 | – | 0 |
| 6.55 | 4.6 | 6 | 4 | 0.1 | – | −1 | 0.1 | – | 0 |
| 5.42 | 5.21 | 5 | 5 | – | – | – | 0 | – | 0 |
| 5.4 | 3.13 | 5 | 3 | – | 0.1 | – | 0 | −1 | 0.1 |
| 5.25 | 2.91 | 5 | 2 | – | – | – | 0 | – | 0 |
| H(X1) = 0.408 | H(X2) = 0.654 |
| x1 . | x2 . | X1 . | X2 . | p(X1) . | p(X2) . | log[p(X1)] . | −p(X1)*log[p(X1)] . | log[p(X2)] . | −p(X2)*log[p(X2)] . |
|---|---|---|---|---|---|---|---|---|---|
| 3.24 | 2.2 | 3 | 2 | 0.1 | 0.3 | −1 | 0.1 | −0.52288 | 0.157 |
| 4.25 | 2.08 | 4 | 2 | 0.1 | – | −1 | 0.1 | – | 0 |
| 5.3 | 1.15 | 5 | 1 | 0.7 | 0.1 | −0.155 | 0.108 | −1 | 0.1 |
| 5.33 | 4.81 | 5 | 4 | – | 0.3 | – | 0 | −0.523 | 0.157 |
| 5.45 | 5.4 | 5 | 5 | – | 0.2 | – | 0 | −0.699 | 0.140 |
| 5.7 | 4.36 | 5 | 4 | – | – | – | 0 | – | 0 |
| 6.55 | 4.6 | 6 | 4 | 0.1 | – | −1 | 0.1 | – | 0 |
| 5.42 | 5.21 | 5 | 5 | – | – | – | 0 | – | 0 |
| 5.4 | 3.13 | 5 | 3 | – | 0.1 | – | 0 | −1 | 0.1 |
| 5.25 | 2.91 | 5 | 2 | – | – | – | 0 | – | 0 |
| H(X1) = 0.408 | H(X2) = 0.654 |
Process of agglomerating two variables and calculating joint entropy (Alfonso 2010)
| p(X1, X2) . | log[p(X1, X2)] . | −p(X1, X2)*log[p(X1, X2)] . | A . | P(A) . | log[P(A)] . | −p(A)*log[p(A)] . |
|---|---|---|---|---|---|---|
| 0.1 | −1 | 0.1 | 32 | 0.1 | −1 | 0.1 |
| 0.1 | −1 | 0.1 | 42 | 0.1 | −1 | 0.1 |
| 0.1 | −1 | 0.1 | 51 | 0.1 | −1 | 0.1 |
| 0.2 | −0.699 | 0.14 | 54 | 0.2 | −0.699 | 0.14 |
| 0.2 | −0.699 | 0.14 | 55 | 0.2 | −0.699 | 0.14 |
| – | – | 0 | 54 | – | – | 0 |
| 0.1 | −1 | 0.1 | 64 | 0.1 | −1 | 0.1 |
| – | – | 0 | 55 | – | – | 0 |
| 0.1 | −1 | 0.1 | 53 | 0.1 | −1 | 0.1 |
| 0.1 | −1 | 0.1 | 52 | 0.1 | −1 | 0.1 |
| H(X1, X2) = 0.88 | H(A) = 0.88 |
| p(X1, X2) . | log[p(X1, X2)] . | −p(X1, X2)*log[p(X1, X2)] . | A . | P(A) . | log[P(A)] . | −p(A)*log[p(A)] . |
|---|---|---|---|---|---|---|
| 0.1 | −1 | 0.1 | 32 | 0.1 | −1 | 0.1 |
| 0.1 | −1 | 0.1 | 42 | 0.1 | −1 | 0.1 |
| 0.1 | −1 | 0.1 | 51 | 0.1 | −1 | 0.1 |
| 0.2 | −0.699 | 0.14 | 54 | 0.2 | −0.699 | 0.14 |
| 0.2 | −0.699 | 0.14 | 55 | 0.2 | −0.699 | 0.14 |
| – | – | 0 | 54 | – | – | 0 |
| 0.1 | −1 | 0.1 | 64 | 0.1 | −1 | 0.1 |
| – | – | 0 | 55 | – | – | 0 |
| 0.1 | −1 | 0.1 | 53 | 0.1 | −1 | 0.1 |
| 0.1 | −1 | 0.1 | 52 | 0.1 | −1 | 0.1 |
| H(X1, X2) = 0.88 | H(A) = 0.88 |
Similarly, variable A can be agglomerated with variable X3 to create a new variable B using B= 10A+X3 and so on. When there are n variables, they can be hierarchically agglomerated to estimate by the same approach.
Case study
Rainfall-runoff modeling
Rainfall-runoff modeling is carried out to simulate and analyze the urban drainage network under different flooding loads. This modeling is used to estimate flood variables such as discharge time series in ungauged points of the network. Model setup and calibration for the study area has already been carried out in previous studies (Mahab Ghods Consultant Engineers 2011a, 2011b) and here the same modeling approach and calibrated parameters have been used for the model preparation. The first step for rainfall-runoff modeling was determination of design rainfall including rainfall depth and duration with its spatial–temporal pattern. Rainfall depth can be obtained from the intensity–duration–frequency curves, extracted from the recorded rainfalls. According to the previous studies (Mahab Ghods Consultant Engineers 2011a), design rainfall with a 25-year return period has been recommended by the authorities and thus here this return period is selected. The design rainfall duration is suggested to be at least equal to ‘time of concentration’ (Yazdi et al. 2013). This parameter for the studied urban area was estimated as 150 minutes. After a sensitivity analysis on the severity of produced runoff from rainfalls with different durations, a 6 hour duration has been recognized as the critical value and has been selected for design rainfall duration.
is a coefficient associated with the return period of design rainfall and the mean height of the sub-catchment. Therefore, the spatial variation of rainfall throughout the catchment is manipulated by the above equation using the average height of sub-catchments. The value of
is determined using a lookup table according to the rainfall return period and the average height of the sub-catchment.To convert design rainfall into flood hydrographs at sub-catchments, a rainfall-runoff model must be employed. In this study, HEC-HMS version 3.5 has been used for this purpose. HEC-HMS is a free public domain software, developed by the US Hydrologic Engineering Center (HEC), for simulating the rainfall-runoff processes of watershed systems (USACE 2010). Simulation results are stored in the data storage system HEC-DSS and can be used in conjunction with other software. The SCS curve number method, introduced by the US Soil Conservation Service (SCS), was set up for estimating the precipitation losses based on the local land use and soil type properties (USACE 2010).
Furthermore, in order to transform excess precipitation into surface runoff, the SCS unit hydrograph was utilized. According to the land uses and soil types, a total of 182 sub-catchments were considered in hydrological modeling including 81, 42, 35 and 24 sub-catchments for EDC, TCP, WDC and CRB catchments, respectively. After calculating sub-catchment surface runoff by model execution, they were introduced to the hydraulic model of the urban drainage network, here EPA's Storm Water Management Model (SWMM), for flood routing in the channels and conduits of the network. SWMM software is generally used for planning, analysis and design related to stormwater runoff, combined and sanitary sewers, and other drainage systems in urban areas. It is a dynamic hydrologic–hydraulic water quality simulation model, used for single event or continuous simulation of runoff quantity and quality from primarily urban areas. The routing portion transports this runoff through a system of pipes, channels, storage/treatment devices, pumps, and regulators. It has recently been extended to model the hydrologic performance of specific types of low impact development controls (EPA 2016).
Search algorithm
RESULTS AND DISCUSSION
In highly developed regions such as urban areas, WMO (2008) suggests that the area covered by a hydrometric gauge can be up to 10 km2. According to this criterion, Tehran city needs around 30 hydrometric gauges. But, the current state of the monitoring network includes only six active hydrometric gauges inside the city. Therefore, one of the modeling scenarios in this research is deemed as installing 24 new gauges throughout the network. Two other scenarios are also considered in this research, 10 and 15 new gauges, respectively, the results of which might be useful when there is a limitation in available funds for gauge establishment. Therefore, three scenarios are considered in total. It is worth mentioning that the three considered scenarios are not in sequence; i.e. each one is not implemented after the other one. They are three separate scenarios for extending the current monitoring network and, thus, they have their own search space and optimal solutions which are not necessarily subsets of each other.
NSGA-II parameters were set as the best known values in the literature (e.g. see Yazdi 2016), presented in Table A1 (appendix, available with the online version of this paper) and the model was run in an Intel® Core™2 Duo CPU system with 4 GB RAM.
Pareto front solutions found by NSGA-II, (a) 10 monitoring points, (b) 15 monitoring points, and (c) 24 monitoring points.
More specifically, a solution in the Pareto front (shown in Figure 4(c)) with almost the same joint entropy of the previous proposal decreases the overlapping information nearly 13%. Overall, this confirms the high performance of the entropy-based optimization method in finding the effective sites for gauge installation. It also should be mentioned that all solutions above the dashed horizontal line in Figure 4(c) dominate the previously suggested design and are superior to that proposal. Among them, the optimal design with almost the same level of joint entropy with the previously suggested design was selected here for more discussion because it provides the maximum reduction in shared information with respect to the previously suggested design. Generally, each of the solutions above the horizontal line can be selected for implementing on the network according to the criteria such as social aspects and urban aesthetic.
Frequency of the candidate sites in the Pareto front solutions, in the case of (a) 10 new gauge points, (b) 15 new gauge points, and (c) 24 new gauge points.
CONCLUSION
This research presented a methodology for evaluating and redesigning the monitoring system of the main surface-water drainage network of Tehran city, capital of Iran. Applying the entropy theory, the location of new hydrometric stations were found using the NSGA-II optimization algorithm coupled with the SWMM-EPA model. The obtained optimal monitoring network was compared with the previous design recently proposed by the consultant engineers of the municipality. This comparison showed that the obtained Pareto optimal set outperforms the previous design so that all optimal trade-off solutions provide less total correlation. Particularly, the optimal solution with almost the same level of joint entropy of the previous design yields nearly 13% less total correlation (i.e. shared information by the gauges), which means the proposed model chooses the gauge sites efficiently with minimum redundancy, avoiding waste of investment for installing extra gauges. The relative importance of the locations in the drainage network of Tehran was obtained through frequency analysis of the optimal solutions in the Pareto front. While the sites which appear more frequently at the Pareto front need more priority for gauge installation in the hydrometric network, none of the high frequency solutions was suggested in the previous design. Moreover, it was shown that the east network of the city is more critical for gauge installation than other parts of the drainage network. The final monitoring system design can be selected by decision makers from the Pareto optimal set based on the relative importance of joint entropy and total correlation, social aspects, urban aesthetic, etc.
Based on the results, the proposed approach can be used as an effective tool for evaluating, revising, or redesigning the urban drainage network monitoring systems.
ACKNOWLEDGEMENT
This research has been supported by research grant no. 600/461 funded by Shahid Beheshti University, Tehran, Iran.












