Scientific and effective urban waterlogging risk prediction can help improve urban waterlogging disaster prevention capabilities. Combining the numerical simulation model with the data-driven model, the construction of the urban waterlogging risk predictive model can satisfy the prediction accuracy and improve the prediction timeliness. Thus, this paper established an urban waterlogging risk predictive model based on the coupling of the BP neural network and SWMM model, and set five input patterns, finally selected the accumulative precipitation process and precipitation characteristics as input to predict the regional waterlogging risks under different urban rainstorm scenarios. The results show that the overall performance of the pipe drainage system in the study area is lower, and it cannot resist the rainstorm with a higher return period. Moreover, the total waterlogging risk of the southern old city is higher than that of the northern new city in the study area. The calculation speed of the prediction model constructed in this paper is thousands of times higher than that of the numerical model, so the calculation speed is very fast, which meets the requirements of the forecast timeliness.

  • Using fine data to build the SWMM model and various methods to verify the SWMM model.

  • Using web crawler technology to extract ponding points.

  • Using entropy weight method for risk quantization.

  • Using set pair analysis to evaluate the accuracy of the BP neural network.

  • Coupling SWMM model with BP neural network for risk prediction.

Urban waterlogging refers to the phenomenon that the runoff yield volume exceeds the carrying capacity of the urban drainage system under the condition of high-intensity and short-duration rainstorms, which leads to the loss of efficacy of rainwater drainage and then results in large-scale accumulated water on the urban surface (Xu 2021). At present, with climate change and the acceleration of urbanization, urban waterlogging is becoming more and more serious (Xu et al. 2020; Liu et al. 2021). In the summer of 2021, a torrential rainstorm in Zhengzhou City of China brought about severe urban waterlog disasters with completely paralyzed traffic, causing a total of 88.534 billion yuan in direct economic losses and 302 casualties. In 2022, rare continuous heavy rains for more than 60 years hit the east of South Africa, many cities entered a state of emergency, nearly 4,000 houses were completely destroyed, and more than 40,000 people were displaced. So, the rapid and accurate prediction of urban waterlogging risk is the key to scientific waterlogging prevention (Berkhahn et al. 2019; Wang et al. 2019).

Now, scholars have carried out many research in this field, including urban flood forecasting (Yoon & Nakakita 2015; Wu et al. 2020), waterlogging risk warning (Wei et al. 2020; Zhou et al. 2022), urban waterlogging simulation (Xue et al. 2016; Chen et al. 2021), etc. For the study on urban waterlogging risk prediction, the numerical simulation models based on physical mechanisms are commonly used, such as SWMM, MIKE, and Inforworks ICM. Among them, the SWMM model has been widely used because of its characteristics of powerful functions, simple operation, and open source (Pells & Pells 2016; Behrouz et al. 2020; Dell et al. 2021). However, these numerical models usually involve complex hydrological and hydraulic processes, lots of parameters to be calibrated, and longer operation times that cannot meet the requirements in the timeliness of urban waterlog emergency response. Therefore, it is necessary to seek a new method to predict urban waterlogging risk rapidly and accurately, so as to reduce waterlogging disaster losses (Liu et al. 2022).

In recent years, data-driven models based on artificial intelligence algorithms have been successfully applied in hydrological forecasting (Araghinejad et al. 2021; Feng et al. 2022), rainfall and runoff simulation (Yang & Li 2012), waterlogging risk assessment (Cai et al. 2020; Wu et al. 2021), and so on. Compared with numerical models, data-driven models generally only consider data input and output and have excellent nonlinear mapping ability and calculation rate with simple steps, which can effectively satisfy the demand of urban waterlogging forecast in timeliness. As the widely used data-driven model, back propagation (BP) neural network model is a complex network structure formed by imitating the structure and function of a brain neural network and connecting a large number of processing units (neurons), so it can describe the relations between hydraulic elements of storm-waterlogging well (Gu et al. 2011; Yan et al. 2020). However, data-driven models need higher quality and quantity of data samples, which is often a difficult problem for workers. Therefore, this study gives a new idea to couple the numerical model with the data-driven model. The former can provide sufficient and accurate data samples and the latter is used to achieve an excellent computation rate, thus the urban waterlogging risk can be predicted accurately and efficiently.

Therefore, this paper first established a SWMM model to analyze the drainage performance and waterlogging risk under different rainfall conditions. Secondly, considering the continuity of the storm disaster-causing process and the rain pattern characteristics, five rainfall input patterns were set up. Then with the coupling of the BP neural network and the SWMM model (referred to as the BP-SWMM coupling model), the urban waterlogging risk prediction model was constructed to predict the waterlogging risk under different rainstorm scenarios.

Study area

Zhongyuan District (Figure 1) is located in the west of Zhengzhou City, Henan Province, China with a total area of 193 km2 and a resident population of 965,300, ranging from 34°42′30″ to 34°51′30″ north latitude, 113°27′30″ to 113°37′30″ east longitude. It belongs to the warm temperate semi-arid monsoon continental climate with an annual average precipitation of 640.9 mm, but the heavy rains are mainly concentrated in July and August. The terrain of the area is high in the west and low in the east. Zhengzhou Municipal People's Government is just in this district and its drainage system performs well, but it still cannot resist the rainstorm with great return periods. The surface runoff generated by rainstorms is discharged into the Jinshui River in the southeast, the Xushui River in the east, and the Jialu River in the northeast.
Figure 1

Location of the study area.

Figure 1

Location of the study area.

Close modal

Data sources

The basic data mainly includes rainfall, pipe network, DEM elevation, and land use data. The rainfall data came from the rainfall stations in Zhongyuan District from 2011 to 2018 with a 10-minute time interval. The pipe network data came from the Administration Bureau of Zhengzhou City, and its junctions and conduits data were obtained by ArcGIS software. The DEM elevation data with a resolution ratio of 12.5 m was downloaded from the National Aeronautics and Space Administration (http://www.nasa.gov). The land use data was obtained by processing Landsat-8 images downloaded from the Geospatial Data Cloud of the Chinese Academy of Sciences (http://www.gscloud.cn/). Using ArcGIS image classification technology, the maximum likelihood method was selected to classify the study area into seven types: cultivated land, grassland, forest land, water area, building, road, and unused land (Figure 2).
Figure 2

Land use types of the study area.

Figure 2

Land use types of the study area.

Close modal

SWMM model

The stormwater management model (SWMM) was developed by the US Environmental Protection Agency in 1971 and has been widely applied by many countries (Leutnant et al. 2019; Pachaly et al. 2021). The model consists of four modules: hydrological module (precipitation, evaporation, runoff), hydraulic module (pipe network, channel, surface ponding), water quality module (pollutants, erosion compound), and low impact development module (LID). In this study, only hydrology and one-dimensional hydrodynamic modules are involved, and three simulation processes are exhibited as follows.

Surface runoff-yield process

The SWMM model divides the sub-catchments into three categories: the permeable area A1, the impervious area with depressions A2, and the impervious area without depressions A3 (as shown in Figure 3). The runoff yield on A1 needs to deduct the depression detention and the initial loss; the runoff yield on A2 equals the precipitation minus the initial loss; and the runoff on A3 equals the precipitation minus the evaporation on it.
Figure 3

Division of sub-catchments underlying surface.

Figure 3

Division of sub-catchments underlying surface.

Close modal

Surface confluence process

The net rainfall process of each sub-catchment can be converted into the outflow process in the SWMM model. In this confluence process, three areas A1, A2, and A3 are approximately treated as nonlinear reservoirs, thus the principle of their outflow is shown in formula (1):
(1)
where V is the total precipitation in a sub-catchment, m3; R is the runoff depth, m; V is the area of a sub-catchment, m2; t is the rainfall duration, s; q is the rainfall intensity, m/s; and Q is the total flow rate, m3/s.

Pipeline confluence process

The pipeline confluence of the SWMM model is divided into constant flow, kinematic wave, and dynamic wave. The constant flow assumes that the water flow state in the pipeline is uniform and constant, which is inconsistent with the actual situation. The kinematic wave assumes that the slope of the water surface in the pipeline is the slope of the pipeline, which is limited to the simulation of pressure flow, backwater flow, and stagnant water, and only supports the simulation of tree pipe network. The dynamic wave can consider the loss of the inlet and outlet of the pipe and can simulate the pressure flow and some complicated and changeable water flow States in the pipeline. Therefore, the dynamic wave is selected to calculate the confluence of the pipeline confluence in this paper.

BP neural network

BP neural network is a classic feed-forward neural network, and its learning process includes two processes: forward propagation of signal and backpropagation of error (Svozil et al. 1997). When the signal propagates forward, the samples enter the network by the input layer, and output by the output layer after being processed by the hidden layer. When the error propagates back, the error between the output values and the target values is fed back to the hidden layer, so that the weight coefficient values of each layer are modified and then the square sum of the network error reaches the stated threshold.

In Figure 4, represent the input vectors of the neural network, respectively, indicates the weight values from the input layer to the hidden layer, indicates the weight values from the hidden layer to the output layer, represent the actual output values of the neural network.
Figure 4

Structure of neural network.

Figure 4

Structure of neural network.

Close modal
The input of each neuron in the hidden layer is:
(2)
The output of each neuron in the hidden layer is:
(3)
The hidden layer output vector is:
(4)
The input of each neuron in the output layer is:
(5)
The input vector of the output layer is:
(6)
The output of each neuron in the output layer is:
(7)
The output vector of the output layer is:
(8)
where is the weight values of the hidden layer; is the weight values of the output layer; is the activation function between hidden layers; is the activation function between output layers; is hidden layer threshold; and is the output layer threshold.

The error between the expected output value R and the actual output value is . If does not meet the settled convergence value, it will enter the reverse feedback process to modify the weights and thresholds between layers, and then conduct the next training; If satisfies, it will stop training.

Entropy weight method

The entropy weight method establishes an evaluation matrix for the evaluation system and standardizes the matrix to assign the weight of evaluation indicators. Assuming that there are n evaluation objects and p evaluation indexes in the matrix, an evaluation matrix A can be established as shown in formula (9):
(9)
where A is the evaluation matrix; b is standardized values of evaluation indexes; n is the number of evaluation objects; and p is the number of evaluation index.
The entropy Hi of the ith evaluation index is determined as shown in formula (10):
(10)
where is the entropy of evaluation indexes.
The entropy of each evaluation indicator is weighted , as shown in formula (11):
(11)
The standardized values and entropy weight of each evaluation object are linearly weighted to obtain the comprehensive coefficients of the evaluation objects, as shown in formula (12):
(12)
where p is the comprehensive coefficients of evaluation objects.

Set pair analysis

Set pair analysis is to construct a set pair for two related sets in an uncertain system, those with the same level are in identity, those with one level of difference are in discrepancy, and those with two levels of difference are in opposition. After the identity, discrepancy and opposition analysis are done, the degree of connection is used to describe the relationship of identity, discrepancy, and opposition of set pairs.
(13)
(14)
(15)
where is the degree of connection; S, F, and L are the number of set pairs in identity, discrepancy, and opposition, respectively; N is the total length of the set; the value range of i is [−1,1], in this paper, i is 0.5, and the value of j is often −1; a, b, and c are the degree of identity, discrepancy, and opposition, respectively.

Technical route

There are four steps to build the BP-SWMM coupling model: Firstly, considering the continuity and main rainfall features, several rainfall input patterns were set up and the input sample set of the BP neural network was finally obtained by combining the observed rainstorm data selected in the study area. Secondly, ArcGIS software was used to extract data information on the pipe network and sub-catchments and establish their spatial topological relationship to build the SWMM model. Thirdly, the rainfall samples were imported into the SWMM model, and the node risk under different rainfall conditions was gained as the output sample set of the BP network. This made it possible to build a BP neural network by learning and training the rainfall-node risk sample set. Finally, the trained neural network was applied to predict the risk of nodes in the study area. Then, areal node risk, elevation, population, and other factors were taken as the waterlogging evaluation indexes to quantify the regional waterlogging risk. By aforementioned steps, the BP-SWMM coupling model for predicting the regional waterlogging risk of different rainfall events was successfully constructed (Figure 5).
Figure 5

Technical route of the BP-SWMM coupling model.

Figure 5

Technical route of the BP-SWMM coupling model.

Close modal

SWMM model

Pipe network generalization

In the research, it was necessary to generalize the complex drainage pipe network (Zhang et al. 2022). Based on the present situation of the study area and the topological relationship of the pipe network, necessary junctions were added and branch pipes and secondary pipelines were deleted, etc., to generalize the pipe network (as shown in Figure 6).
Figure 6

Drainage pipe network of the study area.

Figure 6

Drainage pipe network of the study area.

Close modal

Sub-catchments delineation

There are two methods to divide sub-catchments: manual drawing and automatic delineation. In view of the large scope of the study area, it is not suitable to use manual delineation. So, the Thiessen Polygon Method was applied to delineate catchments by the analytical tool of GIS. The sub-catchments were finally obtained by cutting the detail part according to the practical requirements (as shown in Figure 6).

Parameters determination

The parameters such as average slope, area, and percentage of imperviousness of a catchment area were acquired by processing DEM and land use data. The characteristic width (Chen et al. 2013) adopted the formula , where A is the area of a sub-catchment. Horton infiltration formula was applied in the permeable area with an initial infiltration rate of 60 mm/h, the minimum infiltration rate of 3 mm/h, and the decay rate constant of 4 h−1. Other parameters were gained by referring to relevant documents (Li 2016).

Create INP file of the SWMM model

The SWMM model is saved in the form of an INP file, which can be opened in a notebook. However, the INP file in the form of a notebook is not convenient for simulation personnel to input and edit data. Using an Excel spreadsheet can better solve this problem. By combining the built-in tools in SWMM software with the Excel startup program, the SWMM model data can interact with Excel spreadsheet data.

SWMM model validation

Flow process verification
There are many outlets in the study area. However, not every outlet has a flow monitoring station. Outlet 1 and Outlet 2 in Figure 6 are two important outlets in the drainage system of the study area, which control the rainwater discharge of pipelines in the northwest and southeast of the study area, respectively. So, they were chosen to verify the flow process. The constructed SWMM model was used to simulate two historical rainstorms in Zhongyuan District on 12 August 2017 and 15 May 2018. Outlet 1 in Figure 6 was used to verify the flow process. As shown in Figure 7, the simulated discharge hydrographs are basically consistent with the measured, and the Nash–Sutcliffe efficiency (NSE) coefficients all exceed 0.9, indicating that the model can be applied to the subsequent research.
Figure 7

Comparison of observed and simulated outlet discharges of two historical rainstorms.

Figure 7

Comparison of observed and simulated outlet discharges of two historical rainstorms.

Close modal
Overflow points verification
The web crawler technology was adopted to extract ponding points in Zhengzhou City from 2014 to 2018, and the distribution and frequency of ponding points in the research zone are shown in Figure 8 is basically consistent with the position of the overflow points simulated by the SWMM model.
Figure 8

Distribution of overflow points and ponding points.

Figure 8

Distribution of overflow points and ponding points.

Close modal

BP neural network model

Data sample set acquisition

In this study, the risk of waterlogging nodes is the main basis for evaluating the risk of urban rainstorm waterlogging. Therefore, for the BP neural network to be constructed, the rainfall and node risk samples are the input set and output set of the model respectively to form the data sample set. The specific formation process of the rainfall-node risk sample set is as follows:

According to statistics, the main disaster-causing rainfall type of urban waterlogging in Zhengzhou City is short-duration rainstorms. Therefore, 400 short-duration rainstorms with multiple temporal and spatial distributions and various rainfall patterns were screened out as the rainfall sample set. Then, these 400 rainfall events were imported into the SWMM model in turn, and 400 waterlogging node risk samples were obtained as the output sample set of the BP neural network. The original format of the rainfall sample set was the rainfall process of 10-min intervals. Considering the continuity of the rainstorm disaster-causing process and the main characteristics of rain pattern (such as total precipitation, average rainfall intensity, peak-to-times ratio, peak precipitation, and peak coefficient), five rainfall input patterns were finally formed (Table 1), a total of 1,500 rainfall samples were gained as the input sample set of BP network. Since the rainfall samples of five input patterns derived from each original rainfall sample are all corresponding to the same node risk sample, a total of 1,500 rainfall-node risk samples were acquired as the data sample set of the BP network.

Table 1

Rainfall input patterns

Pattern 1Pattern 2Pattern 3Pattern 4Pattern 5
Accumulative precipitation and precipitation characteristics Accumulative precipitation Interval precipitation and precipitation characteristics Interval precipitation Precipitation characteristics 
Pattern 1Pattern 2Pattern 3Pattern 4Pattern 5
Accumulative precipitation and precipitation characteristics Accumulative precipitation Interval precipitation and precipitation characteristics Interval precipitation Precipitation characteristics 

Network model design

Relying on MATLAB software, the BP neural network was constructed by calling the newff function. And 90% of the data set as the training set and 10% as the test set were selected. The number of iterations was set at 1,000 and the learning rate was set at 0.01. The L-M optimization algorithm (trainlm) was chosen as the training function, the tangent sigmoid function (tansig) was chosen as the activation function between the hidden layers, and the linear function (purelin) was chosen as the activation function between the output layers. After many times of training, the number of hidden layers was 2, and the number of neurons in each layer was 3, 7.

Urban waterlogging risk status

Based on the formula of rainstorm intensity in Zhengzhou, as shown in Equation (16), the design storm process with return periods of 1 year, 3 years, 5 years, 10 years, and 20 years were deduced. These five rainfall events with different return periods were imported into the SWMM model to analyze the drainage performance and waterlogging risk in the study area:
(16)
where q is the design storm intensity, , P is the design storm recurrence period, and t is the design storm duration, .

Urban drainage performance analysis

As can be seen from Table 2, with the increase in rainfall return period, the amount of overload conduits and junctions gradually increases and the degree deepens. In a 1-year rainstorm, only a few conduits and junctions are overloaded; in a 3-year rainstorm, the number of overloaded conduits and junctions increases significantly; in a 5-year rainstorm, the overload rate of conduit exceeds 40%, and junction exceeds 33%; when the rainfall return period is more than 10 years, more than half of the conduits and junctions are overloaded. It can be found that the overall performance of the drainage system in the study area is poor, which cannot resist the rainstorms with a high return period.

Table 2

Overload number and rate of pipes and nodes in different rainfall return periods

Return periodNumber of overload pipesOverload rate of pipe (%)Number of overload nodesOverload rate of node (%)
1a 28 8.04 25 6.87 
3a 88 25.29 82 22.53 
5a 145 41.67 130 35.71 
10a 210 60.34 183 50.27 
20a 248 71.27 228 62.64 
Return periodNumber of overload pipesOverload rate of pipe (%)Number of overload nodesOverload rate of node (%)
1a 28 8.04 25 6.87 
3a 88 25.29 82 22.53 
5a 145 41.67 130 35.71 
10a 210 60.34 183 50.27 
20a 248 71.27 228 62.64 

Nodes risk analysis

Overload time, total overflow, and maximum ponding depth are three important feature values of junctions, which reflect the running status of junctions. They were chosen as indicators to comprehensively evaluate and quantify the node risk with the entropy weight method. According to the characteristics of nodes in the study area and the standards of Zhengzhou Municipal Drainage Management Department, the overloaded nodes can be divided into three grades by reviewing the literatures (Zhang 2017; Zhang et al. 2021): The junction risk value of 0–0.2 is a mild waterlog point, 0.2–0.5 is a moderate waterlog point, and greater than 0.5 is a severe waterlog point. Normally, when the junction risk value is less than 0.2, the ponding generated by a node can be discharged quickly without waterlogging; when the junction risk value is between 0.2 of 0.5, the ponding cannot be discharged timely, and it will stay on the ground for a period of time, so there is a certain risk of waterlogging; when the junction risk value is greater than 0.5, a large amount of water will continuously overflow from a junction, which is seriously overloaded and the ground is seriously waterlogged. The distribution of different grades of nodes under three recurrence intervals of rainstorms is shown in Figure 9.
Figure 9

Distribution of waterlogging nodes and waterlog-prone areas.

Figure 9

Distribution of waterlogging nodes and waterlog-prone areas.

Close modal

Regional risk analysis

By extracting ponding points in Zhengzhou and combining them with the simulation results of the SWMM model, it was found that there are eight obvious waterlog-prone areas in Zhongyuan District (Figure 9). In this paper, the natural disaster risk expression (risk = danger + vulnerability) proposed by Maskey (1989) was applied to assess the waterlogging risk in waterlog-prone areas. Danger is based on natural attributes, including disaster-causing factors and hazard-inducing environment. For urban waterlogging, disaster-causing factors refer to urban ponding caused by short-term heavy rainfall, the hazard-inducing environment is an environmental factor that causes waterlogging disasters to occur. Vulnerability is based on social attributes and reflects the degree of damage caused by disasters, including aspects such as human beings themselves, the economy, and the urban transportation system. The node risk, areal slope, elevation, and impermeability were chosen as dangerous indexes and the population, point of interest, and road network were chosen as vulnerable indexes. After normalization of the values of each indicator, the weight of each index was determined by the entropy weight method and the risk values of each waterlog-prone area were acquired by weight sum calculation. By using the K-means Clustering Method, the waterlogging risk levels of waterlog-prone areas were classified, and the results show that the risk values of waterlog-prone areas are 0 ∼ 0.1, 0.1 ∼ 0.3, 0.3 ∼ 0.5, 0.5 ∼ 0.7, and 0.7 ∼ 1, grade I, grade II, grade III, grade IV, and grade V. Among them, grade I represents the lowest risk and grade V represents the highest risk. The risk values of eight waterlog-prone areas under diverse precipitation scenarios are shown in Table 3.

Table 3

Waterlogging risk value of each waterlog-prone area

Return periodWaterlog-prone area 1Waterlog-prone area 2Waterlog-prone area 3Waterlog-prone area 4Waterlog-prone area 5Waterlog-prone area 6Waterlog-prone area 7Waterlog-prone area 8
1a 0.021 0.052 0.099 0.037 0.094 0.015 
3a 0.026 0.120 0.263 0.056 0.342 0.180 0.403 0.116 
5a 0.177 0.294 0.463 0.175 0.540 0.320 0.627 0.305 
10a 0.359 0.437 0.656 0.473 0.690 0.572 0.754 0.587 
20a 0.619 0.732 0.874 0.684 0.913 0.825 0.921 0.803 
Return periodWaterlog-prone area 1Waterlog-prone area 2Waterlog-prone area 3Waterlog-prone area 4Waterlog-prone area 5Waterlog-prone area 6Waterlog-prone area 7Waterlog-prone area 8
1a 0.021 0.052 0.099 0.037 0.094 0.015 
3a 0.026 0.120 0.263 0.056 0.342 0.180 0.403 0.116 
5a 0.177 0.294 0.463 0.175 0.540 0.320 0.627 0.305 
10a 0.359 0.437 0.656 0.473 0.690 0.572 0.754 0.587 
20a 0.619 0.732 0.874 0.684 0.913 0.825 0.921 0.803 

As can be seen from Table 3 and Figure 9 that with the growth of rainstorm recurrence interval, the risk values of each waterlog-prone area increase in varying degrees. Most of these waterlog-prone areas are located at the commercial areas or intersections of metropolitan zones in Zhongyuan District, and their terrains are relatively low.

It is not difficult to find that the waterlogging risk in the north of the study area is lower than that in the south. The reason is that the north of Zhongyuan District is a new city with low development degree, low impermeability of underlying surface, and better rainwater storage capacity. Based on the situation of the pipe network, although the density of the drainage conduits in the southern old city is high, the pipe diameter is smaller and the conduits are mostly aging and in bad repair with insufficient capacity. Consequently, the risk of waterlogging in the southern old city is higher when encountering heavy rains.

BP-SWMM coupling model construction and verification

Comparison of prediction results of different rainfall input patterns

In this paper, the degree of connection of set pair analysis was adopted to evaluate the accuracy of BP model. The test set samples corresponding to five rainfall input patterns were respectively imported into the trained BP network to obtain the prediction values of node risk, and made it and the SWMM real values be set pairs for set pair analysis (Zhao & Xuan 1996). The predictive value and real value sequences were noted as set Ri (i = 1,2,3,4,5) and P, respectively (for example, R1 indicates the predictive values sequence obtained by pattern 1, R2 indicates pattern 2, and so on). Their errors (Figure 10) were critically classified, the amount of set pairs in identity, discrepancy, and opposition were counted, and the degree of identity, discrepancy, opposition, and connection of each set pair were calculated, and the results are shown in Table 4.
Table 4

Results of set pair analysis

Set pairThe degree of identityThe degree of discrepancyThe degree of oppositionThe degree of connection
(R1, P0.78 0.18 0.05 0.82 
(R2, P0.73 0.23 0.04 0.80 
(R3, P0.70 0.26 0.04 0.79 
(R4, P0.72 0.22 0.06 0.77 
(R5, P0.68 0.25 0.07 0.74 
Set pairThe degree of identityThe degree of discrepancyThe degree of oppositionThe degree of connection
(R1, P0.78 0.18 0.05 0.82 
(R2, P0.73 0.23 0.04 0.80 
(R3, P0.70 0.26 0.04 0.79 
(R4, P0.72 0.22 0.06 0.77 
(R5, P0.68 0.25 0.07 0.74 
Figure 10

Error of prediction results of each input pattern.

Figure 10

Error of prediction results of each input pattern.

Close modal
It can be found from Table 4 and Figure 11 that the neural network model effects of each input pattern are ranked in order from best to worst as pattern 1 > pattern 2 > pattern 3 > pattern 4 > pattern 5. When the accumulated precipitation is taken as the input pattern, the degree of connection is significantly improved by 3.8% compared with the interval precipitation, because the continuance of the rainstorm disaster-causing process is considered. The accuracy of the model with two combinations as input patterns (interval precipitation and precipitation characteristics, accumulative precipitation, and precipitation characteristics) are higher than that before combining, and the degree of connection is improved by 2.5 and 2.6%, respectively. This is because the rainfall process is often complex and changeable in the actual rainfall scenarios, and the accumulative precipitation and interval precipitation cannot cover all the information of a rainstorm, while the main features of the rain pattern are just a powerful supplement. Considering the above, this research selects accumulative precipitation and precipitation characteristics as the input pattern of the model.
Figure 11

Results of set pair analysis.

Figure 11

Results of set pair analysis.

Close modal

Node risk predictive results

Waterlog-prone area 7 is located at Zhongyuan Road of West 2nd Ring in Zhengzhou City. There are big shopping malls, hospitals, schools, and the population density is concentrated and the terrain is low. Once waterlogging happens, it will cause huge economic losses and safety risks. Therefore, taking waterlog-prone area 7 as an example, this paper establishes BP models for four nodes separately in this zone, and the predictive results of each node risk are shown in Figure 12.
Figure 12

Prediction results of each model test set.

Figure 12

Prediction results of each model test set.

Close modal

Figure 12 shows that the predictive value curves of the BP model are in good agreement with the true value curves of the SWMM model, which preliminarily illustrates that the prediction model has acquired a favorable effect. So as to further prove the rationality of the prediction model, the mean absolute error (MAE), root mean square error (RMSE), and NSE coefficients are introduced for analysis, as shown in Table 5.

Table 5

Evaluation index values of BP model at each node

Evaluation indexABCD
NSE 0.943 0.948 0.980 0.965 
MAE 0.040 0.038 0.024 0.025 
RMSE 0.059 0.054 0.032 0.039 
Evaluation indexABCD
NSE 0.943 0.948 0.980 0.965 
MAE 0.040 0.038 0.024 0.025 
RMSE 0.059 0.054 0.032 0.039 

In Figure 12, it is found that the overall fitting errors of the points on the upper part of the predictive value curves and true value curves are lower than that on the lower part, that is, the predictive accuracy of the nodes with higher risk is better than that for nodes with lower risk. To confirm this conjecture, the mean relative error (MRE) of various junctions in Figure 12(b), which is less obvious, was calculated. As shown in Figure 12(b), the amounts of mild waterlog points, moderate waterlog points, and severe waterlog points are: 24, 11, and 5, respectively, in the predictive results of the test set samples, and the MRE values are 20.4, 10.1, and 5.2%, respectively. Compared with the moderate waterlog points, the MRE of forecasting severe waterlog points reduces by 4.9%, and compared with the mild waterlog points, it reduces by 15.2%. Therefore, the BP model has the highest accuracy for the prediction of severe waterlog points, that is, it has better applicability for junctions with higher risk levels.

Since there are no other nodes in the waterlog-prone area 7 except these four nodes,when the risks of the four nodes are known, the waterlogging risk value of the waterlog-prone area 7 can be acquired by the calculation method of regional waterlogging risk mentioned above. Therefore, the mapping relationship between rainfall and regional waterlogging risk is successfully established, which means the building of the BP-SWMM coupling model has been completed.

Regional risk predictive results

The established BP-SWMM coupling model was used to predict the waterlogging risk of the waterlog-prone area 7 under the five-return period rainfall conditions, the predictive results of the risk values of the four nodes are shown in Table 6.

Table 6

Node risk values of waterlog-prone area 7

Return periodRainfall timeABCD
1a 1 h 30 min 0.118 0.082 0.106 0.046 
3a 2 h 0.360 0.261 0.337 0.242 
5a 2 h 30 min 0.517 0.509 0.552 0.403 
10a 3 h 0.734 0.715 0.763 0.657 
20a 3 h 30 min 0.908 0.877 0.920 0.816 
Return periodRainfall timeABCD
1a 1 h 30 min 0.118 0.082 0.106 0.046 
3a 2 h 0.360 0.261 0.337 0.242 
5a 2 h 30 min 0.517 0.509 0.552 0.403 
10a 3 h 0.734 0.715 0.763 0.657 
20a 3 h 30 min 0.908 0.877 0.920 0.816 

From Table 6, the risk values of the four junctions in the waterlog-prone area 7 under the 1-year, 3-year, 5-year, 10-year and 20-year rainstorm events are known, then by further calculation of the entropy weight method with other indexes, such as elevation, impermeability, population, and so on, the waterlogging risk values were acquired: 0.110, 0.368, 0.601, 0.730, and 0.898, the MRE values are 17, 8.7, 4.1, 3.2, and 2.5%, respectively. The MRE values in these five events are all smaller, indicating that the predictive results are outstanding and the coupling model has an excellent effect on regional risk prediction. The MRE decreases with the increase of the regional risk level, which is consistent with the previous conclusion.

In terms of prediction timeliness, to calculate the waterlogging risk values of the waterlogging area 7 under a rainfall event, the BP-SWMM coupling model took only 0.1 s. However, the SWMM model took 172, 195, 217, 245, and 270 s to run the five rainstorm events of 1 year, 2 years, and 3 years, respectively, and it will take longer to export and process the running results. The calculation speed of the coupling model is thousands of times faster than that of the numerical model. If the research range expands and the pipes and nodes increase, the calculative time of the numerical model will also increase, and the calculative efficiency cannot satisfy the real-time simulation demands (Gui et al. 2021). The method suggested in this paper has high simulative accuracy and fast running speed, which can greatly meet the needs of urban waterlog emergency work.

In this paper, a SWMM model and a BP-SWMM coupling model were established for waterlogging risk analysis and prediction in the study area. The main conclusions are as follows:

  • (1)

    The drainage system in the research zone has insufficient ability to resist rainstorms in the high recurrence intervals; with the increase of the rainfall return period, the risk level of each waterlog-prone area presents a transition trend from low to high, and the overall waterlogging risk of the southern old city is higher than that of the northern new city.

  • (2)

    When pattern 1 (accumulative precipitation and precipitation characteristics) is taken as the input of the BP-SWMM coupling model, the model has the best effect, which is 2.5% higher than pattern 2 (accumulative precipitation) and 6.5% higher than pattern 4 (interval precipitation).

  • (3)

    In terms of urban waterlogging prediction, the coupling model can not only satisfy the prediction accuracy (its NSE coefficient has reached above 0.9); but also improve the prediction timeliness (its calculation speed is thousands of times higher than that of the numerical model, which can effectively satisfy the urban waterlog emergency work).

This research is supported by the National Key R&D Program of China (Grant No. 2021YFC3200205) National Natural Sciences Foundation of China (Grant No. 52379028) and Natural Sciences Foundation of Henan Province (Grant No. 212300410404).

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Araghinejad
S.
,
Azmi
M.
&
Kholghi
M.
2021
Application of artificial neural network ensembles in probabilistic hydrological forecasting
.
Journal of Hydrology
407
(
1–4
),
94
104
.
https://doi.org/10.1016/j.jhydrol.2011.07.011
.
Behrouz
M. S.
,
Zhu
Z. D.
,
Matott
L. S.
&
Rabideau
A. J.
2020
A new tool for automatic calibration of the Storm Water Management Model (SWMM)
.
Journal of Hydrology
581
,
124436
.
https://doi.org/10.1016/j.jhydrol.2019.124436
.
Berkhahn
S.
,
Fuchs
L.
&
Neuweiler
I.
2019
An ensemble neural network model for real-time prediction of urban floods
.
Journal of Hydrology
575
,
743
754
.
https://doi.org/10.1016/j.jhydrol.2019.05.066
.
Cai
Z. M.
,
Li
D. M.
&
Deng
L. B.
2020
Risk evaluation of urban rainwater system waterlogging based on neutral network and dynamic hydraulic model
.
Journal of Intelligent & Fuzzy Systems
,
5661
5671
.
https://doi.org/10.3233/JIFS-189045
.
Chen
X. Y.
,
Zhang
N.
,
Wu
F.
&
He
B.
2013
Principle, parameters and application of storm flood management model SWMM
.
China Water & Wastewater
29
(
4
),
4
7
.
Chen
Z. P.
,
Li
K.
,
Du
J. H.
,
Chen
Y.
,
Liu
R. G.
&
Wang
Y.
2021
Three-dimensional simulation of regional urban waterlogging based on high-precision DEM model
.
Natural Hazards
108
(
3
),
2653
2677
.
https://doi.org/10.1007/s11069-021-04793-8
.
Dell
T.
,
Razzaghmanesh
M.
,
Sharvelle
S.
&
Arabi
M.
2021
Development and application of a SWMM-based simulation model for municipal scale hydrologic assessments
.
Water
13
(
12
),
1644
.
https://doi.org/10.3390/w13121644
.
Feng
Z. K.
,
Shi
P. F.
,
Yang
T.
,
Niu
W. J.
,
Zhou
J. Z.
&
Cheng
C. T.
2022
Parallel cooperation search algorithm and artificial intelligence method for streamflow time series forecasting
.
Journal of Hydrology
606
,
127434
.
https://doi.org/10.1016/j.jhydrol.2022.127434
.
Gu
J.
,
Qin
X.
,
Li
W.
,
Chen
W.
&
Ma
D.
2011
Analysis on evolution of Hengsha Passage in the Yangtze River Estuary with BP artificial neural network
.
International Conference on Bioinformatics and Biomedical Engineering
.
https://doi.org/10.1109/icbbe.2011.5780808
.
Gui
H. L.
,
Zhang
C. P.
,
Wu
Z. G.
&
Liu
C.
2021
Comparison of artificial neural network and SWMM applied in rainfall runoff simulation
.
China Water & Wastewater
37
(
13
),
108
112
.
Leutnant
D.
,
Doering
A.
&
Uhl
M.
2019
swmmr – an R package to interface SWMM
.
Urban Water
16
(
1–2
),
68
76
.
https://doi.org/10.1080/1573062X.2019.1611889
.
Li
S. H.
2016
Flood Risk Analysis and Waterlogging Simulation in Zhengzhou City
.
Dissertation
,
Zhengzhou University
,
Zhengzhou
,
China
.
Liu
H.
,
Hao
Y.
,
Zhang
W. H.
,
Zhang
H. Y.
,
Gao
F.
&
Tong
J. P.
2021
Online urban waterlogging monitoring based on recurrent neural network for classification of microblogging text
.
Natural Hazards and Earth System Sciences
21
(
4
),
1179
1194
.
https://doi.org/10.5194/nhess-21-1179-2021
.
Liu
Y. Y.
,
Liu
Y. S.
,
Zheng
J. W.
,
Chai
F. X.
&
Ren
H. C.
2022
Intelligent prediction method for waterlogging risk based on AI and numerical model
.
Water
14
(
15
),
2282
.
https://doi.org/10.3390/w14152282
.
Maskrey
A.
1989
Disaster Mitigation: A Community Based Approach
.
Oxfam
,
Oxford
,
England
.
Pachaly
R. L.
,
Vasconcelos
J. G.
&
Allasia
D. G.
2021
Surge predictions in a large stormwater tunnel system using SWMM
.
Urban Water Journal
18
(
8
),
577
584
.
https://doi.org/10.1080/1573062X.2021.1916828
.
Pells
S. E.
&
Pells
P. J. N.
2016
Application of Dupuit's equation in SWMM to simulate baseflow
.
Journal of Hydrologic Engineering
21
(
1
),
06015009
.
https://doi.org/10.1061/(ASCE)HE.1943-5584.0001245
.
Svozil
D.
,
Kvasnicka
V.
&
Pospicha
J.
1997
Introduction to multi-layer feed-forward neural networks
.
Chemometrics and Intelligent Laboratory Systems
39
(
1
),
43
62
.
https://doi.org/S0169-7439(97)00061-0
.
Wang
X. Q.
,
Kingsland
G.
,
Poudel
D.
&
Fenech
A.
2019
Urban flood prediction under heavy precipitation
.
Journal of Hydrology
577
,
123984
.
https://doi.org/10.1016/j.jhydrol.2019.123984
.
Wei
M.
,
She
L.
&
You
X. Y.
2020
Establishment of urban waterlogging pre-warning system based on coupling RBF-NARX neural networks
.
Water Science & Technology
82
(
9
),
1921
1931
.
https://doi.org/10.2166/wst.2020.477
.
Wu
Z. H.
,
Zhou
Y. H.
,
Wang
H. L.
&
Jiang
Z.
2020
Depth prediction of urban flood under different rainfall return periods based on deep learning and data warehouse
.
Science of The Total Environment
716
(
2
),
137077
.
https://doi.org/10.1016/j.scitotenv.2020.137077
.
Wu
M. M.
,
Wu
Z. N.
,
Ge
W.
,
Wang
H. L.
,
Shen
Y. X.
&
Jiang
M. M.
2021
Identification of sensitivity indicators of urban rainstorm flood disasters: a case study in China
.
Journal of Hydrology
599
,
126393
.
https://doi.org/10.1016/j.jhydrol.2021.126393
.
Xu
H. J.
2021
Research on modeling method of urban rain and flood simulation based on SWMM model
.
Water Resources Planning and Design
97
(
9
),
44
49
.
Xu
Z.
,
Chen
H.
,
Ren
M.
&
Cheng
T.
2020
Progress on disaster mechanism and risk assessment of urban flood/waterlogging disasters in China
.
Advances in Water Science
31
(
5
),
713
724
.
https://doi.org/1001-6791(2020)31:5<713:ZGCSHL>2.0.TX;2-S
.
Xue
F. C.
,
Huang
M. M.
,
Wang
W.
&
Zou
L.
2016
Numerical simulation of urban waterlogging based on flood area model
.
Advances in Meteorology
2016
,
1
9
.
https://doi.org/10.1155/2016/3940707
.
Yan
X.
,
Chang
Y.
,
Yang
Y.
&
Liu
X.
2020
Monthly runoff prediction using modified CEEMD-based weighted integrated model
.
Journal of Water and Climate Change.
12
(
5
),
1744
1760
.
https://doi.org/10.2166/wcc.2020.274
.
Yang
X. H.
&
Li
Y. Q.
2012
Runoff simulation using artificial intelligent techniques
.
International Conference on Remote Sensing
.
https://doi.org/10.1109/RSETE.2012.6260721
.
Yoon
S. S.
&
Nakakita
E.
2015
Application of an X-band multiparameter radar network for rain-based urban flood forecasting
.
Journal of Hydrologic Engineering
22
(
5
),
E5015005
.
https://doi.org/10.1061/(ASCE)HE.1943-5584.0001281
.
Zhang
Y.
2017
Study on Rainstorm and Flood Warning Method in Zhengzhou City
.
Dissertation
,
Zhengzhou University
,
Zhengzhou
,
China
.
Zhang
J. P.
,
Zhang
H. R.
&
Fang
H. Y.
2021
Urban waterlogging simulation and rainwater pipe network system evaluation based on SWMM and SCS method
.
South-to-North Water Transfers and Water
20
(
01
),
110
121
.
https://doi.org/10.13476/j.cnki.nsbdqk
.
Zhang
H.
,
Zhang
J. P.
,
Fang
H. Y.
&
Yang
F.
2022
Urban flooding response to rainstorm scenarios under different return period types
.
Sustainable Cities and Society
87
,
104184
.
https://doi.org/10.1016/j.scs.2022.104184
.
Zhao
K. Q.
&
Xuan
A. L.
1996
Set pair theory – a new uncertainty theory method and application
.
Systems Engineering
12
(
1
),
18
23
.
Zhou
Y. H.
,
Zn
W.
,
Xu
H. S.
&
Wang
H. L.
2022
Prediction and early warning method of inundation process at waterlogging points based on Bayesian model average and data-driven
.
Journal of Hydrology
44
,
101248
.
https://doi.org/10.1016/j.ejrh.2022.101248
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/).