## Abstract

The study compares an annual number of weir overflows calculated using a hydrodynamic model by continuous simulations and a probabilistic model. The weir outflow for a single precipitation event was successfully modelled using logistic regression. Performed numerical experiments showed that the calculated number of weir outflows with the hydrodynamic model falls within confidence intervals of the probabilistic model. This suggests that the model of the logistic regression can be used in practice. The probabilistic simulations revealed that a model with a probabilistic description of a number of annual precipitations and a model with an assumed average number of such events are not consistent. The proposed methodology can be applied for the design of outflow weirs and other storm devices.

## INTRODUCTION

Growing urbanization, climate change and extending terms of running of stormwater systems cause an essential increase in the amount of stormwater pouring on the surface area and the number of discharges by storm overflows (Doglioni *et al*. 2009). That leads to an increase in the level of the water table in receivers, to deterioration of the quality of the receivers' waters and to failure of their biological and chemical balance. Therefore, when designing storm overflows, the criteria for protection of waters against pollution should be taken into account; those criteria are expressed among other means by the permitted number of annual stormwater discharges (Rauch *et al*. 1998; Mantegazza *et al*. 2010).

Multiannual results of measurements which are usually used to develop regression models may constitute the basis for assessment of the number of stormwater discharges. However, since these models are of a local nature, their scope of application is limited and they can be used only for estimation operations (Veldkamp & Wiggers 1997; Fidala-Szope *et al*. 1999). A number of studies have therefore been undertaken to develop methods of a universal nature, but due to the simplifications they have adopted, the result received could be significantly flawed (Benoist & Lambertus 1989; Urcikán & Rusnák 2006). At present, the basis for the multiplicity of the operation of a storm overflow is usually continuous hydrodynamic simulations (Price 2000; Thorndahl 2009; Andrés-Doménech *et al*. 2010) made by means of a calibrated model on the basis of multiannual rainfall time series (DWA-A 118E 2006).

Due to strong interactions between parameters describing the characteristics of catchment areas considered in the models, there are problems with determining the values of empirical parameters influencing a satisfactory mapping of processes that take place during the stormwater outflow. The high cost of measuring rainfall and stormwater flows as well as problems with model calibration make the approaches described above not always economically justified. An additional problem is the access to multiannual continuous rainfall measurements of high resolution (DWA-A 118E 2006).

Therefore, probabilistic models were developed for the analysis of operation of storm overflows; in the models the necessity for continuous simulations was eliminated and statistical methods, like for example FORM (first order reliability method), were used for forecasting the occurrence of a stormwater discharge. The estimation of parameters in this method requires the implementation of complex numerical algorithms, so its application to model stormwater systems operation is limited (Thorndahl & Willems 2008). At the same time, in the methods, the stochastic nature of the number of rainfall events in the annual cycle has not been taken into account which may have a significant impact on the simulation results.

In view of the above considerations, it is appropriate to continue the search for models for forecasting the annual number of discharges by storm overflows. Those models should take into account the complex nature of surface runoff and the stochastic nature of rainfall and they should reduce the costs of rainfall and stormwater flow measuring; the estimation of their parameters should be relatively simple. Moreover, these models should not be highly complex for only then would they be successfully applied in engineering practice. In the paper, an attempt was made to develop a model in which a logistic regression model was used to assess the occurrence of a discharge in a storm overflow. This way hydrodynamic modelling was unnecessary. The Iman-Conover (IC) method was used to simulate the rainfall event, assuming that the relationship between the analysed variables describing the operation of the overflow could be determined by the Spearman correlation factor. At the same time, two solutions were considered in the paper: the first of them adopted an average annual number of rainfall events which is common practice in probabilistic models (Osorio *et al*. 2009; Fu & Butler 2014; Szeląg & Bąk 2017). In the latter case, it was assumed that the number of rainfall events per year is random and described by a uniform distribution which in this type of deliberation constitutes an authorial approach.

## OBJECT OF INVESTIGATION

The subject of the analysis is an urban catchment area located in the south-eastern part of Kielce in Poland (Figure 1). The total surface area of the catchment concerned is 62 ha. The analysed catchment area is covered by housing estates, public utility buildings and main and side streets. Sealed areas extend over an area of 29.26 ha (with a retention height of 2.50 mm) and unsealed ones over an area of 32.74 ha (with a retention height of 5 mm) and the weighted average field retention value of the catchment is equal to *d _{r}* = 3.81 mm (Szeląg

*et al*. 2016). The separate sewer system designed ensures complete drainage of the existing infrastructure. A detailed description of the urban catchment area can be found in the paper of Szeląg

*et al*. (2013).

The sewer system is built of 200 sewage wells, 100 pipe sections with a total length of 22 km, one storm overflow and two outlets. The roughness coefficient of Manning regarding the stormwater drainage pipes changes between 0.013–0.018 m^{−1/3}·s (Szeląg *et al*. 2016). Rain waters from the catchment area flow out by the collectors Φ1.25 m to the receiver – Silnica River, and they are previously treated in the stormwater treatment plant (Figure 1). If the filling of the diversion chamber (DC) does not exceed 0.42 m, then the entire stormwater via four pipes Φ0.400 m is discharged to the treatment plant. In the first place it is sent to the horizontal settling tank (SETT) and then a part of it is transported through a Φ0.20 m pipe to the coalescing separator (SEP) and to the measuring chamber (MC), from which the treated stormwater is introduced into the receiver. When the filling of the DC exceeds 0.42 m, the stormwater is discharged via the storm overflow (OV) to the Φ1.25 m discharge channel (S2), from which it is directly discharged to Silnica River.

In the framework of continuous measurements carried out in 2009–2011, a filling probe was installed in the DC and an ultrasonic flow meter (MES) was placed on the Φ1.25 m pipe to the DC at a distance of 3.0 m from the collector outlet (S1).

The investigated urban catchment area has already been the subject of numerous studies (Szeląg *et al.* 2013; Szeląg & Bąk 2017), which dealt with forecasting the quantity and quality of stormwater effluent on the outflow from the catchment area and the operation of the stormwater treatment plant and storm overflow by means of a hydrodynamic model developed in the SWMM programme (Storm Water Management Model). The hydrodynamic model of the catchment area and of the DC used in the paper has been previously calibrated and an analysis of its sensitivity and uncertainty was performed using the GLUE + GSA method (Szeląg *et al*. 2016).

## RAINFALL DATA

The evaluation of storm overflow operation and determination of independent rainfall events were based on rainfall data gained from a rainfall station (collector S1 in Figure 1) located 2 km away from the catchment basin and established in 2008. A minimum antecedent period of 4 h (DWA-A 118E 2006) was used for the calculation. In the work, a rainfall event was assumed as a fall episode when the total minimum precipitation depth is not less than 3.0 mm (Fu *et al*. 2011; Fu & Kapelan 2013). The duration of rainfall (*t _{d}*) of observed events was between 20 min and 2.366 min, while the antecedent period was in the range of 0.16–60 days. The total depth of rainfall in precipitation events varied between 3.0 mm and 45.2 mm, while the maximum rainfall depth in the precipitation events lasting 15, 30 or 45 minutes was respectively

*P*

_{t}_{=15}= 0.3–20.4 mm,

*P*

_{t}_{=30}= 0.6–25.4 mm and

*P*

_{t}_{=45}= 1.1–28.5 mm.

On the basis of the observed rainfall in the period 2008–2017, independent rainfall events were determined. The number of rainfall events (*N*) in individual years varied from 36 to 58, while their average value was equal to 45 episodes. Using the methodology to identify precipitation events described above, 463 precipitation episodes were selected in the available data series. The selected rainfall events have been parameterized by: total precipitation depth (*P _{c}*); event duration (

*t*); maximum rainfall depth in the rainfall events lasting respectively 15, 30 or 45 minutes (

_{d}*P*

_{t}_{=15},

*P*

_{t}_{=30},

*P*

_{t}_{=45}); the variability of these characteristics was described by empirical distributions. Then, in order to obtain the best possible theoretical and empirical data matching, the following statistical distributions (Adams & Papa 1999; Bacchi

*et al*. 2008; Andrés-Doménech

*et al*. 2010) were considered: Weibull, chi-square statistic, exponential, Generalized Extreme Value (GEV), Gumbel, Fisher-Tippet, gamma, log-normal, Pareto and beta. The Kolmogorov-Smirnov test was used to assess the conformity of the empirical and theoretical distributions.

## METHODOLOGY

The article provides a probabilistic model for forecasting the annual number of stormwater discharges in a storm overflow facility under investigation. It should be clearly stated here that the model developed and examined in the paper does not concern forecasting the flow of stormwater in the drainage system, but concerns forecasting the events consisting of stormwater overflows occurring in the storm overflow installed on a pipe of that system. The models used to forecast the stormwater flow itself were presented in detail in the works by Szeląg *et al*. (2018). The model developed in the paper assumes that the operation of the overflow is random and depends on the variability of rainfall in the annual cycle. The logistic regression model was used for forecasting storm overflow operation, while the Monte Carlo (MC) method was used for simulation of rainfall series. If the storm overflow operation was explained by only one variable or the explained variables were independent, then MC sampling was performed by means of theoretical distributions describing rainfall characteristics, as presented in works by Adams & Papa (1999) and Szeląg & Bąk (2017). Where the analysed explanatory variables *x _{i}* were correlated, and the relationship between the variables characterizing rainfall was determined by calculating the correlation coefficient of Spearman

*R*. Assuming that this is a sufficient description of the relationship between the variables, rainfall events were generated using the Iman & Conover (1982) method, described below. The calculation diagram of the model proposed in the paper is presented in Figure 2.

From the diagram presented, the calculation of the annual number of discharges by storm overflow is carried out in the following stages:

identification of parameters (

*α*) and explanatory variables (_{i}*x*– precipitation characteristics) in the logit model,_{i}evaluation of interdependencies between explanatory variables (

*x*) based on the Spearman correlation coefficient (_{i}*R*),*T*– th simulation of*N*rainfall events in the synthetic rainfall data series by means of Monte Carlo method,*M*– th simulation of precipitation characteristics (*x*) of rain in the synthetic rainfall data series by means of Monte Carlo method, taking into account the correlation; the analyses conducted assume_{i}*N**=**M**=*5000,determination of the number of discharges by storm overflow in modelled synthetic rainfall series based on Equation (1) (see below),

determination of the empirical curve describing the probability of exceeding the annual number of discharges in storm overflow (CDF – cumulative distribution function).

## LOGISTIC REGRESSION

*et al*. 2001). Moreover, this model is increasingly being used to assess the performance of biological reactors in wastewater treatment plants (Bayo

*et al*. 2006), in assessing the technical condition of water supply and stormwater pipes (Tran

*et al*. 2007) and in the sensitivity analysis of statistical models (Bartkiewicz

*et al.*2016). The logistic model discussed in the paper is a special case of a generalized linear model of the following form: in which the probability (

*P*) of occurrence of

*s*– stormwater discharges in the annual rainfall data series (

*N*) is described by the binomial distribution: where

*α*

_{0}– free term,

*α*– empirical coefficients determined by the maximum likelihood method,

_{i}*x*– variables, i.e.: total precipitation depth (

_{i}*P*), rainfall duration (

_{c}*t*), antecedent period (

_{d}*t*), maximum rainfall depth in rainfall events lasting 15, 30 and 45 minutes (

_{an}*P*

_{t}_{=15},

*P*

_{t}_{=30},

*P*

_{t}_{=45}),

*s*– annual number of overflow discharges,

*N*– number of rainfall events per year,

*p*– probability of the occurrence of a discharge in the storm overflow in

_{i}*i*-th rainfall event.

*p*calculated by Equation (1) will not be less than 0.50, which corresponds to the condition of the form: with the multidimensional space of variables (

*x*) that account for stormwater discharge.

_{i}The variables explaining the rainfall characteristics (*P _{c}*,

*t*,

_{d}*t*,

_{an}*P*

_{t}_{=15},

*P*

_{t}_{=30},

*P*

_{t}_{=45}) during the considered rainfall episodes were used for the analysis of the overflow operation. Only statistically significant variables at the adopted confidence level (

*α*= 0.95) are included in the statistical model. Quality of the model, i.e. adjustment of calculation results to the measurements data in the logistic regression model, has been determined on the basis of the following coefficients: sensitivity (

*SENS*, %), specificity (

*SPEC*, %) and reckoning error (

*R*) (Bartkiewicz

_{z}^{2}*et al.*2016).

In order to estimate *α _{i}* parameters in the logit model, the results of measurements data concerning storm overflow operation carried out in the years 2009–2011 were used, where overflow discharge occurred in 69 cases in 188 precipitation events, and 42 overflow discharges occurred (2012–2014) in 93 cases of the maximum filling of the DC. The model was validated using 10 independent rainfall events.

## RAINFALL EVENT GENERATOR

In this paper, if the values characterizing rainfall (*P _{c}*,

*t*,

_{d}*t*,

_{an}*P*

_{t}_{=15},

*P*

_{t}_{=30},

*P*

_{t}_{=45}) and also the explanatory variables in the logit model show significant dependence, then for its mathematical description the value of Spearman correlation coefficient was used (Iman & Conover 1982). The variability of the rainfall characteristics defined in the form of boundary distributions is described by the functions of theoretical distributions. Based on the established correlation coefficients and boundary distributions, individual events were randomly sampled using the IC method.

The IC method is an algorithm commonly used in the Monte Carlo method. This method is implemented in several statistical packages (Risk, Crystal Ball, STATISTICA), which makes it available to a wide range of users. The method can be properly used when the following conditions are met (Wu & Tsang 2004):

Mean values (

*μ*_{1,}*μ*_{2}, …_{,}*μ*_{i})and standard deviations (σ_{s}_{1,}σ_{2}, …_{,}σ_{i})of individual variables in the predicted rainfall data series do not differ more than_{s}*ɛ*< 5% from relevant values of the theoretical distributions, where*ɛ*is the relative difference between the modelled and measured value.The empirical distributions of the modelled variable values (

*x*) are in line with theoretical distributions; for the verification of this condition it is recommended to use the Kolmogorov-Smirnov test._{i}The correlation coefficient (

*R*) between individual dependent variables (*x*) obtained for data received from the MC simulation do not differ more than about_{i}*ɛ*<5% from the_{c}*R*value calculated for empirical data.

A detailed description of the assumptions and theoretical considerations concerning the IC method can be found in the works of Iman & Conover (1982), Wu & Tsang (2004) and Tarpanelli *et al*. (2012).

## RESULTS

Using the hydrodynamic model of the water catchment with the DC, continuous simulation of storm overflow operation based on continuous measurements series of rainfall from 2008 to 2017 was performed. On the basis of the calculations performed (Table 1), the annual number of discharges by storm overflow located in the DC was determined and compared with the results of measurements collected from 2008 to 2011. The annual number of discharges calculated by means of the developed logit model described by Equation (2) is also presented in Table 1. Based on the tabular data, one can show that in the analysed time period the number of overflow discharges in particular years ranged from 13 to 29. The number of discharges measured and projected with the hydrodynamic model shows a slight variation, which is indicated by the numerical values in Table 1. Moreover, the analyses carried out shows that the number of rainfall events in particular years ranged from 36 to 58, while the average number of rainfall events was 45.

Year | Number of discharges by overflow calculated (SWMM)/measured | Number of discharges forecasted by logit model | Number of rainfall events |
---|---|---|---|

2008 | 13/15 | 14 | 43 |

2009 | 15/16 | 16 | 47 |

2010 | 17/18 | 19 | 47 |

2011 | 19/20 | 19 | 51 |

2012 | 21/13* | 20/14* | 36 |

2013 | 22/13* | 20/13* | 41 |

2014 | 29/16* | 28/16* | 44 |

2015 | 26 | 26 | 58 |

2016 | 22 | 21 | 44 |

2017 | 17 | 17 | 38 |

Year | Number of discharges by overflow calculated (SWMM)/measured | Number of discharges forecasted by logit model | Number of rainfall events |
---|---|---|---|

2008 | 13/15 | 14 | 43 |

2009 | 15/16 | 16 | 47 |

2010 | 17/18 | 19 | 47 |

2011 | 19/20 | 19 | 51 |

2012 | 21/13* | 20/14* | 36 |

2013 | 22/13* | 20/13* | 41 |

2014 | 29/16* | 28/16* | 44 |

2015 | 26 | 26 | 58 |

2016 | 22 | 21 | 44 |

2017 | 17 | 17 | 38 |

where: numbers of stormwater discharges for years 2008–2011 were obtained using continuous simulations, *outflow events for 2012–2014 were acquired on the basis of observed DC maximal filling.

*P*,

_{c}*t*), a logistic regression model was developed, described as follows: The values of

_{d}*α*coefficients and the measures for matching calculation results to measurements are given in Table 2. Determined parameters (

_{i}*SENS*,

*SPEC*,

*Rz*) indicating the matching of calculation results to the measurements show that the obtained logistic regression model

^{2}*p*=

*f*(

*P*,

_{c}*t*) has satisfactory predictive capability. Taking into account individual matching parameters expressing the accuracy of model classification (

_{d}*SPEC*= 96.9%) it was found that the model correctly identified 103 events for 106 observed stormwater discharge events. Based on the value of

*SENS*= 98.2% it was found that out of 165 events without stormwater discharged, the model appropriately classified 162 events.

Variable | α _{i} | Standard deviation | p from test |
---|---|---|---|

P _{c} | 0.566 | 0.0620 | 0.002 |

t _{d} | −0.004 | 0.0012 | 0.009 |

α_{0} | −2.152 | 0.1860 | 0.004 |

R^{2}_{McFadden} = 0.868 | SENS = 0.982 | ||

R^{2}_{Cox-Snell} = 0.798 | SPEC = 0.969 | ||

R^{2}_{Negelerke} = 0.868 | R_{z}^{2} = 0.974 |

Variable | α _{i} | Standard deviation | p from test |
---|---|---|---|

P _{c} | 0.566 | 0.0620 | 0.002 |

t _{d} | −0.004 | 0.0012 | 0.009 |

α_{0} | −2.152 | 0.1860 | 0.004 |

R^{2}_{McFadden} = 0.868 | SENS = 0.982 | ||

R^{2}_{Cox-Snell} = 0.798 | SPEC = 0.969 | ||

R^{2}_{Negelerke} = 0.868 | R_{z}^{2} = 0.974 |

Analysing the *R*_{z}^{2} value it was found that out of 271 observed events the model correctly classified 264 cases. Validation of the logit model showed that out of 10 events simulation results were obtained in 8 cases according to the measurements. A high compliance of the annual discharges number in the period 2008–2011 with the measurement results confirms a satisfactory predictive capability of the obtained logistic regression model.

*P*= const. (reduction of the average rainfall intensity

_{c}*q*=

*P*−

_{c}*t*

_{d}^{−1}) leads to a decrease of the probability of a discharge in storm overflow. It should be noted that the obtained relation

*p*

*=*

*f*(

*P*,

_{c}*t*) has been confirmed by Grum & Aalderink (1999), and Thorndahl & Willems (2008), who analysed the operation of storm overflows in urban catchment areas. In view of the above observation and attempting to provide a universal relation to the prediction of the occurrence of a stormwater overflow in a rainwater drainage system, Equation (4) was transformed in the form: The obtained Equation (5) is equal to the condition for the minimum rainfall (

_{d}*d*), the exceeding of which causes the discharge by an overflow and which was obtained by Thorndahl & Willems (2008) analysing the discharge volumes. The calculated value of parameter

_{min}*α*

_{0}/

*α*

_{1}=2.152/0.566 = 3.80 in Equation (5) has a physical interpretation and is equal to the weighted average value of retention height (

*d*) of the analysed catchment area. The result obtained is of significant practical importance as it enables the use of the developed logit model for other urban catchments, and in particular for uncontrolled ones. Equation (5) obtained in the paper is very important from the point of view of the forecast of storm overflow operation in uncontrolled basins, where at present, in order to analyse its functioning, a hydrodynamic model is created. It should be noted that despite the development of numerous empirical relationships for the determination of parameters in SWMM models, the results of calculations do not always accurately reflect the actual state of affairs (Guo & Urbonas 2009). At the same time, it should be remembered that the logit or hydrodynamic model is only a tool for forecasting the operation of the storm overflow. In order to determine the annual number of discharges, multiannual rainfall series (30 years according to DWA A-118E) are necessary, in the case of the hydrodynamic model for continuous simulation, and in the case of the probabilistic model for the development of the precipitation generator. Literature (Fu & Butler 2014) shows that a reliable precipitation generator can be built on the basis of a 10 year series.

_{r}= d_{min}Using the obtained relation shown in Equation (5) empirical distributions (distribution functions) for the total rainfall depth (*P _{c}*) and duration (

*t*) were determined. The theoretical distributions based on Kolmogorov-Smirnov test results were adjusted to the obtained empirical distributions; the results are presented in Table 3 and in Figures 3(a) and 3(b).

_{d}Probability distribution | P_{c} | t_{d} | ||
---|---|---|---|---|

D-critical | p-test | D-critical | p-test | |

chi-square | 0.231 | <0.0001 | 0.546 | <0.0001 |

beta | 0.108 | <0.0001 | 0.068 | 0.0830 |

Weibull | 0.050 | 0.3570 | 0.030 | 0.8520 |

exponential | 0.273 | <0.0001 | 0.057 | 0.2080 |

GEV | 0.399 | <0.0001 | 0.128 | <0.0001 |

Pareto | 0.375 | <0.0001 | 0.403 | <0.0001 |

Fisher-Tippet | 0.485 | <0.0001 | 0.382 | <0.0001 |

log-normal | 0.087 | <0.0001 | 0.091 | 0.0060 |

gamma | 0.951 | <0.0001 | 1.000 | <0.0001 |

Gumbel | 0.951 | <0.0001 | 1.000 | <0.0001 |

Probability distribution | P_{c} | t_{d} | ||
---|---|---|---|---|

D-critical | p-test | D-critical | p-test | |

chi-square | 0.231 | <0.0001 | 0.546 | <0.0001 |

beta | 0.108 | <0.0001 | 0.068 | 0.0830 |

Weibull | 0.050 | 0.3570 | 0.030 | 0.8520 |

exponential | 0.273 | <0.0001 | 0.057 | 0.2080 |

GEV | 0.399 | <0.0001 | 0.128 | <0.0001 |

Pareto | 0.375 | <0.0001 | 0.403 | <0.0001 |

Fisher-Tippet | 0.485 | <0.0001 | 0.382 | <0.0001 |

log-normal | 0.087 | <0.0001 | 0.091 | 0.0060 |

gamma | 0.951 | <0.0001 | 1.000 | <0.0001 |

Gumbel | 0.951 | <0.0001 | 1.000 | <0.0001 |

*D*and

*p*(see Table 3), by means of a Kolmogorov-Smirnov test, it was found that for the empirical data (concerning the total rainfall depth and duration) the Weibull distribution described by the following formula would be best suited: where

*x*-variable;

*β*, γ,

*μ*-empirical parameters of the distributions, in the case of

*P*are

_{c}*μ*= 3.0 mm;

*β*= 0.714 and γ = 5.336 mm and for

*t*are

_{d}*μ*= 12.0 min;

*β*= 0.994 and γ = 491.94 min.

The calculations performed showed that the difference between the modelled and measured mean values (*μ*) and standard deviation values (σ) for *P _{c}* and

*t*does not exceed 1%. The difference in the value of the correlation coefficient (

_{d}*R*) derived from MC simulation results and calculated from the measurement data does not exceed 2%. In addition, the assessment of the conformity of the theoretical distributions and those obtained by Monte Carlo simulation using the IC method showed that the probability values

*p*in the Kolmogorov-Smirnov test are

*p*= 0.54 for the total rainfall depth and

*p*= 0.59, respectively. Taking into account the above calculation results it can be concluded that the number of

*N*= 5000 samples used is sufficient and the theoretical distributions of

*P*and

_{c}*t*and those ones received from MC simulations are compatible.

_{d}Based on the logit regression model described by Equation (4) the theoretical distributions (*P _{c}* and

*t*) were determined and the annual number of stormwater discharges was calculated with a probabilistic model in accordance with the scheme in Figure 1. In the analyses performed, two cases have been considered: the first option takes into account the variable number of rainfall events per year (

_{d}*N*= var) and the second option omits this fact and takes into account the average value of rainfall events occurring in singular years (

*N*= const.). The results of the calculations performed with the probabilistic model (Figure 3(d)) were compared with the results of continuous simulations carried out with the SWMM programme for the period 2008–2017.

From data in Figure 3(d), the expected value of the number of discharges in storm overflow (*n* = 28) obtained under the assumption of a uniform distribution of the number of rainfall events in a year (*N* = var.) is higher than when the average number of rainfall episodes (*n* = 27) is assumed. Moreover, the obtained curves show that for *p* < 0.28 the number of discharges in storm overflow obtained with the assumption *N* = const. is lower than when the stochastic character of the number of rainfall events per year is taken into account. If the probability value *p* (Figure 3(d)) is greater than 0.28 then the number of discharges in storm overflow obtained under the assumption of the average annual number of rainfall episodes is greater than the discharges number determined while taking into account the random character of the number of rainfall events in the year. Percentile values *p* = 0.05 and *p* = 0.95 corresponding to the annual number of discharges in storm overflow are 22 and 32, respectively, when *N* = const.; when the number of annual rainfall episodes is of a probabilistic nature then the appropriate percentile values are 19 and 37. Summing up the results of the calculations obtained, it can be concluded that at the stage of storm overflow designing by means of the presented probabilistic model, omitting the stochastic nature of the number of rainfall events (*N* = const.) can lead to under-positioning of the overflow edge and thus to exceeding the permitted number of wastewater discharges. Moreover it is worth noting that the results of simulation of the annual number of discharges by means of the calibrated hydrodynamic model are within the range of solutions obtained by the probabilistic model which indicates that the method presented in the paper can be applied in practice.

## CONCLUSIONS

The paper presents the results of calculations of the annual number of stormwater overflows using the developed probabilistic model and based on the results of continuous simulations with a hydrodynamic model.

The paper presents an innovative application of the logistic regression model to the forecast of the operation of an underground infrastructure facility (storm overflow) located in the rainwater drainage system. The calculations have confirmed that the logit model is a useful tool and can be used to simulate complex flow phenomena during precipitation episodes. In practical terms, the logit model can be used as an alternative tool for forecasting the storm overflow operation by means of a hydrodynamic model, and in particular in the event of problems with its calibration.

It is worth noting that the empirical parameters used in the logit model have a physical interpretation, which makes it possible to apply that model in unmeasured urban basins. However, in order to confirm this, further studies are needed in catchments with different physical and geographical characteristics.

The probabilistic model presented in the paper for the forecast of the annual number of stormwater overflows is an innovative combination of logit model and rainfall generator (IC method). This model takes into account the stochastic nature of the number of rainfall events in the year, which has so far been ignored in the calculations. The developed probabilistic model makes it possible to estimate the annual number of discharges in the case of limited access to long-term rainfall measurements (30 years); however, data are necessary to develop a model (characteristics of the catchment area) to simulate the operation of the overflow and precipitation generator.

The results obtained in the paper indicate that ignoring the stochastic nature of the number of rainfall events in the annual cycle may lead to lowering the designed storm overflow edge and, consequently, to exceeding the permissible number of discharges.

The analyses of the annual number of discharges by the storm overflow showed that the results of calculations obtained with the use of a hydrodynamic model are in the range of probabilistic solutions presented in the paper with the use of the mathematical model. This confirms the practical character of the method discussed and applied in the paper for the forecast of the annual number of stormwater discharges by means of a storm overflow.

Taking into account the above considerations, it seems justified to conduct further research aimed at the possibility of using the logistic regression model to assess the functioning of other facilities located in the rainwater drainage networks (e.g. retention reservoirs, sewage treatment plants). Moreover, due to the fact that in many cases we have limited data resources (short period of rainfall and flow measurements), further studies are recommended in order to assess the influence of selected boundary distributions and uncertainty of estimated parameters in the logit models on the results of simulations of storm overflow operation.

## REFERENCES

*(under review)*