Abstract
Flood events are typically triggered by extreme precipitation in rain-dominant basins. In this study, to better understand the genetic mechanisms and characteristics of floods, copula functions are used to analyze the response of flood events to extreme precipitation. The coincidence probabilities of the typical extreme flood and precipitation events are calculated; different return periods of their arbitrary combinations are calculated, whereas the dangerous domains for flood control under different return periods are identified; furthermore, flood risk analysis under different extreme precipitation scenarios is performed via their conditional exceedance probabilities. The Xitiaoxi catchment (XC) and Dongtiaoxi catchment (DC) in the Zhexi Region of the Taihu Basin are selected as the study area. The results show that in four scenarios with precipitation frequencies of 80%, 90%, 93.33%, and 95%, the probabilities of the dangerous flood are 9.72%, 10.57%, 10.86%, and 11.01% in the XC, respectively, and 5.91%, 6.31%, 6.44%, and 6.51% in the DC, respectively. This study provides a practical basis and guidance for the computation of rainstorm designs, management of flood control safety, and water resource scheduling in the Taihu Basin.
HIGHLIGHTS
This study focused on the flood risk in the main water source of Taihu Lake, i.e., the Zhexi Region, which had a direct and significant impact on the floods in the Taihu Basin.
Extreme hydrological and meteorological events were considered as the objects of this study. Their joint probability distribution and combined effects were directly related to the degree of flood risk.
Graphical Abstract
INTRODUCTION
Floods occur frequently and are widely distributed, resulting in significant casualties and economic losses worldwide, particularly in plain river network areas with developed economies and dense populations. (UNISDR 2016; Zeng et al. 2019; Thapa et al. 2020). The Taihu Basin is a typical plain river network area in China, located in the economically developed Yangtze River Delta. Although the area and population of the Taihu Basin account for only 0.4 and 3% of the country, the Gross Domestic Product (GDP) is as high as 13% of that of the country (Bao et al. 2016). Characterized by a flat terrain, extensive river networks, and numerous polder areas, the Taihu Basin is vulnerable to flooding (Zhai et al. 2020). In 2016, direct economic losses due to flood disasters in the Taihu Basin were $1.06 billion, accounting for 0.12% of its GDP (Wang 2018). Meanwhile, in 1991 and 1999, the direct economic losses due to flood disasters accounted for 6.7% and 1.6% of the GDP in the corresponding year (Xu 2009; Wang et al. 2018). Within this context, satisfying national and municipal demands for a better understanding of flood risks in the Taihu Basin is of top priority for scientific and effective disaster prevention and mitigation work (Tanaka et al. 2017).
Precipitation and discharge are closely related, and floods are typically triggered by extreme precipitation events in rain-dominant basins (Fang et al. 2018). The flood events in the Taihu Basin have occurred primarily during the flood season (from May 1st to September 30th) and have always been accompanied by extreme precipitation events (Li et al. 2013). Therefore, investigating the bivariate frequency of extreme flood and extreme precipitation during the flood season will facilitate a better understanding of the incidence characteristics and genetic mechanisms of flood events in the Taihu Basin.
The copula function, introduced by Sklar (Sklar 1959), is a commonly used tool for calculating the joint distribution of multiple random variables. It can flexibly construct the multivariate joint distribution of random variables with different marginal distributions and describe linear and non-linear correlations between them (Favre et al. 2004). The application of copula in hydrology can be categorized into two primary types. The first is to analyze the frequency of the multifeature attributes of a certain hydrological phenomenon, such as the drought duration and intensity (Kwon & Lall 2016); flood duration, peak, and volume (Daneshkhah et al. 2016); and precipitation duration, intensity, and depth (Gao et al. 2018). The other is to study the combination of hydrological events at different time and space scales, such as the flood coincidence probability analysis for the mainstream and its tributaries (Gao et al. 2018), as well as river flow simulations based on spatio-temporal joint distributions (Chen et al. 2019). A more comprehensive understanding of hydrological phenomena can be obtained based on multivariate joint frequencies.
Thus far, many scholars have used copula functions to evaluate the multivariate frequencies of flood or precipitation events. Chen et al. analyzed the risk of flooding as a result of flood coincidences by considering flood magnitudes and times (dates) of occurrence using multivariate copula functions (Chen et al. 2012). Zhang et al. investigated the probabilistic behaviors of precipitation extremes based on copulas in both space and time (Zhang et al. 2013). Saad et al. analyzed spring flood-causing mechanisms in terms of the occurrence, frequency, duration, and intensity of precipitation as well as temperature events and their combinations prior to and during floods using a proposed multivariate copula approach (Saad et al. 2015). Goswami et al. analyzed the copula-based probabilistic behavior of precipitation extremes over the eastern Himalayan region and discovered that the co-occurrences of heavy and weak precipitation will be more frequent in the future, resulting in a higher risk of floods and droughts within the same year (Goswami et al. 2018). However, the studies above have focused on the multivariate frequency analysis of different feature attributes of the same event (i.e., either the flood or precipitation event), whereas the multivariate frequency analysis of the extreme flood and extreme precipitation events has not been performed.
Frequency analysis of meteorological or hydrological events in the Taihu Basin has been performed. Hu et al. constructed the joint distribution of the initial time of typhoons, the start time of plum rains, and the end time of plum rains in the Taihu Basin based on Gumbel copulas and concluded that the total encounter possibility of typhoons and plum rains in the Taihu Lake Basin was 9.23% (Hu et al. 2010). Liu et al. investigated the relationships between the onset of plum rains and the rainfall amount during the plum rains period in the Taihu Basin based on the copula function; the results showed that changes in floods appearing during the early onset of plum rains must be the focus (Liu et al. 2015). Han et al. applied bivariate copulas to analyze the correlation between flood peak levels and duration at two stations in the Taihu Basin and obtained the joint return periods of their arbitrary combinations (Han et al. 2018). Luo et al. calculated the joint distribution of the precipitation events of eight hydrological subregions in the Taihu Basin using Archimedean copulas considering the spatial and temporal differences in flood risks caused by plum rains and typhoons (Luo et al. 2019). However, these studies focused on the different feature attributes of a single meteorological or hydrological event in the Taihu Basin, whereas the joint distribution analysis of extreme meteorological and hydrological events has not yet been investigated. Meanwhile, the aforementioned studies pertaining to the Taihu Basin considers the average situation of the entire Taihu Basin as the study object; however, a targeted research regarding the Zhexi Region, i.e., the main water source of the Taihu Lake, has not been conducted.
Therefore, the main purpose of this study is to investigate the response of flood events to extreme precipitation in the Zhexi Region of the Taihu Basin through the joint risk analysis of extreme flood and extreme precipitation events. The main contents of this study include: (1) establishment of a copula-based joint distribution model of the extreme flood and precipitation events for flood risk analysis, (2) calculation of the coincidence probabilities of typical extreme flood and precipitation events, (3) analysis of different return periods of arbitrary combinations of extreme flood and precipitation events and the identification of dangerous domains for flood control under different return periods, and (4) flood risk analysis under different extreme precipitation scenarios via their conditional exceedance probabilities. The technical roadmap of this study is shown in Figure 1.
MATERIALS AND METHODS
Materials
Study area
The Xitiaoxi catchment (XC) and Dongtiaoxi catchment (DC) are located in one of the eight hydrological subregions of the Taihu Basin, i.e., the Zhexi Region. The upstream area of the Hengtangcun hydrometrical station in the XC and the upstream area of the Pingyao hydrometrical station in the DC were selected as the study area (Figure 2), whose drainage areas measured 1,359 km2 and 1,420 km2, respectively. The Zhexi Region is located upstream of the Taihu Lake and is the subregion of the Taihu Basin with the highest rainfall. The inflow of the Taihu Lake from the Zhexi Region accounts for approximately 50% of the total inflow of this lake (Wang et al. 2019). The steep slopes and fast flowing rivers resulted in a high risk of floods in the XC and DC, which are the main source of floods in the Taihu Basin.
The XC (119°14′E–19°3′E, 30°23′N–30°53′N) lies in the subtropical monsoon climate zone, which is mild and humid. The multiyear average temperature and precipitation are 15.5 °C and 1,465.8 mm, respectively. The distribution of the precipitation is uneven during the year, with the most precipitation occurring in the flood season, accounting for 75% of the annual precipitation. The runoff in the XC is generated by the precipitation, and its occurrence and quantity are dependent on the characteristics of the precipitation. Therefore, the within-year variation in the runoff is significant, and the runoff during the flood season accounts for 45–54% of the annual runoff, making the XC prone to floods. The average annual inflow of the Taihu Lake from the XC is 26.8 × 108 m3, accounting for 27.7% of the total inflow of this lake (Liu et al. 2009). Since the mid-20th century, extreme floods have occurred frequently in the XC. For example, as a result of a typhoon, the maximum observed discharge at the Hengtangcun station was 1,770 m3/s in 2013. High mountainous and hilly areas are distributed in the southwest with a maximum elevation of 1,578 m, whereas low alluvial plains lie in the northeast (Lv et al. 2015). The dominant land use types are woodland and cropland.
The DC (119°30′E–120°06′E, 30°08′N–30°42′N) has a subtropical monsoon climate, with an annual average temperature and precipitation of 15.8 °C and 1,640 mm, respectively. The precipitation varies significantly by year, and its distribution is uneven during the year. May to July is the flood season characterized by plum rains, with the precipitation ranging from 450 to 510 mm; August to September is the flood season characterized by typhoons, with the precipitation ranging from 190 to 800 mm. Furthermore, the precipitation in the flood season accounts for approximately 75% of the annual precipitation. Similar to the runoff in the XC, the runoff in the DC is generated by precipitation, and the distribution of the runoff during the year is consistent with the precipitation. The average annual inflow of Taihu Lake from the DC is 15.4 × 108 m3, accounting for 18.4% of its total inflow (Ma & Wang 2011). The maximum flood discharge at Pingyao station was 795 m3/s, which occurred in 1984. The geomorphology from northwest to east varies from mountainous to hilly and ends in a plain area, and the corresponding height ranges from 1,500 to 3 m. The major land use types are woodland and cropland, and the DC has more urban and construction land compared with the XC.
Dataset
The data used in this study included:
- (1)
Daily precipitation data during the flood season (from 1965 to 2016) of 14 representative stations in the XC, including the Hengtangcun station, and 12 representative stations in the DC, including the Pingyao station.
- (2)
Daily discharge data during the flood season from 1965 to 2016 of the Hengtangcun station in the XC and the Pingyao station in the DC.
The locations of each station are shown in Figure 2. The areal average precipitations in the two catchments were calculated using the Thiessen polygon method. All data were obtained from the ‘Annual Hydrological Report P.R. China’ (Lin et al. 1965–2016), which is a hydrological yearbook compiled by the Ministry of Water Resources of the People's Republic of China. The data in the hydrological yearbook have been reviewed by the authors to guarantee the reliability, including the completeness, accuracy, and consistency. Therefore, the data used here are reliable.
Determination of marginal distribution
The marginal distributions of the four series were determined: (1) the maximum flood discharge during the flood season (MF) at the Hengtangcun station, (2) the areal average precipitation during the flood season (Pr) in the XC, (3) the MF at the Pingyao Station, and (4) the Pr in the DC.
Autocorrelation test







Parameter estimation





Goodness-of-fit test






Establishment of copula model
The copula model used to describe the dependence structure of the MF and Pr series in the study areas was constructed by joining their marginal distributions.
Dependence evaluation








Copula function and parameter estimation





The copula is unique if the marginals are continuous.
The typically applied Archimedean copulas in hydrology, including four single-parameter copulas and four double-parameter copulas whose structures are more flexible, were employed in this study. They are defined in Table 1.
Notation and properties of bivariate Archimedean copula families
Name . | Expression ![]() | Generator function![]() | Parameter range . |
---|---|---|---|
Gumbel | ![]() | ![]() | ![]() |
Clayton | ![]() | ![]() | ![]() |
Frank | ![]() | ![]() | ![]() |
Joe | ![]() | ![]() | ![]() |
Clayton–Gumbel | ![]() | ![]() | ![]() |
Joe–Gumbel | ![]() | ![]() | ![]() |
Joe–Clayton | ![]() | ![]() | ![]() |
Joe–Frank | ![]() | ![]() | ![]() |
Name . | Expression ![]() | Generator function![]() | Parameter range . |
---|---|---|---|
Gumbel | ![]() | ![]() | ![]() |
Clayton | ![]() | ![]() | ![]() |
Frank | ![]() | ![]() | ![]() |
Joe | ![]() | ![]() | ![]() |
Clayton–Gumbel | ![]() | ![]() | ![]() |
Joe–Gumbel | ![]() | ![]() | ![]() |
Joe–Clayton | ![]() | ![]() | ![]() |
Joe–Frank | ![]() | ![]() | ![]() |


Goodness-of-fit test




The corresponding p-values were obtained via Monte Carlo methods, the detailed procedures of which have been provided by Genest et al. (2009).

Return periods and dangerous domains
The return period indicates the average time between the concurrence of two hydrological events and is critical for flood prevention and mitigation work. Three types of bivariate return periods, including the joint return period (), co-occurrence return period (
), and secondary return period (
), are discussed here.

















The dangerous domain , the notion of which was introduced by Salvadori et al. (Salvadori et al. 2011), contains all the dangerous events that are regarded as ‘more dangerous’ than the prescribed vector of thresholds
. The dangerous domain identified under different return periods are different.
- (1)Under
:where at least one of the components exceeds a prescribed threshold, as shown in the gray-shaded area in Figure 3(a).
- (2)Under
:where both the components exceed a prescribed threshold, as shown in the gray-shaded area in Figure 3(b).
- (3)
Under
:
Graphical illustration of dangerous domain. (a) Identification of dangerous domain under ; (b) identification of dangerous domain under
; (c) identification of dangerous domain under
; (d) comparison of dangerous domains under
; (e) comparison of dangerous domains under
; (f) comparison of dangerous domains under
. Black curves represent contour lines of different return periods, and value of the return period is marked near contour line. Black dots represent thresholds, whose coordinates are included adjacent to them. Gray-shaded areas represent dangerous domains corresponding to different thresholds.
Graphical illustration of dangerous domain. (a) Identification of dangerous domain under ; (b) identification of dangerous domain under
; (c) identification of dangerous domain under
; (d) comparison of dangerous domains under
; (e) comparison of dangerous domains under
; (f) comparison of dangerous domains under
. Black curves represent contour lines of different return periods, and value of the return period is marked near contour line. Black dots represent thresholds, whose coordinates are included adjacent to them. Gray-shaded areas represent dangerous domains corresponding to different thresholds.



The definition of the multivariate return periods and
is limited in the identification of dangerous domains, whereas
based on the Kendall measure is more rational. An advantage of
is that thresholds lying over the same critical line always generate the same dangerous domain; however, this does not apply when considering
and
, as illustrated in detail in Figure 3 and Table 2.
Identification of dangerous domains under different return periods and their pros and cons
Type of the return period . | Thresholds . | Value of the return period . | Dangerous domain . | Pros and cons . |
---|---|---|---|---|
![]() | ![]() | ![]() | Polygon ![]() | As shown in Figure 3(d), the joint return period of ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
![]() | ![]() | Polygon ![]() | ||
![]() | ![]() | ![]() | Rectangle ![]() | As shown in Figure 3(e), the co-occurrence return period of ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
![]() | ![]() | Rectangle ![]() | ||
![]() | ![]() | ![]() | Area ![]() | As shown in Figure 3(f), any point on the same secondary return period contour line has the same dangerous domain. The larger the return period, the smaller the dangerous domain. The dangerous domain with a smaller return period necessarily covers that with a larger return period. Compared with ![]() ![]() |
![]() | ![]() | Area ![]() |
Type of the return period . | Thresholds . | Value of the return period . | Dangerous domain . | Pros and cons . |
---|---|---|---|---|
![]() | ![]() | ![]() | Polygon ![]() | As shown in Figure 3(d), the joint return period of ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
![]() | ![]() | Polygon ![]() | ||
![]() | ![]() | ![]() | Rectangle ![]() | As shown in Figure 3(e), the co-occurrence return period of ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
![]() | ![]() | Rectangle ![]() | ||
![]() | ![]() | ![]() | Area ![]() | As shown in Figure 3(f), any point on the same secondary return period contour line has the same dangerous domain. The larger the return period, the smaller the dangerous domain. The dangerous domain with a smaller return period necessarily covers that with a larger return period. Compared with ![]() ![]() |
![]() | ![]() | Area ![]() |
Conditional exceedance probability
RESULTS AND DISCUSSION
Determination of marginal distributions
We first tested the randomness of the variables by computing their autocorrelations. The autocorrelation plots of the MF and Pr series in the XC and DC are shown in Figure 4. As shown, the autocorrelations in the four series can be disregarded at the 5% significance level. Therefore, their univariate distribution fitting can be performed.
Autocorrelation plots of MF and Pr series in XC and DC. (a) Autocorrelation plot of MF series in XC; (b) autocorrelation plot of Pr series in XC; (c) autocorrelation plot of MF series in DC; (d) autocorrelation plot of Pr series in DC. The lag of each autocorrelation estimate is denoted on the horizontal axis, and each autocorrelation estimate is indicated by height of vertical bars. A pair of horizontal, dashed lines represent lag-wise 95% confidence intervals centered at zero and are used for determining the statistical significance of an individual autocorrelation estimate at a specified lag vs. a null value of zero (i.e., no autocorrelation at that lag).
Autocorrelation plots of MF and Pr series in XC and DC. (a) Autocorrelation plot of MF series in XC; (b) autocorrelation plot of Pr series in XC; (c) autocorrelation plot of MF series in DC; (d) autocorrelation plot of Pr series in DC. The lag of each autocorrelation estimate is denoted on the horizontal axis, and each autocorrelation estimate is indicated by height of vertical bars. A pair of horizontal, dashed lines represent lag-wise 95% confidence intervals centered at zero and are used for determining the statistical significance of an individual autocorrelation estimate at a specified lag vs. a null value of zero (i.e., no autocorrelation at that lag).
We used six distribution models (the GEV, Gumbel, P-III, gamma, normal, and log-normal distributions) to fit the marginal distributions for the MF and Pr series in the XC and DC, and the results are shown in Figures 5 and 6. As show in both figures, the CDFs and PDFs for the marginal distributions exhibit different fitting performances between the estimated and empirical distributions.
Comparison of different probability density estimates with observed frequency for MF and Pr series in XC. (a) Comparison of CDFs of candidate distributions and observed frequency for MF series in XC; (b) comparison of PDFs of candidate distributions and observed frequency for MF series in XC; (c) comparison of CDFs of candidate distributions and observed frequency for Pr series in XC; (d) comparison of PDFs of candidate distributions and observed frequency for Pr series in XC.
Comparison of different probability density estimates with observed frequency for MF and Pr series in XC. (a) Comparison of CDFs of candidate distributions and observed frequency for MF series in XC; (b) comparison of PDFs of candidate distributions and observed frequency for MF series in XC; (c) comparison of CDFs of candidate distributions and observed frequency for Pr series in XC; (d) comparison of PDFs of candidate distributions and observed frequency for Pr series in XC.
Comparison of different probability density estimates with observed frequency for MF and Pr series in DC. (a) Comparison of CDFs of candidate distributions and observed frequency for MF series in DC; (b) comparison of PDFs of candidate distributions and observed frequency for MF series in DC; (c) comparison of CDFs of candidate distributions and observed frequency for Pr series in DC; (d) comparison of PDFs of candidate distributions and observed frequency for Pr series in DC.
Comparison of different probability density estimates with observed frequency for MF and Pr series in DC. (a) Comparison of CDFs of candidate distributions and observed frequency for MF series in DC; (b) comparison of PDFs of candidate distributions and observed frequency for MF series in DC; (c) comparison of CDFs of candidate distributions and observed frequency for Pr series in DC; (d) comparison of PDFs of candidate distributions and observed frequency for Pr series in DC.
The K–S test, MAE, PPCC, and DMC were used to evaluate the performances of six candidate distribution models, and the results are shown in Table 3. The p-values for all were greater than 0.05, indicating that all the candidate distributions were suitable for fitting the distribution of the MF and Pr series at the 5% significance level. It can be concluded that the MF and Pr series in the XC fitted the gamma distribution the best, whereas the MF and Pr series in the DC fitted the GEV and gamma distributions the best, respectively.
Goodness-of-fit tests for candidate distributions: K–S test (p-value), MAE, PPCC, and DMC
Catchment . | Series . | Distribution . | p-value . | MAE . | PPCC . | DMC . |
---|---|---|---|---|---|---|
XC | MF | GEV | 0.98 | 0.05077 | 0.91545 | 0.99416 |
Gumbel | 0.98 | 0.05078 | 0.91470 | 0.97623 | ||
Pearson-III | 0.94 | 0.05002 | 0.99259 | 0.99424 | ||
Gamma | 0.98 | 0.05001 | 0.99282 | 0.99505 | ||
Normal | 0.57 | 0.05002 | 0.99266 | 0.99431 | ||
Log-Normal | 0.95 | 0.05009 | 0.97542 | 0.99422 | ||
Pr | GEV | 0.79 | 0.05129 | 0.98339 | 0.99029 | |
Gumbel | 0.76 | 0.05063 | 0.98460 | 0.99052 | ||
Pearson-III | 0.87 | 0.05142 | 0.80189 | 0.98545 | ||
Gamma | 0.78 | 0.05030 | 0.98965 | 0.99144 | ||
Normal | 0.48 | 0.05138 | 0.87357 | 0.98978 | ||
Log-Normal | 0.86 | 0.05133 | 0.95275 | 0.99000 | ||
DC | MF | GEV | 0.78 | 0.04828 | 0.98636 | 0.98673 |
Gumbel | 0.18 | 0.04931 | 0.95171 | 0.98170 | ||
Pearson-III | 0.05 | 0.05078 | 0.81735 | 0.95566 | ||
Gamma | 0.23 | 0.05017 | 0.85233 | 0.96172 | ||
Normal | 0.76 | 0.05083 | 0.80265 | 0.94691 | ||
Log-Normal | 0.10 | 0.04969 | 0.87787 | 0.97249 | ||
Pr | GEV | 0.83 | 0.04964 | 0.97465 | 0.98695 | |
Gumbel | 0.50 | 0.05026 | 0.97111 | 0.98673 | ||
Pearson-III | 0.74 | 0.05065 | 0.86893 | 0.98508 | ||
Gamma | 0.84 | 0.04802 | 0.97858 | 0.98732 | ||
Normal | 0.86 | 0.05057 | 0.94595 | 0.98549 | ||
Log-Normal | 0.78 | 0.05073 | 0.86228 | 0.98050 |
Catchment . | Series . | Distribution . | p-value . | MAE . | PPCC . | DMC . |
---|---|---|---|---|---|---|
XC | MF | GEV | 0.98 | 0.05077 | 0.91545 | 0.99416 |
Gumbel | 0.98 | 0.05078 | 0.91470 | 0.97623 | ||
Pearson-III | 0.94 | 0.05002 | 0.99259 | 0.99424 | ||
Gamma | 0.98 | 0.05001 | 0.99282 | 0.99505 | ||
Normal | 0.57 | 0.05002 | 0.99266 | 0.99431 | ||
Log-Normal | 0.95 | 0.05009 | 0.97542 | 0.99422 | ||
Pr | GEV | 0.79 | 0.05129 | 0.98339 | 0.99029 | |
Gumbel | 0.76 | 0.05063 | 0.98460 | 0.99052 | ||
Pearson-III | 0.87 | 0.05142 | 0.80189 | 0.98545 | ||
Gamma | 0.78 | 0.05030 | 0.98965 | 0.99144 | ||
Normal | 0.48 | 0.05138 | 0.87357 | 0.98978 | ||
Log-Normal | 0.86 | 0.05133 | 0.95275 | 0.99000 | ||
DC | MF | GEV | 0.78 | 0.04828 | 0.98636 | 0.98673 |
Gumbel | 0.18 | 0.04931 | 0.95171 | 0.98170 | ||
Pearson-III | 0.05 | 0.05078 | 0.81735 | 0.95566 | ||
Gamma | 0.23 | 0.05017 | 0.85233 | 0.96172 | ||
Normal | 0.76 | 0.05083 | 0.80265 | 0.94691 | ||
Log-Normal | 0.10 | 0.04969 | 0.87787 | 0.97249 | ||
Pr | GEV | 0.83 | 0.04964 | 0.97465 | 0.98695 | |
Gumbel | 0.50 | 0.05026 | 0.97111 | 0.98673 | ||
Pearson-III | 0.74 | 0.05065 | 0.86893 | 0.98508 | ||
Gamma | 0.84 | 0.04802 | 0.97858 | 0.98732 | ||
Normal | 0.86 | 0.05057 | 0.94595 | 0.98549 | ||
Log-Normal | 0.78 | 0.05073 | 0.86228 | 0.98050 |
Notes: P-value > 0.05 denotes a failure to reject the null hypothesis at the 5% significance level. Values in bold represent the minimum value of MAE and the maximum values of PPCC and DMC among candidate probability distributions, i.e., the most suitable distributions.
Bivariate frequency analysis of MF and Pr
Establishment of copula model
We measured the dependence between the MF and Pr series to determine whether it is appropriate to establish their copula model. The Kendall rank correlation coefficients between the MF and Pr series in the XC and DC were 0.327 and 0.403, respectively, which were greater than the threshold at the 5% significance level, i.e. 0.273. The Spearman's rank correlation coefficients between the MF and Pr series in the XC and DC were 0.480 and 0.569, respectively, which were greater than the threshold at the 5% significance level, i.e., 0.297. This signified that the null hypothesis at the 5% significance level was rejected and a correlation existed between the two variables. Therefore, it is feasible to use the copula function to establish the joint probability distribution between them.
The parameters of eight candidate copulas estimated using the IFM method and their goodness-of-fit tests through the Cramér–von Mises test, AIC, and BIC are shown in Table 4. As shown, all the p-values were greater than 0.05, indicating that all candidate copulas can be applied to fit the dependence structure between the MF and Pr series. The Joe–Frank copula performed the best for modeling the dependence structure of the MF and Pr series in the XC, whereas the Clayton copula performed the best for modeling the dependence structure of the MF and Pr series in the DC.
Estimated parameters and goodness-of-fit tests of the candidate copulas
Catchment . | Copula model . | Parameters . | p-value . | AIC . | BIC . | |
---|---|---|---|---|---|---|
![]() | ![]() | |||||
XC | Gumbel | 1.43 | / | 0.51 | −335.44 | −333.49 |
Clayton | 0.77 | / | 0.13 | −333.94 | −331.98 | |
Frank | 3.21 | / | 0.26 | −338.68 | −336.73 | |
Joe | 1.64 | / | 0.79 | −330.24 | −328.29 | |
Clayton–Gumbel | 0.77 | 1.00 | 0.12 | −331.94 | −328.03 | |
Joe–Gumbel | 1.00 | 1.43 | 0.49 | −333.44 | −329.54 | |
Joe–Clayton | 1.00 | 0.77 | 0.13 | −331.94 | −328.03 | |
Joe–Frank | 3.65 | 0.68 | 0.90 | −343.84 | −339.94 | |
DC | Gumbel | 1.70 | / | 0.70 | −336.98 | −335.03 |
Clayton | 1.26 | / | 0.47 | −344.63 | −342.68 | |
Frank | 4.29 | / | 0.15 | −321.46 | −319.51 | |
Joe | 2.08 | / | 0.66 | −344.30 | −342.35 | |
Clayton–Gumbel | 1.26 | 1.00 | 0.35 | −342.63 | −338.73 | |
Joe–Gumbel | 1.63 | 1.22 | 0.51 | −341.41 | −337.51 | |
Joe–Clayton | 1.00 | 1.26 | 0.40 | −342.63 | −338.73 | |
Joe–Frank | 2.70 | 0.93 | 0.59 | −341.17 | −337.27 |
Catchment . | Copula model . | Parameters . | p-value . | AIC . | BIC . | |
---|---|---|---|---|---|---|
![]() | ![]() | |||||
XC | Gumbel | 1.43 | / | 0.51 | −335.44 | −333.49 |
Clayton | 0.77 | / | 0.13 | −333.94 | −331.98 | |
Frank | 3.21 | / | 0.26 | −338.68 | −336.73 | |
Joe | 1.64 | / | 0.79 | −330.24 | −328.29 | |
Clayton–Gumbel | 0.77 | 1.00 | 0.12 | −331.94 | −328.03 | |
Joe–Gumbel | 1.00 | 1.43 | 0.49 | −333.44 | −329.54 | |
Joe–Clayton | 1.00 | 0.77 | 0.13 | −331.94 | −328.03 | |
Joe–Frank | 3.65 | 0.68 | 0.90 | −343.84 | −339.94 | |
DC | Gumbel | 1.70 | / | 0.70 | −336.98 | −335.03 |
Clayton | 1.26 | / | 0.47 | −344.63 | −342.68 | |
Frank | 4.29 | / | 0.15 | −321.46 | −319.51 | |
Joe | 2.08 | / | 0.66 | −344.30 | −342.35 | |
Clayton–Gumbel | 1.26 | 1.00 | 0.35 | −342.63 | −338.73 | |
Joe–Gumbel | 1.63 | 1.22 | 0.51 | −341.41 | −337.51 | |
Joe–Clayton | 1.00 | 1.26 | 0.40 | −342.63 | −338.73 | |
Joe–Frank | 2.70 | 0.93 | 0.59 | −341.17 | −337.27 |
Notes: P-value > 0.05 indicates that the estimated copula can be accepted at the 5% significance level. The smaller the value of the AIC and BIC, the better the distribution fits. The most appropriate copulas are indicated in boldface.
Coincidence probability
The 80th, 90th, 93.33th, and 95th quantiles of the Pr series were selected as the typical extreme precipitation events in both the XC and the DC, and they were 1,136, 1,245, 1,304, and 1,342 mm in the XC, respectively, and 1,084, 1,181, 1,231, and 1,265 mm in the DC, respectively. The typical extreme flood events in the XC were designated as 800, 1,000, 1,200, and 1,400 m3/s, respectively. Here, 800 m3/s is the safety discharge capacity of the Hengtangcun station. Safety discharge refers to the maximum discharge to pass the river safely under normal circumstances; 1,400 m3/s is the warning discharge determined based on the historical flood peak discharges at the Hengtangcun station. When the flood peak reaches the warning discharge, the flood control department focuses on flood control emergencies. The values 1,000 and 1,200 m3/s correspond to two extreme flood events between the safety discharge and warning discharge. Similarly, the typical extreme flood events in the DC were designated as 500, 550, 650, and 750, respectively; 500 m3/s is the safety discharge capacity of the Pingyao station; 750 m3/s is the warning discharge; 550 and 650 m3/s are two extreme flood events between the safety discharge and warning discharge in the DC.
The situation when the MF does not exceed the safety discharge is defined as a safe flood, which is beneficial for flood control; meanwhile, the situation when the MF is greater than or equal to the warning discharge is defined as a dangerous flood, which is adverse to flood control.
Using the established copula models, the coincidence probabilities of the typical MF and Pr events in the XC and DC were calculated to study the joint risk of the two extreme events, the results of which are shown in Table 5.
Coincidence probabilities of typical MF and Pr events in XC and DC
Catchment . | Coincidence probability of MF and Pr ![]() | |||
---|---|---|---|---|
XC | Pr >1,136 mm | Pr >1,245 mm | Pr >1,304 mm | Pr >1,342 mm |
MF >800 m3/s | 44.69% | 40.64% | 39.40% | 38.80% |
MF >1,000 m3/s | 32.67% | 26.23% | 24.18% | 23.18% |
MF >1,200 m3/s | 26.11% | 17.96% | 15.32% | 14.01% |
MF >1,400 m3/s | 22.80% | 13.69% | 10.69% | 9.19% |
DC | Pr >1,084 mm | Pr >1,181 mm | Pr >1,231 mm | Pr >1,265 mm |
MF >500 m3/s | 50.33% | 47.13% | 46.15% | 45.68% |
MF >550 m3/s | 41.76% | 37.10% | 35.64% | 34.93% |
MF >650 m3/s | 28.65% | 21.07% | 18.61% | 17.39% |
MF >750 m3/s | 21.84% | 12.39% | 9.26% | 7.70% |
Catchment . | Coincidence probability of MF and Pr ![]() | |||
---|---|---|---|---|
XC | Pr >1,136 mm | Pr >1,245 mm | Pr >1,304 mm | Pr >1,342 mm |
MF >800 m3/s | 44.69% | 40.64% | 39.40% | 38.80% |
MF >1,000 m3/s | 32.67% | 26.23% | 24.18% | 23.18% |
MF >1,200 m3/s | 26.11% | 17.96% | 15.32% | 14.01% |
MF >1,400 m3/s | 22.80% | 13.69% | 10.69% | 9.19% |
DC | Pr >1,084 mm | Pr >1,181 mm | Pr >1,231 mm | Pr >1,265 mm |
MF >500 m3/s | 50.33% | 47.13% | 46.15% | 45.68% |
MF >550 m3/s | 41.76% | 37.10% | 35.64% | 34.93% |
MF >650 m3/s | 28.65% | 21.07% | 18.61% | 17.39% |
MF >750 m3/s | 21.84% | 12.39% | 9.26% | 7.70% |
The following were observed in Table 5:
- (1)
For cases with the same MF, with the increase in the Pr, the marginal distribution probability and coincidence probability of the Pr decreased. Similarly, for cases with the same Pr, as the MF increased, the marginal distribution probability and coincidence probability of the MF decreased.
- (2)
As the MF increased, the sensitivity of its coincidence probability to changes in the Pr increased. Similarly, as the Pr increased, the sensitivity of its coincidence probability to changes in the MF increased.
Hence, we conclude that in the XC, the coincidence probabilities of encountering dangerous floods with the Pr frequency exceeding 80%, 90%, 93.33%, and 95% were 22.80%, 13.69%, 10.69%, and 9.19%, respectively. In the DC, the coincidence probabilities of encountering dangerous floods with the Pr frequency exceeding 80%, 90%, 93.33%, and 95% were 21.84%, 12.39%, 9.26%, and 7.70%, respectively.
Return periods and dangerous domains
The bivariate return periods of various combinations of the flood and extreme precipitation are more effective for actual flood control and management than univariate return period analysis; therefore, the joint return, co-occurrence return, and secondary return periods of the joint distribution of the MF and Pr series in the XC and DC were calculated. The contour maps of different return periods are shown in Figure 7, and some important values are shown in Table 6.
Univariate and bivariate return periods for MF and Pr series in XC and DC
Catchment . | Univariate . | Bivariate . | ||||
---|---|---|---|---|---|---|
![]() | MF (m3/s) . | Pr (mm) . | ![]() | ![]() | ![]() | |
XC | 5 | 1,003.70 | 1,136.66 | 3.08 | 13.35 | 8.26 |
10 | 1,203.64 | 1,246.80 | 5.60 | 46.64 | 23.46 | |
15 | 1,312.28 | 1,304.27 | 8.11 | 99.78 | 45.61 | |
20 | 1,386.69 | 1,342.83 | 10.61 | 172.75 | 74.69 | |
DC | 5 | 612.98 | 1,084.28 | 3.05 | 13.84 | 8.50 |
10 | 676.87 | 1,181.02 | 5.56 | 49.81 | 27.78 | |
15 | 705.82 | 1,231.34 | 8.06 | 107.91 | 58.13 | |
20 | 723.56 | 1,265.03 | 10.56 | 188.13 | 99.54 |
Catchment . | Univariate . | Bivariate . | ||||
---|---|---|---|---|---|---|
![]() | MF (m3/s) . | Pr (mm) . | ![]() | ![]() | ![]() | |
XC | 5 | 1,003.70 | 1,136.66 | 3.08 | 13.35 | 8.26 |
10 | 1,203.64 | 1,246.80 | 5.60 | 46.64 | 23.46 | |
15 | 1,312.28 | 1,304.27 | 8.11 | 99.78 | 45.61 | |
20 | 1,386.69 | 1,342.83 | 10.61 | 172.75 | 74.69 | |
DC | 5 | 612.98 | 1,084.28 | 3.05 | 13.84 | 8.50 |
10 | 676.87 | 1,181.02 | 5.56 | 49.81 | 27.78 | |
15 | 705.82 | 1,231.34 | 8.06 | 107.91 | 58.13 | |
20 | 723.56 | 1,265.03 | 10.56 | 188.13 | 99.54 |
Contour maps of different return periods for XC and DC. (a) Contour map of joint return periods for XC; (b) contour map of co-occurrence return periods for XC; (c) contour map of secondary return periods for XC; (d) contour map of joint return periods for DC; (e) contour map of co-occurrence return periods for DC; (f) contour map of secondary return periods for DC. The curves represent contour lines of different return periods. Number labels near contour lines represent values of different return periods.
Contour maps of different return periods for XC and DC. (a) Contour map of joint return periods for XC; (b) contour map of co-occurrence return periods for XC; (c) contour map of secondary return periods for XC; (d) contour map of joint return periods for DC; (e) contour map of co-occurrence return periods for DC; (f) contour map of secondary return periods for DC. The curves represent contour lines of different return periods. Number labels near contour lines represent values of different return periods.
If the MF and Pr are known, one can determine their ,
, and
based on Figure 7. Moreover, using the method described in the section on Return periods and dangerous domains the dangerous domains of the thresholds under different return periods can be identified.
- (1)The return periods satisfied the following two theoretical inequalities:where
and
are the univariate return periods. For example, for the MF = 1,003.70 m3/s, Pr = 1,136.66 mm pair in the XC (corresponding to a return period
years estimated using the fitted gamma distributions for MF and Pr series, respectively),
,
, and
were 3.08, 13.35, and 8.26 years, respectively. This finding is consistent with those of previous studies (Vandenberghe et al. 2010; Requena et al. 2013).
- (2)
The higher the return period, the greater was the differences among
,
, and
.
- (3)
The return periods of the XC and DC were not significantly different, indicating that the joint frequencies of the MF and Pr in the two catchments were the same. However, under the same univariate return period,
in the XC was slightly larger than that in the DC, whereas
and
were slightly smaller than those in the DC.
The contour map trend of the return periods was consistent those of previous studies (Bezak et al. 2014; Fan et al. 2016), indicating that the results were reasonable.
Flood risk under specified precipitation scenarios
In practice, water resource managers are more interested in the possibility of floods exceeding certain thresholds in certain extreme precipitation events. Hence, we calculated the conditional exceedance probabilities of the MF under different Pr scenarios. Four precipitation scenarios were presented: the 80th, 90th, 93.33th, and 95th quantiles of the Pr series. The values of the conditional exceedance probabilities are listed in Table 7, and their contour maps are shown in Figure 8.
Conditional exceedance probabilities of MF under given Pr scenarios in XC and DC
Catchment . | Conditional probabilities of MF under given Pr scenarios . | |||
---|---|---|---|---|
XC | Pr = 1,136 mm | Pr = 1,245 mm | Pr = 1,304 mm | Pr = 1,342 mm |
MF >800 m3/s | 61.89% | 64.26% | 65.00% | 65.36% |
MF >1,000 m3/s | 37.84% | 40.17% | 40.93% | 41.31% |
MF >1,200 m3/s | 20.13% | 21.70% | 22.24% | 22.50% |
MF >1,400 m3/s | 9.72% | 10.57% | 10.86% | 11.01% |
DC | Pr = 1,084 mm | Pr = 1,181 mm | Pr = 1,231 mm | Pr = 1,265 mm |
MF >500 m3/s | 69.85% | 71.70% | 72.27% | 72.55% |
MF >550 m3/s | 55.43% | 57.46% | 58.10% | 58.42% |
MF >650 m3/s | 25.65% | 27.07% | 27.54% | 27.77% |
MF >750 m3/s | 5.91% | 6.31% | 6.44% | 6.51% |
Catchment . | Conditional probabilities of MF under given Pr scenarios . | |||
---|---|---|---|---|
XC | Pr = 1,136 mm | Pr = 1,245 mm | Pr = 1,304 mm | Pr = 1,342 mm |
MF >800 m3/s | 61.89% | 64.26% | 65.00% | 65.36% |
MF >1,000 m3/s | 37.84% | 40.17% | 40.93% | 41.31% |
MF >1,200 m3/s | 20.13% | 21.70% | 22.24% | 22.50% |
MF >1,400 m3/s | 9.72% | 10.57% | 10.86% | 11.01% |
DC | Pr = 1,084 mm | Pr = 1,181 mm | Pr = 1,231 mm | Pr = 1,265 mm |
MF >500 m3/s | 69.85% | 71.70% | 72.27% | 72.55% |
MF >550 m3/s | 55.43% | 57.46% | 58.10% | 58.42% |
MF >650 m3/s | 25.65% | 27.07% | 27.54% | 27.77% |
MF >750 m3/s | 5.91% | 6.31% | 6.44% | 6.51% |
Contour maps of conditional exceedance probabilities of MF under various Pr scenarios in XC and DC. (a) Contour map of conditional exceedance probabilities of MF under various Pr scenarios in XC; (b) contour map of conditional exceedance probabilities of MF under various Pr scenarios in DC. The horizontal lines represent various Pr scenarios. The curves represent contour lines of different conditional exceedance probabilities. The number labels near contour lines represent values of different conditional exceedance probabilities.
Contour maps of conditional exceedance probabilities of MF under various Pr scenarios in XC and DC. (a) Contour map of conditional exceedance probabilities of MF under various Pr scenarios in XC; (b) contour map of conditional exceedance probabilities of MF under various Pr scenarios in DC. The horizontal lines represent various Pr scenarios. The curves represent contour lines of different conditional exceedance probabilities. The number labels near contour lines represent values of different conditional exceedance probabilities.
- (1)
Under the same Pr scenario, as the MF increased, the conditional exceedance probability decreased.
- (2)
For the same MF, the conditional exceedance probability in the heavy precipitation scenario was greater than that in the small precipitation scenario, i.e., the conditional exceedance probabilities of the MF increased with the amount of Pr.
- (3)
These conditional exceedance probabilities enabled the possibility of certain flood discharges under different precipitation scenarios to be identified. Using the XC for example, for Pr = 1,136 mm in the future, the probability of MF >800 m3/s is 61.89%.
Hence, we conclude the following:
- (1)
For the XC, in the four scenarios with Pr frequencies of 80%, 90%, 93.33%, and 95%, the probabilities of safe floods were 38.11%, 35.74%, 35.00%, and 34.64%, respectively, and the probabilities of dangerous floods were 9.72%, 10.57%, 10.86%, and 11.01%, respectively.
- (2)
For the DC, in the four scenarios with Pr frequencies of 80%, 90%, 93.33%, and 95%, the probabilities of safe floods were 30.15%, 28.30%, 27.73%, and 27.45%, respectively, and the probabilities of dangerous floods were 5.91%, 6.31%, 6.44%, and 6.51%, respectively.
The contour map trend of the conditional exceedance probabilities was consistent with that of a previous study (Guo et al. 2018), indicating the results were reasonable.
The contributions of this study are as follows:
- (1)
We focused on the flood risk in the main water source of Taihu Lake, i.e. the Zhexi Region, which had a direct and significant impact on the floods in the Taihu Basin. This region had not been emphasized in previous studies, whereas it is vital for flood prevention and drainage work.
- (2)
Extreme hydrological and meteorological events (i.e., the flood peak flow and precipitation during the flood season), which were the main flood disaster factors, were considered as the objects of this study. Their joint probability distribution and combined effects were directly related to the degree of flood risk and further affected the design and management of flood prevention and drainage systems in the Taihu Basin. Moreover, considering the extreme hydrological and meteorological events simultaneously can illustrate the inherent law of flood events more effectively and enable the relationship between flood and extreme precipitation to be analyzed.
CONCLUSIONS
To study the response of flood events to extreme precipitation in the Zhexi Region of the Taihu Basin, a copula model was established in this study to analyze the bivariate frequency of the MF and Pr series in the XC and DC. The 80th, 90th, 93.33th, and 95th quantiles of the Pr series were selected as typical extreme precipitation events. The safety discharge, warning discharge, and two extreme flood events between them were selected as typical extreme flood events. The coincidence probabilities of the typical MF and Pr events were calculated. The joint return, co-occurrence return, and secondary return periods for arbitrary combinations of the MF and Pr were calculated, and their contour maps were drawn. Meanwhile, the dangerous domains under different return periods were identified. Furthermore, flood risk analysis under different extreme precipitation scenarios was performed, and contour maps of the conditional exceedance probabilities were drawn. The main conclusions were as follows:
- (1)
According to the K–S test, MAE, PPCC, and DMC, both the MF and Pr in the XC fitted the gamma distribution the best, whereas the MF and Pr in the DC fitted the GEV and gamma distributions the best, respectively. Based on the Cramér–von Mises test, AIC, and BIC, the Joe–Frank copula and Clayton copula were selected as the most appropriate copulas to fit the joint distribution of the MF and Pr in the XC and DC, respectively.
- (2)
In the XC, the coincidence probabilities of encountering dangerous floods with the Pr frequency exceeding 80%, 90%, 93.33%, and 95% were 22.80%, 13.69%, 10.69%, and 9.19%, respectively. In the DC, the coincidence probabilities of encountering dangerous floods with the Pr frequency exceeding 80%, 90%, 93.33%, and 95% were 21.84%, 12.39%, 9.26%, and 7.70%, respectively.
- (3)
Different return periods for the same pair of MF and Pr satisfied the following two rules:
and
.
- (4)
In the four scenarios with Pr frequencies of 80%, 90%, 93.33%, and 95%, the probabilities of safe floods were 38.11%, 35.74%, 35.00%, and 34.64% for the XC, respectively, and 30.15%, 28.30%, 27.73%, and 27.45% for the DC, respectively. In the four scenarios with Pr frequencies of 80%, 90%, 93.33%, and 95%, the probabilities of dangerous floods were 9.72%, 10.57%, 10.86%, and 11.01% for the XC, respectively, and 5.91%, 6.31%, 6.44%, and 6.51% for the DC, respectively.
Water resources managers can identify the corresponding flood risk information for different flood and precipitation events from the contour map of return periods (Figure 7). Furthermore, the probability of a certain flood occurring under various precipitation scenarios can be assessed based on the contour map of conditional exceedance probabilities (Figure 8).
The results of this study can provide a practical basis and guidance for the computation of rainstorm designs, management of flood control safety, and water resource scheduling in the Taihu Basin. The framework designed in this study for analyzing the response of flood events to extreme precipitation can be applied to similar river basins. Furthermore, the flood generation mechanism has changed owing to climate change and human activities. Therefore, for future studies, we plan to perform flood frequency analysis under changed flood generation mechanisms to identify the effects of climate change and human activities on floods, as well as to provide a more practical reference for flood control.
FUNDING
This research was funded by the National Key Research and Development Program of China, grant number 2018YFC1508200.
DATA AVAILABILITY STATEMENT
All relevant data are included in the paper or its Supplementary Information.