This paper presents the evaluation of a multisite statistical weather generator (MulGETS: Multisite weather Generator of École de Technologie Supérieure) based on its simulation effect of precipitation in the Yangtze River Basin. MulGETS effectively generates spatially correlated sequences of precipitation simultaneously, while maintaining their spatial and temporal distribution characteristics. On the spatial scales, the accuracy of the model varies from station to station, and in general, the errors are lower at stations in the middle and lower reaches of the Yangtze River Basin than in the upper reaches. This difference is likely to exist because of the lower amount of rainfall and more complex topography than those of the upper river basins. On the temporal scales, the simulated values are more precise on the annual scale than on the seasonal scale. Large relative errors occur more frequently in winter, ranging from −35% to 25%. MulGETS can consistently produce precipitation by considering the intensity, magnitude, and duration indices with sub-basin varied observations. However, the precipitation maxima were much lower than the observations. This work shows the general reasonability of the model in downscaling precipitation in the Yangtze River Basin.

  • MulGETS is applied to generate daily precipitation for each subbasin in the Yangtze River Basin.

  • The performance is evaluated by 9 indices in three categories over seasonal and annual scales.

The general circulation model (GCM) is currently an important tool for simulating the basin's climate and predicting future climate scenarios. However, the spatial resolution of the model's output data is extremely coarse for application to meteorological and hydrological research at the basin scale. To solve this problem, scientists have extensively researched downscaling methods to convert rough GCM data into high-resolution regional or watershed scale data (Xu 1999; Wilby et al. 2002; Wood et al. 2004; Mehrotra et al. 2014; Pour et al. 2014; Singh et al. 2017; Araya-Osses et al. 2020; Li et al. 2020).

In general, downscaling methods can be divided into two categories: dynamical downscaling (Giorgi & Mearns 1999; Zeng et al. 2017; Liu et al. 2018; Shiferaw et al. 2018) and statistical downscaling (Chen et al. 2016; Yi et al. 2018) methods. Statistical downscaling is widely used because of its flexibility, simplicity, low computational complexity, and the ability to quickly simulate long-term regional climate sequence (Cheng et al. 2008; Chen et al. 2011; Chen et al. 2012; Wang et al. 2014; Verdin et al. 2015). Owing to the importance of fully understanding the secular variation of hydrological characteristics at the watershed scale, many scholars have applied different statistical downscaling methods to determine hydrological responses to climate change (Ahmed et al. 2013; Jeong et al. 2013; Wang et al. 2013; Lee & Bae 2016; Zheng et al. 2018). The climate change information needs to match the hydrological information on the basin scale; in other words, weather data corresponding to the existing stations should be used (Brissette et al. 2007).

Weather generators are an important statistical downscaling method, and they are widely used to generate synthetic sequences of weather variables that are statistically consistent with observed variables. Conventional weather generators are based on individual stations (or grid points) to generate single-point data or establish mutually independent regression equations (Brissette et al. 2007), however, hydrological simulation results are largely affected by the spatial distribution of regional precipitation. The above-mentioned single-site downscaling methods may neglect the correlation between adjacent points of the forecast elements. This deficiency partly restricts the application of the single-site statistical downscaling method (Chen et al. 2014). Therefore, in recent years, many studies have focused on spatially correlated downscaling methods (Kottegoda et al. 2003; Raje & Mujumdar 2009; Srikanthan & Pegram 2009; Breinl et al. 2013; Jeong et al. 2013; Chowdhury et al. 2019) or provided useful insights on the development of multisite downscaling methods (Wilks 1998; Harpham & Wilby 2005; Fang & Tacher 2010). Fu evaluated four multisite weather generators to simulate daily precipitation in the Gloucester Watershed in Australia. The results show that the four weather generators can satisfactorily simulate the extreme values of precipitation as well as the duration of wet and dry days (Fu et al. 2018).

Chen proposed a MATLAB-based stochastic weather generator referred to as Multisite weather Generator of École de Technologie Supérieure (MulGETS) and tested it by simulating precipitation and temperature at stations in the Jinghe Watershed in China, Peribonka Watershed in Canada, and Walnut Gulch Watershed in America (Chen et al. 2014). At these stations, MulGETS was able to retain the spatial correlation of precipitation and temperature between stations. Many researchers have applied MulGETS to areas of interest and compared it with other weather generators. This work has shown that MulGETS often outperformed the other three weather generators when applied to a wathershed of Eastern Ontario, Canada (Alodah & Seidou 2019). Tseng compared MulGETS with three other weather generators on the task of producing precipitation at different spatial scales, and the results showed that MulGETS exhibited the advantage of preserving most of the observed statistics and producing reasonable Q-Q plots (Tseng et al. 2020). Vallam and Qin compared MulGETS with a multi-site rainfall simulator, a Neyman–Scott based Poisson cluster model, and a proposed modified k-nearest neighbour model for 30 years of rainfall in Singapore, and found that MulGETS exhibited the lowest differences between prediction intervals (Vallam & Qin 2016). MulGETS can also be coupled with other methods to act as a spatial downscaling method. Chen proposed a multi-site downscaling approach coupling a quantile mapping method with MulGETS to simultaneously downscale precipitation for multiple locations within a watershed (Chen et al. 2018). He also assessed the effectiveness of MulGETS in hydrological modeling over a Canadian watershed in the province of Québec, and the results showed that the monthly mean discharge was accurately represented by MulGETS-generated precipitation and temperature (Chen et al. 2016).

The multisite weather generator effectively solves the challenging problem of retaining the spatial correlation of the input data. However, previous studies on MulGETS have generally concentrated on the consistency between the daily results of model simulations and observations with respect to indices of precipitation intensity and precipitation frequency. Moreover, it is important to evaluate MulGETS results on timescales longer than the daily scale due to the twofold reasons as follows. First, considering possible systematic deviations in some regions, these daily scale biases may be different when calculated on the seasonal or annual scales (e.g., the seasonal biases are likely to be reduced or amplified, or changed from positive to negative), therefore, it is necessary to evaluate MulGETS on the annual and seasonal scales over multiple sub-basins. Second, the Yangtze River Basin is the third-largest river basin in the world, with a total drainage area of approximately 1.8 million km2, accounting for 18.8% of China's land area, where extraordinary floods or high river water levels of the trunk stream were observed to occur as extreme seasonal hydroclimate events with persistent heavy rainfalls over 1–2 months (e.g., the 1998 and 2020 summer floods in the basin). Meanwhile, the subbasins of the Yangtze River Basin are greatly influenced by frequent floods in which the most severe damage seems to result from specific episodes of extreme precipitation on a regional scale, such as rainstorms and heavy precipitation events that account for high percentages of the yearly total over a few rainy days (Zhang et al. 2009). Besides, precise precipitation data, which are sometimes unavailable, are needed for prediction of floods. Hence, in this study, we evaluated the performance of MulGETS using multiple indices of extreme precipitation with MulGETS's ability to reproduce long-term precipitation characteristics examined at daily, annual, and seasonal scales on data collected from the Yangtze River Basin, which we hope will provide a reference for precipitation simulation and flood forecasting in future research.

Study area

The Yangtze River originates from Tanggula Mountain in Qinghai Province and flows through 11 provinces, administrative regions, or municipalities in China from the west to the east. The river has a total length of 6,380 km and a drainage area of 1.8 million km2. It is the third-largest drainage basin in the world, and the areas through which it flows contain mountains, plateaus, basins, plains, and other topographies. The entire basin area is subjected to diverse climate types; for example, some areas of the Qinghai-Tibet Plateau climate have a subtropical monsoon climate, the southern part has a tropical climate, and the northern part has a temperate climate. The annual mean precipitation in the Yangtze River Basin is approximately 1,067 mm. However, owing to the vast area and complex topography, the temporal and spatial distribution of annual precipitation and extreme events are very uneven; the annual mean precipitation in the west is between 300 and 500 mm, while that in the southeast can reach 1,600–1,900 mm. In addition, rainfall largely occurs during summertime (Gemmer et al. 2008). Based on the climate and underlying surface conditions, we roughly divided the study area into 11 sub-basins from the west to the east. The distribution of the sub-basins and the stations is shown in Figure 1.

Figure 1

Distribution of sub-basins and stations of the Yangtze River Basin: (1) Jinshajiang River, (2) Mintuojiang River, (3) Jialingjiang River, (4) Wujiang River, (5) Upper mainstream section, (6) Hanjiang River, (7) Dongting Lake, (8) Poyang Lake, (9) Middle mainstream section, (10) Lower mainstream section, and (11) Tai Lake.

Figure 1

Distribution of sub-basins and stations of the Yangtze River Basin: (1) Jinshajiang River, (2) Mintuojiang River, (3) Jialingjiang River, (4) Wujiang River, (5) Upper mainstream section, (6) Hanjiang River, (7) Dongting Lake, (8) Poyang Lake, (9) Middle mainstream section, (10) Lower mainstream section, and (11) Tai Lake.

Close modal

The area of each sub-basin and their detailed information are summarized in Table 1, which shows that the stations are evenly distributed. The average density of the stations is approximately 15,000 km2 per station; however, the density is relatively low at the Jinsha River, Mintuo River, and Jialing River. In addition, three stations in the northern part of the Jialing River were disregarded because of missing essential data, resulting in sparse station distribution in the basin.

Table 1

Primary information of sub-basins

Region CategoriesSub-basinAbbreviationArea/104 km2No. of stationsStation density (104 km2/station)
Upper region Jinsha River JSR 47.4 20 2.37 
Mintuo River MTR 16.3 10 1.63 
Jialing River JLR 16.3 2.33 
Wujiang River WJR 8.9 1.11 
Upper mainstream section UMS 10.3 1.29 
Middle region Hanjiang River HJR 15.8 14 1.23 
Middle mainstream section MMS 27.2 23 1.18 
Poyang Lake PYL 17.3 15 1.53 
Lower region Dongting Lake DTL 10.0 1.43 
Lower mainstream section LMS 9.3 1.03 
Tai Lake THL 4.0 1.00 
Total – 182.8 125 1.46 
Region CategoriesSub-basinAbbreviationArea/104 km2No. of stationsStation density (104 km2/station)
Upper region Jinsha River JSR 47.4 20 2.37 
Mintuo River MTR 16.3 10 1.63 
Jialing River JLR 16.3 2.33 
Wujiang River WJR 8.9 1.11 
Upper mainstream section UMS 10.3 1.29 
Middle region Hanjiang River HJR 15.8 14 1.23 
Middle mainstream section MMS 27.2 23 1.18 
Poyang Lake PYL 17.3 15 1.53 
Lower region Dongting Lake DTL 10.0 1.43 
Lower mainstream section LMS 9.3 1.03 
Tai Lake THL 4.0 1.00 
Total – 182.8 125 1.46 

Data

The precipitation data of the initial 175 stations in the Yangtze River Basin were obtained from the daily datasets provided by the China Meteorological Data Sharing Service System (http://cdc.cma.gov.cn/index.jsp). After quality control, 123 stations with better continuity and sufficient extent of data were selected, each with complete data from January 1961 to December 2005.

Model: MulGETS

The weather generator is a random model that generates a comprehensive weather time series with similar statistical characteristics as the input data. When the measured climate data is insufficient to meet the complete spatial coverage, the weather generator can supplement it. The generator can also be used to simulate past or future weather sequences and has become a commonly used tool in the research on climate change and hydrological response (Liao et al. 2004; Kilsby et al. 2007; Semenov 2008).

MulGETS is a MATLAB-based stochastic weather generator, developed by Chen et al. (2014). The MulGETS Matlab routine can be found on the Mathworks file exchange website (https://www.mathworks.com/matlabcentral/fileexchange/47537-multi-site-stochstic-weather-generator-mulgets). The steps and basic principles of MulGETS downscaling for precipitation are as follows:

  • Calculate two transition probabilities based on the historical precipitation data of each station.

  • The measured precipitation transition probabilities of each station are converted into values in a normal distribution with a mean value of 1 and a standard deviation of 0 by using the inverse complementary error function. Thus, the normalized transition probabilities matrix, norm_trans, is obtained. Then, the Cholesky method is used to generate a spatially correlated random number matrix, corr_random, with a size of m × n (where m is the product of the number of days in a certain month and the number of years in the generated sequence and n is the number of stations in the basin).

  • Based on the stability ineffectiveness theory of Markov, assuming that the state of the day is state1, each value of corr_random generated in step 2 is compared with its corresponding parameter, norm_trans. According to the results, the precipitation states of next day, state2, are obtained.

  • is introduced in the problem of spatial intermittence proposed by Brissette et al. (2007). Then, the daily precipitation occurrence coefficient is calculated for each season as follows:
    where O is the row vector of indicating whether precipitation occurs in the surrounding stations of a certain station on a certain day, and is the column vector of the seasonal correlation coefficient of precipitation occurrence between a certain station and its surrounding stations. According to the magnitude of , the observed data and its corresponding are divided into 9 intervals, an evenly distributed between 0 and 1, if a class contains fewer than 50 precipitation events, two or more neighboring classes are combined until there are more than 50 events in each of the new classes.
  • Parameters for exponential or gamma distribution are then calculated by devising a nonlinear regression equation of the group median and the observed precipitation mean or standard deviation sequence of all points in the interval. Then, the simulated average precipitation is corrected with the observed values, and the parameters of the exponential or gamma distributions are calculated. For gamma distribution, MulGETS calibrates several pairs of (α, β) for each site and season.

  • Finally, a random precipitation amount is generated according to the precipitation occurrence coefficient and the related parameters by determining which interval falls on a certain day and generates a certain amount of precipitation based on the interval's parameter and selected distribution.

Experimental design

For each sub-basin, observation data of the stations collected from 1961 to 2005 served as the input data. The MulGETS was used to generate the precipitation series on a daily scale. The only parameters required for the MulGETS operation are the daily precipitation occurrence threshold and type of precipitation distribution function. In this study, 0.1 mm/day is selected as the threshold and the gamma function is the distribution function. The mean and standard deviation of precipitation and occurrence index for each station of each sub-basin are provided in the Supplementary Material.

MulGETS is generally believed to generate a sequence that temporally coincides with the input values. When evaluating MulGETS, all sequences from 1961 to 2005 were used.

Most studies use explanatory variance, root mean squared error, standard deviation, and other indices to evaluate the simulation effect of the model (Huang et al. 2010; Liu et al. 2011). Some other scholars use more abundant indices, such as quantiles, median, maximum value, and minimum value, for evaluation (Wetterhall et al. 2006). In this study, we selected nine indices categorized into three types, which originated from the European Union STARDEX project (STAtistical and Regional dynamical Downscaling of EXtremes for the European region) (Goodess et al. 2012). In daily scale evaluation, we reserved standard deviation (SD) and 99th percentile of rain day amounts (Pq99) as extra indices. The abbreviations and definitions of extreme precipitation indices are shown in Table 2.

Table 2

Abbreviations and definitions of the indices

CategoryIndexUnitDefinition
 Pint mm/day Simple daily intensity (rain per rain day) 
Precipitation intensity Ppdd Mean dry-day persistence 
 Pnl90 day No. of events > long-term 90th percentile 
 Pq95 mm 95th percentile of rain day amounts 
Precipitation magnitude Pfl90 mm Total rainfall from events > long-term 90th percentile 
 Px5d mm Greatest 5-day total rainfall per year 
 Pxcdd day Max no. of consecutive dry days 
Precipitation persistence Pn10 mm day No. of days with precipitation ≥10 mm 
 Pdsav day Mean dry-day spell lengths 
CategoryIndexUnitDefinition
 Pint mm/day Simple daily intensity (rain per rain day) 
Precipitation intensity Ppdd Mean dry-day persistence 
 Pnl90 day No. of events > long-term 90th percentile 
 Pq95 mm 95th percentile of rain day amounts 
Precipitation magnitude Pfl90 mm Total rainfall from events > long-term 90th percentile 
 Px5d mm Greatest 5-day total rainfall per year 
 Pxcdd day Max no. of consecutive dry days 
Precipitation persistence Pn10 mm day No. of days with precipitation ≥10 mm 
 Pdsav day Mean dry-day spell lengths 

Overview of simulated daily precipitation

This subsection presents an overview of simulated daily precipitation, in which the SD, 95% percentiles (Pq95), and 99% percentiles (Pq99) were calculated from the MulGETS daily precipitation over 1961–2005.

The spatial distribution of the SD of the observed and simulated precipitation and their relative errors are shown in Figure 2. The SD of simulated precipitation on the daily scale of MulGETS was generally underestimated, and it was 1–5% smaller in most areas, indicating that the dispersion of precipitation series may not be estimated well. However, the situation varied for each sub-basin. At most stations of the middle mainstream and stations near the borders between the Middle mainstream and Wujiang River and the borders between Tai Lake and Lower mainstream, the SD of the simulated values was overestimated by 1–5%, and for several stations was even overestimated by 5–10%. This may be attributed to the steep terrain and mountainous areas in the upper reaches of the Yangtze River, and the heterogeneity of the spatial distribution thereof. In the other sub-basins, there are a few stations with positive RE, which are different from most stations with negative RE. These stations are mostly near the boundary of the basin, which is usually a watershed formed by high mountains. Therefore, it is possible that MulGETS is more likely to produce a more widely distributed precipitation series in mountainous areas and more narrowly distributed ones in plains.

Figure 2

Spatial distribution of Standard Deviation of (a) the observed data, (b) the MulGETS simulated data, (c) and their relative error of daily precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11-Tai Lake).

Figure 2

Spatial distribution of Standard Deviation of (a) the observed data, (b) the MulGETS simulated data, (c) and their relative error of daily precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11-Tai Lake).

Close modal

The simulated values of Pq95 and Pq99 were relatively accurate, and the errors in most regions were within 5% (Figure 3). Large negative REs of Pq95 were distributed mainly in the Jinsha River and northern Hanjiang River, while positive REs were located mainly near the borders between the middle mainstream section, Mintuo River, and Jinsha River. Compared with Pq95, Pq99 showed fewer large negative REs, but the spatial distribution of REs was more scattered. In general, in most sub-basins, MulGETS can capture precipitation maximums and reproduce their spatial distribution characteristics.

Figure 3

Spatial distribution of Pq95&Pq99 of (a) & (d) the observed data, (b) & (e) the MulGETS simulated data, (c) & (f) and their relative error of daily precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11-Tai Lake).

Figure 3

Spatial distribution of Pq95&Pq99 of (a) & (d) the observed data, (b) & (e) the MulGETS simulated data, (c) & (f) and their relative error of daily precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11-Tai Lake).

Close modal

MulGETS is characterized by its ability to generate precipitation sequences with spatial correlations. The correlation coefficient of precipitation occurrence between stations in the middle and lower reaches of the Yangtze River Basin is larger than that in the upper reaches of the Yangtze River Basin (Figure 4(b1)–4(b11)). MulGETS was able to generate precipitation occurrence sequences consistent with the observed values for all sub-basins, in which all the scattered points were near the 1:1 line. Although with respect to precipitation amounts, the scattered points were farther from the 1:1 line, overall they were still along the line. In the Tai Lake, lower mainstream section, and Poyang Lake areas, the spatial correlation coefficients of simulated values were often lower than that of observation, while those of Jinsha River and Wujiang River were higher than observation. On a daily scale, a good simulation of precipitation occurrence cannot guarantee a good spatial correlation of precipitation amounts. The interstation correlation is induced at the monthly scale for precipitation occurrence and at the seasonal scale for precipitation amounts, so the accuracy of spatial correlation on the daily scale may not be very high. When evaluating the effect of the weather generator, the indices calculated on different time scales may exhibit different performance characteristics, which are determined by the construction and properties of the model.

Annual-scale precipitation

In the 11 sub-basins of the Yangtze River Basin, MulGETS was used to downscale precipitation from 1961 to 2005. The simulation results were evaluated using 9 indices and analysed at both annual and seasonal scales. At the seasonal scale, summer and winter were selected as representatives.

Spatial distribution of pint

The Pint of the Yangtze River Basin has an increasing pattern from the west to the east (Figure 5(a)). The observed precipitation amount in the lower reaches is greater than that in the middle reaches, and it is higher in the middle reaches than in the upper reaches. Moreover, stations on the south and the north sides of the basin were rainier. The comparison of Figure 5(a) and 5(b) shows that MulGETS can reasonably simulate the Pint of precipitation at most stations in the Yangtze River Basin and retain the increasing pattern of low and high values in the west and east, respectively. The spatial distribution of the simulated values was also largely consistent with the observed values. The simulation effects of each sub-basin were different.

Figure 4

Observed correlation of precipitation amounts vs. Generated correlation (top) and Observed correlation of precipitation occurrence vs. Generated correlation occurrence (bottom) for all station pairs over all 11 watersheds.

Figure 4

Observed correlation of precipitation amounts vs. Generated correlation (top) and Observed correlation of precipitation occurrence vs. Generated correlation occurrence (bottom) for all station pairs over all 11 watersheds.

Close modal
Figure 5

Spatial distribution of Pint of (a) the observed data, (b) the MulGETS simulated data, (c) and their relative error of annual precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11-Tai Lake).

Figure 5

Spatial distribution of Pint of (a) the observed data, (b) the MulGETS simulated data, (c) and their relative error of annual precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11-Tai Lake).

Close modal

As shown in Figure 5(c), the simulation errors of most stations were between −5 and 5%. The stations with errors below −5% were mainly located in the Jinsha River. In comparison, the stations with errors more significant than 5% were mainly located in the Chongqing and Sichuan area, eastern Mintuo River, southern Jialing River, western Wujiang River, and western upper mainstream. This region is the junction of mountains and plains; therefore, precipitation at different stations in the same sub-basin varies greatly, leading to precipitation homogenization at some stations and sizeable relative errors of simulation values.

Performance of various indices

The boxplots of each index are shown in Figure 6(a)–6(i). Overall, MulGETS has a good simulation effect on each index, and the box line of the simulated value is generally consistent with the observations.

Figure 6

Comparison between indices of annual scale observation and MulGETS simulated precipitation.

Figure 6

Comparison between indices of annual scale observation and MulGETS simulated precipitation.

Close modal

MulGETS can simulate the precipitation intensity indices accurately on an annual scale (Figure 6(a)–6(c)). It can also reasonably reproduce each station's Pint characteristics (Figure 6(a)). For example, the observed and simulated values of each station are in good agreement, and the characteristics of different stations (even adjacent ones) were also well distinguished. In most sub-basins, Ppdd shows a slight difference between stations; MulGETS can accurately capture the differences between stations in other basins, such as the Ppdd in some stations in the Mintuo and Jinsha River. However, at stations where Ppdd has high interannual variation, the simulated Ppdd shows lower variability; this indicates that the error will be larger in years when drought time is overconcentrated or scattered. In general, Ppdd in these basins is above 50%, and most of them reach 80%, indicating a high proportion of consecutive dry days. Pnl90 reflects the total number of precipitation days. As shown, Pnl90 has an inverse correlation with Ppdd. MulGETS slightly underestimated the interannual variance in Pnl90.

Clearly, the precipitation magnitude indices of the simulated values are mostly consistent with the observations; however, the range of boxplots at some stations is slightly different from the actual values (Figure 6(d)–6(f)). Specifically, there was a good correspondence between Pq95 and Pint. As shown in Figure 6(a) and 6(d), stations with high Pint are more likely to have a higher Pq95, and the interannual variation of Pq95 is greater than that of Pint. MulGETS also have good simulating effect on Pq95 (Figure 8(d)). For stations where precipitation varies widely from year to year, the simulated values show the same features and vice versa. However, the simulation of Pnl90 is much larger overall, especially in the Jinsha River, Mintuo River, Jialing River, Wujiang River, Hanjiang River, and Dongting Lake, most of which are in the middle and upper reaches of the Yangtze River Basin. The errors of Pnl90 seem to indicate that overestimating the 10% largest part of precipitation is one of the reasons for the systematic errors. However, Px5d was underestimated in most basins. These two inverse errors may cancel each other to ensure a more accurate overall mean, such as Pint.

Most of the annual scale precipitation persistence indices are well simulated; however, in some basins, a systematic error was observed (Figure 6(g)–6(i)). Stations with a relatively medium Pxcdd had a more accurate Pxcdd (Figure 6(g)–6(i)). For example, the simulated Pxcdd values for some stations in the Jinsha River and Mintuo River are slightly overestimated; for the Jialing River, Hanjiang River, the lower mainstream, and Tai Lake they are accurate; for Wujiang River, Dongting Lake, and the middle mainstream basins, they are all low. Because Pxcdd is underestimated in the sub-basins with lower observed Pxcdd, the influence of the simulated errors is comparably small. However, for the stations where Pxcdd is high, overestimation will result in more drought events. Pn10 mm is, to some extent, consistent with Pq95, which has a good relationship with Pint. MulGETS's simulation value for Pn10 mm is accurate at most stations and slightly low at some of the Jinsha River, Mintuo River, and Hanjiang River stations. Pdsav is also well simulated at most stations, and MulGETS precisely captures its interannual variation and interstation differences. Moreover, Pdsav has a positive relationship with Ppdd (Figure 6(b)). However, the Ppdd simulated values are not as precise as that of Pdsav for MulGETS, thus underestimating its interannual variation.

Seasonal-scale precipitation

Spatial distribution of pint

On a seasonal scale, the simulated Pint is in good agreement with the observed ones, and the differences between each sub-basin and the overall distribution can be well simulated. In the Yangtze River Basin, summer precipitation accounted for more than half of the annual precipitation and the Pint is approximately 7.0–24.5 mm/day; in winter, it is less than 10 mm/day in general. The interannual mean precipitation has a spatial distribution of gradually increasing from the upper reaches to the lower reaches, especially in the winter (Figure 7(d)).

Figure 7

Spatial distribution of Pint of observed data (a, d), MulGETS simulated data (b, e), and their relative error (c, f) in the summer (the left column) and the winter (the right column) precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11: Tai Lake).

Figure 7

Spatial distribution of Pint of observed data (a, d), MulGETS simulated data (b, e), and their relative error (c, f) in the summer (the left column) and the winter (the right column) precipitation in the Yangtze River Basin (1: Jinshajiang River, 2: Mintuojiang River, 3: Jialingjiang River, 4: Wujiang River, 5: upper mainstream section, 6: Hanjiang River, 7: Dongting Lake, 8: Poyang Lake, 9: middle mainstream section, 10: lower mainstream section, and 11: Tai Lake).

Close modal
Figure 8

Comparison between indices of observation and MulGETS simulated precipitation in summer.

Figure 8

Comparison between indices of observation and MulGETS simulated precipitation in summer.

Close modal

In summer, the Sichuan area receives a considerable amount of precipitation; at some stations, the amount can be equivalent to the lower reaches (Figure 7(a)). As shown in Figure 7(a) and 7(b), the precipitation conditions of some stations differ from those of the surrounding stations, forming an evident local closure; for example, some stations in the Jinsha River and Hanjiang River. MulGETS can capture such differences (Figure 7(c) and 7(d)), indicating that the model does not provide a completely average precipitation.

At most stations, the relative error of Pint was between −5 and 5%, and the simulation effect was good (Figure 7(e) and 7(f)). The negative values were mainly concentrated in the upstream basins. In summer, the simulated Pint is overestimated in most sub-basins, whereas in winter, only a few stations have overestimated simulation values in the Jialing River, Hanjiang River, and Wujiang River. Unlike the annual scale simulation, the positive relative errors in summer and winter are scattered rather than concentrated in the Sichuan area. Because there is less precipitation in winter, the denominator of the relative error calculation is smaller. If the absolute error is the same, the relative error will be much larger than that in summer. Therefore, the distribution area of stations with significant errors in winter was larger than that in summer throughout the year (Figure 8(f)).

MulGETS can closely simulate the spatial distribution of Pint in the Yangtze River Basin; however, the relative error at some stations is still difficult to ignore. The accuracy of the model should be increased, especially in stations with less precipitation.

Performance of various indices

The boxplots of the seasonal scale indices are shown in Figures 8 and 9. It was found that MulGETS can precisely simulate precipitation at a seasonal scale. For the precipitation intensity indices, the overall effect is good, especially for Ppdd and Pnl90, which are highly consistent with the observations. In the summer, the Pint is larger than the annual mean; in the winter, it is rather small and mostly below 10 mm/day. MulGETS has a good simulation effect in both seasons (Figures 8(a) and 9(a)). Nevertheless, it faces certain problems, in several sub-basins, the interannual variation of simulated values at various stations tends to be consistent. For the Jialing River in Figure 8(a), the maximum interannual observations were not captured. In winter, the Jinsha River as well as the Mintuo River, where the observations are low, have inaccurate Pint's simulated values; the difference between the stations is not captured as well. The simulated summer Ppdd was lower than that of the annual scale, but its interannual variation range was higher than that of the annual scale (Figure 8(b)), which agrees with the observations. This is probably because summer precipitation in the Yangtze River Basin is affected by large-scale ocean circulation and sea surface temperature anomalies such as Hardley circulation and ENSO (El Niño-Southern Oscillation) (Zhou & Wang 2006; Luo et al. 2018), which lead to large interannual fluctuations during the summer precipitation. The simulated Pnl90 results are very consistent with the observation (Figure 8(c)); this is probably because the model refers to the observed rainy/no-rain days and only activates rainfall simulation on rainy days. Therefore, with a steady total number of days, the number of rain days will be close to the observation, as will the Pnl90. The shorter the period, the more concentrated is Pnl90, and thus the boxplot is compressed into a single line (even during winter). This index shows interannual differences on an annual scale (with a longer time range), although its interannual variation is underestimated. In winter, MulGETS can reproduce Ppdd and Pnl90 in most basins (Figure 9(b) and 9(c)). Overall, there is a need to improve the simulation effect of MulGETS for winter precipitation.

Figure 9

Comparison between indices of observation and MulGETS simulated precipitation in winter.

Figure 9

Comparison between indices of observation and MulGETS simulated precipitation in winter.

Close modal

The range of seasonal scale precipitation magnitude indices is wide, but MulGETS can simulate each sub-basin's characteristics and the differences between stations. Compared with the annual scale Pq95, the simulated Pq95 in the summer was not significantly lower. However, in some sub-basins, the interannual variation in simulated values is underestimated. For example, in the Jialing River (Figure 8(d)), the interannual maximum values of some stations were not captured accurately. The simulation results of Pq95 in the winter are in good agreement with those of Pint, and there is a significant error in some of the Jinsha River and Mintuo River stations. For a basin with less precipitation in the winter, most rain days may be concentrated in a few short periods so that the error of Pq95 will lead to a corresponding error of Pint. In the summer, the Pfl90 is slightly overestimated in each sub-basin, indicating that MulGETS simulation values have a systematic error in this index (Figures 8(e) and 9(e)). Pfl90 is larger than Pnl90, but Pnl90 is more accurate, indicating that the simulation value of MulGETS is not accurate enough to capture the distribution of daily precipitation in the summer. In summer, Px5d was generally underestimated in each sub-basin (Figure 8(f)). In winter, the effect was not good. Specifically, the Px5d and its interannual variation are underestimated in the Jinsha River, Mintuo River, Jialing River, Wujiang River, and the mainstream basins of the upper reaches of the Yangtze River. In summary, on a seasonal scale, MulGETS can simulate the differences between stations but is still weak in heavy rainfall simulations. This is a disadvantage when applying the model to precipitation prediction and flood risk management in the Yangtze River Basin.

The simulation effects were varied for the precipitation persistence indices. In general, the simulation effect of Pxcdd in the middle and lower reaches of the Yangtze River Basin is better than that in the upper reaches of the Jinsha River, Jialing River, Mintuo River, and Wujiang River (Figure 8(g)). For example, at the Jialing River Basin's first station, the simulated value was approximately 50% higher than the maximum of inputs and, at the seventh station, 50% lower than the maximum value. Pxcdd was underestimated in most basins in winter; however, the difference was not significant (Figure 9(g)). Pn10 mm is generally underestimated in the Jinsha and Mintuo River and partly underestimated in the Hanjiang River and Dongting Lake. The minimum interannual values were all lower than the observation minima (Figure 8(h)). Some simulated values reached zero in the middle and upper reaches in the winter. Furthermore, when the observation is zero, the simulated result must be zero but not vice versa (Figure 9(h) and 9(i)). This inclination indicates that although the MulGETS simulation result at one station is affected by its surrounding stations, its own precipitation condition still plays a decisive role. The Pdsav of the simulated value is in good agreement with the observation. However, the interannual variance is overestimated at some stations, such as stations in the Jialing River, Hanjiang River, and Dongting Lake (Figure 8(i)). In summary, MulGETS tends to underestimate the number of days with precipitation of more than 10 mm in the summer and overestimate the most extended length without rain, which is also one reason for some other indices' errors.

In summary, owing to the underestimation of heavy rain in summer and the overestimation of rainless days, MulGETS tends to produce a precipitation series that is drier than the observation. However, the extent is not the same in the sub-basins. Specifically, the simulation effect at the stations at the middle and lower reaches, with more precipitation, is better than that at the upper reaches with less precipitation.

The primary purpose of the weather generators is to reconstruct data with statistical features close to the input observation data, where one crucial and challenging factor is to reconstruct the time autocorrelation and space correlation of the data. We evaluated MulGETS, a multisite stochastic weather generator, for its simulation effect on precipitation in the Yangtze River Basin from the perspective of both time and space. The results showed that MulGETS can effectively generate spatially correlated variables. The simulated sequences retained the spatial distribution and temporal variation of the input data to a great extent. The simulation effect of the model varied for each sub-basin. The simulation effect of the model for the basins at the middle and lower reaches was better than that for the upper reaches. A possible explanation is the scanty rainfall conditions and the influence of mountainous topography in the upper reaches. On the time scale, the model performs better on the annual scale than on the seasonal scale, and it performs better in summer than in winter.

MulGETS provides satisfactory results for the simulation of intensity, magnitude, and duration indices and is more accurate at simulating the indices reflecting rainless days. However, its ability to capture maximum values must be improved.

In addition, we proposed a method to evaluate multisite downscaling models by analysing the effects of simulations in terms of both spatial distribution and temporal variation. This study provides a reference not only to apply MulGETS in the Yangtze River Basin but also to understand and improve the MulGETS. This study only examined the simulation effect of the MulGETS model. The next step will be to add results from other models for comparison, which will help to further understand the characteristics of the multisite downscaling model.

We gratefully acknowledge the source code of the MulGETS shared by Professor Chen. The authors would like to thank two anonymous reviewers for their helpful comments and suggestions.

Conceptualization, Yu Li., X. M. Zeng, J.L. Guo; methodology, Yu Li, X. M. Zeng, J. L. Guo; formal analysis, Yu Li; writing – original draft preparation, Yu Li; writing – review and editing, Yu Li, X. M. Zeng; visualization, Yu Li; supervision, X. M. Zeng; funding acquisition, X. M. Zeng. All authors have read and agreed to the published version of the manuscript.

This research was funded by the National Key Research and Development Program of China, grant number 2018YFC1508200, the National Natural Science Foundation of China, grant number 42005023, the Fundamental Research Funds for the Central Universities, grant number B210203067 and Open Funding of Hubei Key Laboratory of Intelligent Yangtze and Hydroelectric Science, China Yangtze Power Co.,Ltd,(NO. ZH2002000103).

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

All relevant data are available from an online repository or repositories (http://data.cma.cn/).

Ahmed
K. F.
,
Wang
G. L.
,
Silander
J.
,
Wilson
A. M.
,
Allen
J. M.
,
Horton
R.
&
Anyah
R.
2013
Statistical downscaling and bias correction of climate model outputs for climate change impact assessment in the U.S. northeast
.
Global and Planetary Change
100
(
2013
),
320
332
.
Araya-Osses
D.
,
Casanueva
A.
,
Román-Figueroa
C.
,
Uribe
J. M.
&
Paneque
M.
2020
Climate change projections of temperature and precipitation in Chile based on statistical downscaling
.
Climate Dynamics
54
(
9–10
),
4309
4330
.
Breinl
K.
,
Turkington
T.
&
Stowasser
M.
2013
Stochastic generation of multi-site daily precipitation for applications in risk management
.
Journal of Hydrology
498
(
2013
),
23
35
.
Brissette
F. P.
,
Khalili
M.
&
Leconte
R.
2007
Efficient stochastic generation of multi-site synthetic precipitation data
.
Journal of Hydrology
345
(
3–4
),
121
133
.
Chen
J.
,
Brissette
F. P.
&
Leconte
R.
2011
Assessment and improvement of stochastic weather generators in simulating maximum and minimum temperatures
.
Transactions of the ASABE
54
(
5
),
1627
1637
.
Chen
J.
,
Brissette
F.
&
Zhang
X. J.
2014
A multi-site stochastic weather generator for daily precipitation and temperature
.
Transactions of the ASABE
57
(
5
),
1375
1391
.
Chen
J.
,
Brissette Francois
P.
&
Zhang
X. C.
2016
Hydrological modeling using a multisite stochastic weather generator
.
Journal of Hydrologic Engineering
21
(
2
),
04015060
.
Chen
J.
,
Chen
H.
&
Guo
S. L.
2018
Multi-site precipitation downscaling using a stochastic weather generator
.
Climate Dynamics
50
(
5
),
1975
1992
.
Cheng
C. S.
,
Li
G.
,
Li
Q.
&
Auld
H.
2008
Statistical downscaling of hourly and daily climate scenarios for various meteorological variables in South-central Canada
.
Theoretical & Applied Climatology
91
(
1–4
),
129
147
.
Chowdhury
A. K.
,
Kar
K. K.
,
Shahid
S.
,
Chowdhury
R.
&
Rashid
M. M.
2019
Evaluation of spatio-temporal rainfall variability and performance of a stochastic rainfall model in Bangladesh
.
International Journal of Climatology
39
(
11
),
4256
4273
.
Fang
J. N.
&
Tacher
L.
2010
An effcient and accurate algorithm for generating spatially-correlated random fields
.
Communications in Numerical Methods in Engineering
19
(
10
),
801
808
.
Fu
G. B.
,
Chiew
F. H. S.
&
Shi
X. G.
2018
Generation of multi-site stochastic daily rainfall with four weather generators: a case study of Gloucester catchment in Australia
.
Theoretical & Applied Climatology
134
(
2018
),
1027
1046
.
Gemmer
M.
,
Jiang
T.
,
Su
B. D.
&
Kundzewicz
Z. W.
2008
Seasonal precipitation changes in the wet season and their influence on flood/drought hazards in the Yangtze River Basin
.
China Quaternary International
186
(
No.1
),
12
21
.
Giorgi
F.
&
Mearns
L. O.
1999
Introduction to special section: regional climate modeling revisited
.
Journal of Geophysical Research
104
(
D6
),
6335
6352
.
Goodess
C.
,
Anagnostopoulou
C.
,
Bárdossy
A.
,
Frei
C.
,
Harpham
C.
,
Haylock
M.
,
Hundecha
Y.
,
Maheras
P.
,
Ribalaygua
J.
,
Schmidli
J.
,
Schmith
T.
,
Tolika
K.
&
Tomozeiu
R.
2012
An intercomparison of statistical downscaling methods for Europe and European regions – assessing their performance with respect to extreme temperature and precipitation events
.
Climatic Change
.
Harpham
C.
&
Wilby
R. L.
2005
Multi-site downscaling of heavy daily precipitation occurrence and amounts
.
Journal of Hydrology
312
(
1
),
235
255
.
Huang
J.
,
Zhang
J. C.
,
Zhang
Z. X.
,
Xu
C. Y.
,
Wang
B. L.
&
Yao
J.
2010
Estimation of future precipitation change in the Yangtze river basin by using statistical downscaling method
.
Stochastic Environmental Research and Risk Assessment
25
(
6
),
781
792
.
Kilsby
C. G.
,
Jones
P. D.
,
Burton
A.
,
Ford
A. C.
,
Fowler
H. J.
,
Harpham
C.
,
James
P.
,
Smith
A.
&
Wilby
R. L.
2007
A daily weather generator for use in climate change studies
.
Environmental Modelling & Software
22
(
12
),
1705
1719
.
Kottegoda
N. T.
,
Natale
L.
&
Raiteri
E.
2003
A parsimonious approach to stochastic multisite modelling and disaggregation of daily rainfall
.
Journal of Hydrology
274
(
1–4
),
47
61
.
Li
J. F.
,
Gan
T. Y.
,
Chen
Y. D.
,
Gu
X.
,
Hu
Z. Y.
,
Zhou
Q. M.
&
Lai
Y. C.
2020
Tackling resolution mismatch of precipitation extremes from gridded GCMs and site-scale observations: implication to assessment and future projection
.
Atmospheric Research
293
,
104908
.
Liao
Y.
,
Zhang
Q.
&
Chen
D.
2004
Precipitation simulation in China with a weather generator
.
Acta Geographica Sinica
59
(
5
),
689
698
.
Liu
Z. F.
,
Xu
Z. X.
,
Charles
S. P.
,
Fu
G. B.
&
Liu
L.
2011
Evaluation of two statistical downscaling models for daily precipitation over an arid basin in China
.
International Journal of Climatology
31
(
13
),
2006
2020
.
Luo
J. F.
,
Chen
M.
,
Li
Y.
,
Peng
T.
&
Xu
H. T.
2018
Influence of the two types of ENSO events on seasonal precipitation over middle and lower reaches of the Yangtze River
.
South-to-North Water Trasfers and Water Science&Technology
016
(
004
),
82
89
.
Semenov
M. A.
2008
Simulation of extreme weather events by a stochastic weather generator
.
Climate Research
35
(
3
),
203
212
.
Srikanthan
R.
&
Pegram
G. G. S.
2009
A nested multisite daily rainfall stochastic generation model
.
Journal of Hydrology
371
(
1
),
142
153
.
Tseng
S. C.
,
Chen
C. J.
&
Senarath
S.
2020
Evaluation of multi-site precipitation generators across scales
.
International Journal of Climatology
40
(
10
),
4622
4637
.
Verdin
A.
,
Rajagopalan
B.
,
Kleiber
W.
,
Podestá
G.
&
Bert
F.
2015
A conditional stochastic weather generator for seasonal to multi-decadal simulations
.
Journal of Hydrology
556
(
2018
),
835
846
.
Wang
W. G.
,
Xing
W. Q.
,
Shao
Q. X.
,
Yu
Z. B.
,
Peng
S. Z.
,
Yang
T.
,
Yong
B.
,
Taylor
J.
&
Singh
V. P.
2013
Changes in reference evapotranspiration across the Tibetan Plateau: observations and future projections based on statistical downscaling
.
Journal of Geophysical Research: Atmospheres
118
(
10
),
4049
4068
.
Wang
W. G.
,
Yu
Z. B.
,
Zhang
W.
,
Shao
Q. X.
,
Zhang
Y. W.
,
Luo
Y. F.
,
Jiao
X. Y.
&
Xu
J. Z.
2014
Responses of rice yield, irrigation water requirement and water use efficiency to climate change in China: historical simulation and future projections
.
Agricultural Water Management
146
(
2014
),
249
261
.
Wetterhall
F.
,
Bárdossy
A.
,
Chen
D.
,
Halldin
S.
&
Xu
C. Y.
2006
Daily precipitation-downscaling techniques in three Chinese regions
.
Water Resources Research
42
(
11
),
W11423
.
Wilby
R. L.
,
Dawson
C. W.
&
Barrow
E. M.
2002
SDSM – a decision support tool for the assessment of regional climate change impacts
.
Environmental Modelling & Software
17
(
2
),
145
157
.
Wilks
D. S.
1998
Multisite generalization of a daily stochastic precipitation generation model
.
Journal of Hydrology
210
(
1–4
),
178
191
.
Wood
A. W.
,
Leung
L. R.
,
Sridhar
V.
&
Lettenmaier
D. P.
2004
Hydrologic implications of dynamical and statistical approaches to downscaling climate model outputs
.
Climatic Change
62
(
1–3
),
189
216
.
Xu
C. Y.
1999
From GCMs to river flow: a review of downscaling methods and hydrologic modelling approaches
.
Progress in Physical Geography: Earth and Environment
23
(
2
),
229
249
.
Zeng
X. M.
,
Wang
M.
,
Wang
N.
,
Yi
X.
,
Chen
C. H.
,
Zhou
Z. G.
,
Wang
G. L.
&
Zheng
Y. Q.
2017
Assessing simulated summer 10-m wind speed over China: influencing processes and sensitivities to land surface schemes
.
Climate Dynamics
50
(
2018
),
4189
4209
.
Zhang
Q.
,
Xu
C. Y.
,
Gemmer
M.
,
Chen
Y. D.
&
Liu
C.
2009
Changing properties of precipitation concentration in the Pearl River basin, China
.
Stochastic Environmental Research
23
(
3
),
377
385
.
Zheng
J. Z.
,
Wang
W. G.
,
Cao
X. C.
,
Feng
X. Z.
,
Xing
W. Q.
,
Ding
Y. M.
,
Dong
Q.
&
Shao
Q. X.
2018
Responses of phosphorus use efficiency to human interference and climate change in the middle and lower reaches of the Yangtze River: historical simulation and future projections
.
Journal of Cleaner Production
201
(
2018
),
403
415
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Supplementary data