Hydrological models are often plagued by substantial uncertainties in model parameters when analyzing water balance, predicting long-time streamflow, and investigating climate-change impact in watershed management. In this study, a Bayesian Box–Cox transformation three-level factorial analysis (BBC-TFA) method is developed for revealing the influence of parameter uncertainty on the runoff in the Naryn River Basin. BBC-TFA cannot only quantify the uncertainty through Bayesian inference but also investigate the individual and interactive effects of multiple parameters on model output. Main findings disclose that: (i) the contribution rate of runoff potential parameter during the non-melting period reaches 88.22%, indicating a flood risk in the rainy season; (ii) the contribution rate of snow temperature lag factor is the highest during the snow-melting period and the entire year (respectively occupying 76.69 and 53.70%), indicating that the glacier melting exists in the Naryn River Basin throughout the year; (iii) the Box–Cox transformation can successfully remove residual variance and enhance the correlation between input and output variables. These findings serve to revealing the presence of glacial resources in the study basin and the significant runoff during the rainy season. Policymakers can consider water storage during the rainy season while developing glacier resources to alleviate water scarcity.

  • An integrated Bayesian Box–Cox transformation three-level factorial analysis (BBC-TFA) method is developed.

  • BBC-TFA is first applied to the Naryn River Basin, Central Asia.

  • BBC-TFA can remove heteroskedasticity between simulated and observed data.

  • Effects of parameters on hydrological responses are identified.

  • Infiltration-excess runoff and snowmelt are important to the runoff in the Naryn River Basin.

Acronyms

SWAT

Soil and water assessment tool

DREAM

Differential evolutionary adaptive metropolis

MCMC

Markov chain Monte Carlo

PDF

Probability density function

LH-OAT

The Latin hypercube one factor at a time

TFA

Three-level factorial analysis

ArcGIS

Geographic information system

HRU

Hydrologically comparable unit

R2

Coefficient of determination

BCT

Box–Cox (BC) transformation

NSE

Nash–Sutcliffe model efficiency

PBIAS

Percentage bias

Notations

θ

The set of aggregate parameters in SWAT

Qobs

Set of observe discharge values

Qsim

Set of simulation discharge values

P(Qobs|θ)

Prior parameter probability distribution

P(θ)

Prior information for the parameters set θ

L(Qobs|θ)

Probability density of the observed discharge values

P(Qobs)

Normalizing constant

f(Qobs|θ)

Probability density of the likelihood

εi(θ)

The residual on day i

σε,i(θ)

Residual standard deviation

λ

Transformation parameters

εijk

A portion of uncertainty due to probability

yijk

Measured reaction for A in ith level and B in jth level for kth replicate

(τβ)ji

Effect of the interaction between τi and βj

τi

Effect of factor A in i level

βj

Effect of factor B in j level

μ

Total average effect

SS

The sums of squares

SSAB

Sum of squares for the interaction between factors A and B

SST

Total sum of squares

SSE

Sum of squares error

Motivation

Central Asia is a typical region that is characterized by the influence from the dominant Westerlies with the contributions from the East Asian and Indian monsoons. The combination of the unique atmospheric circulation and its distinctive geographical location make it one of the most ecologically and environmentally fragile regions in the world (e.g., the Qinghai-Tibet Plateau). Except for high-altitude regions, most of Central Asia is characterized by arid and semi-arid climates. Hydrological modeling in these regions is challenging due to the unique hydroclimatic variables and the sparse distribution of meteorological stations (Kan et al. 2017; Didovets et al. 2021). As the two largest rivers in Central Asia, the Amu Darya and the Syr Darya both terminate in the Aral Sea. Since the dissolution of the Soviet Union in 1991, unreasonable management and allocation of water resources between upstream countries (e.g., Kyrgyzstan and Tajikistan) and downstream countries (e.g., Kazakhstan, Uzbekistan, and Turkmenistan) have caused conflicts and highlighted water scarcity and water security issues in the downstream areas (Schöne et al. 2018; Zheng et al. 2019). The Naryn River originates in the southern foothills of the Tien Shan Mountains in eastern Kyrgyzstan and flows from east to west through Kyrgyzstan into Uzbekistan before joining the Kara River to form the Syr Darya. However, it is still unclear how the water flow responds to natural and anthropogenic factors. Hydrological modeling of the Naryn River Basin is crucial for analyzing water balance, forecasting long-range streamflow, and predicting flood risk. By analyzing the hydrological processes of the basin, decision makers can formulate appropriate policies to alleviate the pressure of water scarcity in such an arid and information-deficient region.

Literature review

Hydrological models, classified into physically based and machine learning (ML) models, were extensively used for analyzing water balance, forecasting long-range streamflow, predicting real-time flood, and investigating climate-change impact in watershed management (Yuan et al. 2018; Cheng et al. 2020; Ansell & Valle 2021; Adnan et al. 2022). Adnan et al. (2021) proposed a novel hybrid method, ELM-PSOGWO, which integrates particle swarm optimization (PSO) and gray wolf optimization (GWO) with extreme learning machine (ELM) for predicting monthly runoff in the Mangla watershed in northern Pakistan. Ikram et al. (2022a) developed a hybrid ML method based on the extended marine predators algorithm (EMPA) and artificial neural network (ANN-EMPA) for flow estimation in the upstream of the Indus River Basin. Hussainzada & Lee (2022) used the Soil and Water Assessment Tool (SWAT) model to determine the hydrological processes and assess the water resource capacity of the Barhab River Basin (BRB) in northern Afghanistan. Generally, the ML models can directly explore the relationship between hydrological cycle variables and runoff in theoretical system models without considering the involved physical processes. The majority of the parameters in the SWAT model are not readily accessible. They can only be derived by calibration against input and output data of the watershed response (Vrugt et al. 2008; Mengistu et al. 2019; Liu et al. 2022; Schuster-Wallace et al. 2022). Besides, hydrological models often encounter substantial uncertainties with respect to the input data, initial and boundary conditions, model structure, and parameters due to insufficient observation data and difference in spatiotemporal scale (Sidle 2021; Wang et al. 2021). These uncertainties can limit the reliability and practical value of the model simulation results while they have not been evaluated in practice (Baig et al. 2022; Han & Morrison 2022).

Over the past decades, a number of research works were conducted for investigating the effect of parameter uncertainty on modeling performance. Farsi & Mahjouri (2019) analyzed the uncertainty of the parameters of the models using the differential evolutionary adaptive metropolis (DREAM) algorithm to decrease the uncertainty of model estimations. Diao et al. (2021) elaborated the solution mechanism of the standard DREAM algorithm, and the algorithm was applied for the optimal operation model of the reservoir group in Jialing River. Xu et al. (2022) used DREAM algorithm to analyze the parameters’ posterior distributions and the nutrient loadings’ simulated uncertainty ranges for agriculture in SWAT model. Wang et al. (2022) used DREAM to obtain the posterior distribution of SWAT model parameters to consider parameter uncertainty. In general, the DREAM algorithm can obtain the accurate posterior distribution of parameters through large-scale simulations (Yang et al. 2008; Zhang et al. 2020; Liu et al. 2021). However, it still uses the traditional statistical assumption of residual homoscedasticity. In the hydrologic modeling, errors are not homoscedastic but heteroskedastic since the size of errors tends to rise with the observed flow magnitude.

Box–Cox transformation (BCT) is effective for eliminating the heteroscedasticity of residuals in the simulation, which can improve the normality and linearity of the data series. BCT has achieved good normality in hydrological frequency analysis and calculations (Qiao et al. 2022). Guo et al. (2022) utilized residual-based modeling to generate and evaluate probabilistic hydrologic and water quality predictions at 18 sites in the United States; where the BCT scheme provided the best predictions of uncertainty for all case studies. Zhang et al. (2022) proposed a new hybrid prediction model called variational mode decomposition (VMD)-BC-Elman by introducing the VMD and BCT into the Elman neural network; the results demonstrated that BCT could remove the skewness of the original streamflow sequence, transform skewed candidate input variables into normal distribution, and enhance the correlation between input variables and output variables. The BCT can be employed to eliminate the heteroskedasticity of model residuals, and the residuals can be assumed to adhere to a generalized error distribution (Cheng et al. 2014, 2018). However, even after translation, model data have trouble showing how several uncertain factors influence model results.

In the uncertainty analysis, it is very important to identify how much each parameter affects the final result. Sensitivity analysis is often used for quantifying the influence of various parameters on the model output, specifically assessing the contribution of parameters to overall model output uncertainty (Uusitalo et al. 2015; Koo et al. 2020). Van Griensven & Meixner (2003) proposed an improved sensitivity analysis approach, named as Latin Hypercube One factor At a Time (LH-OAT), for the sensitivity analyses for a large number of parameters. However, the LH-OAT approach cannot quantify the effect of parameter interactions on model output, it can only offer a sensitivity ranking of parameters (Liu et al. 2017). In comparison, factorial analysis (FA) can quantify the main effect of a single parameter and different-level interactions of multiple input parameters on the system performance (Gao et al. 2021). As an extension of the traditional FA, three-level factorial analysis (TFA) is useful for detecting the curvilinear relationship between the parameters and the response (Zhang et al. 2016; Liu et al. 2019; Jia et al. 2020). Unfortunately, no research work has been reported on the integration of DREAM, BCT, and TFA techniques into SWAT hydrological model to disclose the influence of parameter uncertainty and their interactions on the runoff for the river basin in Central Asia.

Research gap and objective

Due to the importance of water resources management, hydrological simulation has always been a topic of concern for researchers. The majority of the parameters in the SWAT model are not readily accessible. They can only be derived by calibration against input and output data of the watershed response (Vrugt et al. 2008; Mengistu et al. 2019). Parameter errors are always presented throughout the calibration and derivation processes. In addition, the previous hydrological studies in the Naryn River Basin mainly focus on using physical or ML models to simulate river runoff, and there is little research on revealing parameter uncertainty effecting hydrological processes through physical model (i.e., SWAT). Due to the scarcity of data in the Naryn River Basin, the availability of hydrological and meteorological observation data limits streamflow simulation performance. It is critical to undertake a thorough investigation of the uncertainties in the SWAT model simulation process for enhancing the accuracy of the hydrological simulation for the Naryn River Basin. In order to fill these research gaps, further studies are needed to explore the uncertainties associated with hydrological processes in the Naryn River Basin.

This study aims to develop a Bayesian Box–Cox transformation three-level factorial analysis (BBC-TFA) method for assessing parameter uncertainties and their interactions on hydrological model responses. BBC-TFA incorporates BCT, DREAM, and TFA techniques within a general framework. BBC-TFA will then be applied to the Naryn River Basin to quantify the uncertainty of hydrological parameters in the SWAT model. The main novelty and contribution are as follows: (i) this is the first attempt of integrating BCT, DREAM, and TFA techniques into SWAT model; BBC-TFA can remove heteroskedasticity and get a more trustworthy posterior distribution of the parameters by introducing BCT into the DREAM method; (ii) BBC-TFA can obtain the parameters’ contribution; (iii) BBC-TFA is capable of identifying the influence of sensitive parameters on runoff; (iv) BBC-TFA is first applied to the Naryn River Basin (an information-poor area) to reveal the effects of parameter uncertainty. The results can assist relevant researchers in establishing effective hydrological models and help decision makers understand the physical factors associated with the watershed and provide theoretical support for studies in similar watersheds.

The organization of the remaining sections of this paper is as follows: Section 2 describes the framework of BBC-TFA and the evaluation criteria used in this study. Section 3 describes the study area and the data used in this paper. Section 4 discusses the results of this study. Section 5 concludes the study and discusses the limitations and potential extensions of the BBC-TFA method.

The technical framework of the BBC-TFA method is presented in Figure 1. BBC-TFA can systematically address the heteroscedasticity of hydrological simulation and investigate parameter uncertainty and their interconnections. Based on input data, the SWAT model can represent the correlation between parameters and responses. Sensitive parameters can be screened using the LH-OAT approach to enhance model optimization and uncertainty analysis efficiency. BCT is combined with DREAM to solve the heteroskedasticity of the hydrological simulation and to obtain the posterior distribution of the hydrological parameters. To identify the effect of parameter uncertainty on the runoff responses, TFA is introduced where the parameter values are divided into three levels according to the parameters’ posterior distributions.
Figure 1

The framework of BBC-TFA method.

Figure 1

The framework of BBC-TFA method.

Close modal

The DREAM algorithm with BCT

The Markov Chain Monte Carlo (MCMC) analysis is an effective approach for addressing uncertainty in hydrologic modeling. For long-running models, the DREAM algorithm can evaluate multiple parameters sets simultaneously, which can significantly reduce the time required for MCMC analysis (Zhang et al. 2020). The probability density function (PDF) of parameter posterior is proportional to the likelihood function multiplied by the assumed PDF of parameter prior, according to Bayes’ theorem. The theorem can be expressed by the following equation (Liu et al. 2017):
(1)
where θ is the set of aggregate parameters in SWAT, such as V__ALPHA_BF, V__CN2; Qobs is the set of observe discharge values. P(Qobs|θ) represents prior parameter probability distribution; P(θ) is prior information for the parameters set θ; L(Qobs|θ) denotes the probability density of the observed discharge values set when the parameter set is θ; P(Qobs) is normalizing constant, except for the Bayesian model selection and Bayesian model averaging, it is not required to compute in practice. The common practice assumes that the differences between the measured and simulated values are uncorrelated and homoscedastic, or that the residuals satisfy the Gaussian distribution, one of the most prevalent PDFs. If the calibration period has n time points, then the probability density, also known as the likelihood function, may be written as (Tasdighi et al. 2018; Wang et al. 2022):
(2)
where εi(θ) is the residual on day i, the value that represents the deviation between the actual streamflow on day i, and the streamflow simulated by the model using the set θ of input parameters; f(Qobs|θ) is the probability density of the likelihood that observed streamflow Qobs is obtained for every given θ set of parameters; σε,i(θ) is the residual standard deviation on day i. It was previously assumed that the errors satisfy a Gaussian distribution, so before using the data, it must be transformed such that the residuals are homoscedastic. BCT is a popular tool for transforming data (Box & Cox 1964). Using the BCT, we can convert the Q value of the simulated (Qsim) and observed (Qobs) data to the Q′ value of the following:
(3)
The residuals of transformed data (QsimQobs) may be turned homoscedastic by adjusting the transformation parameters λ1 and λ2. Together with the SWAT aggregate parameters, the transformation parameter is computed by optimizing the likelihood function. But the parameter λ2 is difficult or impossible to be directly inferred by the maximum likelihood method, it may cause complications in the autocalibration process (Cheng et al. 2018). We strive to only use λ2 to modify the data and avoid using λ1 as much as possible. In the case when the residuals of the converted data follow a normal distribution, we may express the likelihood function as (Joseph 2011):
(4)
The residual of the transformed data on the day i denoted by εi′(θ) is defined as follows:
(5)

The residuals of the original data are of particular importance, not the converted data, thus we need to add a fraction in the likelihood function to account for this.

Three-level factorial analysis

One of the most reliable methods for determining the impact of several independent variables and their interactions on a dependent variable is FA (Duan et al. 2018). In this study, the three-level factor analysis (TFA) method is adopted to analyze the complex parameters relationship in the SWAT model. It conducts 3k experiments to realize each of the k factors at low (L), medium (M), and high (H) levels in the three-level factorial design. The linear statistical effects of the parameters interacting with each other are as follows when distinct parameters exist at various levels:
(6)
where εijk includes a portion of uncertainty due to probability; yijk is the measured reaction for A in ith level and B in jth level for kth replicate, subscript i, j denotes i, j level, subscript k denotes k replicate; τi, βj is the effect of factor A in i level and the effect of factor B in j level, respectively; (τβ)ji is the effect of the interaction between τi and βj; μ is the total average effect. SSA and SSB are the sums of squares for factor A and factor B, respectively; SSAB is the sum of squares for the interaction between factors A and B; SST represents the total sum of squares. SSE is the sum of squares error. A method of calculating them is (Wang et al. 2021):
(7)
(8)
(9)
(10)
(11)

By dividing the sum of squares for each input factor (SSA, SSB, and SSAB) by the total sum of squares for all parameters (SST), the sensitivity index of each factor was obtained. The model's parameters and their values are selected using the DREAM method, which includes the posterior distribution's shape, statistical properties of the posterior parameters, and correlation coefficients. The TFA has a wider range of parameter values, which allows for a more accurate assessment of parameters’ fuzziness and the links between them.

Model performance evaluation

Many parameters in the SWAT model can be derived by calibrating simulated data to observed data (Sidle 2021). The fundamental process of getting ideal parameter values that closely match the simulated and observed data is known as model calibration. In this work, calibration and validation are carried out in accordance with the DREAM method. The coefficient of determination (R2) and Nash–Sutcliffe model efficiency (NSE) are presented to assess the model's capacity for simulation. R2 is an indicator of the degree of linear relationship between observed data and simulated data; NSE is a standardized statistical measure that represents the degree of fit between observed data and simulated data to the 1:1 line (Moriasi et al. 2007; Triana et al. 2019; Chiphang et al. 2020). The equations are:
(12)
(13)
The percentage bias (PBIAS) compares the average tendency of the simulated data to the trend of the corresponding real data. The ideal PBIAS value is zero. A positive score implies that the model underestimates, while a negative value shows that it overestimates (Gupta et al. 1999). PBIAS is given as:
(14)
The Central Asian Tien Shan Mountains are the source of the Naryn River, it flows westward through the Fergana Valley into Uzbekistan before joining the Kara River to form the Syr Darya (Liu et al. 2020b). The Naryn River has a length of 807 km and a catchment area of 58,205 km2 (Gan et al. 2015). It has a continental climate; thus, winters are cold, and summers are quite hot (Shannon et al. 2023). The average yearly precipitation ranges from 280 to 450 mm (Wu et al. 2022). Spring and early summer see the greatest proportion of precipitation (Kriegel et al. 2013). Snowfall occurs throughout the winter, when it makes up a significant amount of precipitation in the mountains owing to plentiful precipitation and cold temperatures. The glaciers of the Central Tianshan Mountains occupy around 2% of the basin area and contribute considerably to the yearly flow. Snowmelt contributes significantly to runoff. In summer, melting glaciers provide a specific source of water for the river. Figure 2 illustrates how daily flow data from the Kurgan hydrological station (41°17′N–72°10′E) were used in this work for hydrological modeling. The highest amount of runoff occurs in the month of June, when melting snow in the mountains increases the volume of water flowing down the rivers. In the Naryn River Basin, uneven elevation, complex vegetation distribution, and significant impact of glaciers present challenges and potential research opportunities for understanding hydrological processes in mountainous areas. Therefore, the use of hydrological modeling is desirable in this watershed to better understand hydrological processes and manage water resources.
Figure 2

The study area.

The CGIAR Consortium for Spatial Information (CGIAR-CSI) provided digital elevation model (DEM) grid sizes of 90 m × 90 m (https://srtm.csi.cgiar.org). Digital topographic data were utilized to divide watersheds into sub-basins, build river networks, and identify water bodies and outlets. From the Harmonized World Soil Database (HWSD) of the Food and Agriculture Organization of the United Nations, soil attributes are retrieved; the database has a spatial resolution of about 1,000 m (http://www.fao.org/soils-portal). Data on land use characteristics were taken straight from SWAT model database. Only 1951–1974 data of runoff from the Kurgan hydrographic station have been found because of the scarcity of data (https://portal.grdc.bafg.de). The historical (1951–1974) daily temperature data are downloaded from the website of Princeton Global Meteorological Forcing Dataset (PGMFD, http://hydrology.princeton.edu). PGMFD is derived by merging a number of observation-based datasets with the National Center for Environmental Prediction-National Center for Atmospheric Research (NCEP-NCAR) reanalysis. The temperature in the dataset is a perfect reflection of the actual value (Ahmed et al. 2019). The daily precipitation grids for the period 1951–1974 are taken from the Asian Precipitation-Highly Resolved Observational Data Integration Towards Evaluation (APHRODITE) dataset (http://aphrodite.st.hirosaki-u.ac.jp). APHRODITE is a long-term continental daily precipitation product that has been shown to be superior to other models and can be used directly (Ghimire et al. 2018).

This study uses the ArcSWAT2012, which is an ArcGIS extension and interface for SWAT, for model configuration and parameterization. A 100,000-ha threshold was used for the DEM data to split the study region into 49 sub-basins, which were then segmented into 541 hydrologically comparable units (HRUs). The thresholds of 10% for land use, 10% for soil, and 5% for slope were established, and the multiple slope option was used in the creation of each hydrologically comparable unit (HRU), with three classes of 0–10%, 10–20%, and >20%. There is a distinct set of land use types, soil types, and slope classes represented by each HRU in the SWAT model. Based on available data from 1951 to 1974, the runoff was simulated. The study ensures that the climate data used for calibration and validation do not differ significantly, with wet, mild, and dry years occurring in both periods (Arnold et al. 2012; Polong et al. 2023). In this study, the model is calibrated and validated using available data from 1951 to 1974. The calibration period is from 1954 to 1969, during which the average monthly flow is 403.1 mm/s. The validation period uses monthly data sets from 1970 to 1974, during which the average monthly flow is 375.4 mm/s.

In the SWAT model, there are several HRUs in an area, which might result in thousands of distinct parameter changes during model calibration, making it impossible for the researcher to alter them one by one. As a result, distinct parameters must be aggregated. To aggregate parameters in this article, two sorts of prefixes, ‘V_’ and ‘R_’, are employed. v__ means the default parameter is replaced by a given value, r__ means the existing parameter value is multiplied by (1 + a given value) (for the detailed parameter aggregation scheme, please refer to Abbaspour et al. 2007). Table 1 contains brief explanations of each parameter and its default values. According to the suggestion with Abbaspour et al. (2017), it is advisable to consider as few parameters as possible for model validation and calibration. In more than 200 parameter files, 19 parameters were chosen based on reviews of the literature, and the LH-OAT method was used to determine sensitivity rankings (Arnold et al. 2013; Abbaspour 2015).

Table 1

Descriptions and initial ranges of parameters

Parameter namesDescriptionInitial ranges
V__ALPHA_BF Baseflow alpha factor (days) [0.1,1] 
V__CN2 Initial SCS runoff curve number for moisture condition II [30,98] 
V__CH_K2 Effective hydraulic conductivity in main channel alluvium (mm/hr) [0.025,500] 
V__SMFMX Melt factor for snow on June 21 (mm H2O/°C-day) [−5,5] 
V__TIMP Snow pack temperature lag factor [0.01,1] 
V__RCHRG_DP Deep aquifer percolation fraction [0,1] 
V__SMTMP Snow melt base temperature (°C) [−5,5] 
V__ESCO Soil evaporation compensation factor [0.01,1] 
V__SMFMN Melt factor for snow on December 21 (mm H2O/°C-day) [1.4,6.9] 
R__SOL_K Soil hydraulic conductivity (mm/h) [−0.5,0.5] 
R__SOL_AWC Available water capacity of the soil layer [−0.5,0.5] 
V__SFTMP Snowfall temperature (°C) [−5,5] 
V__OV_N Manning's ‘n’ value for overland flow [0.008,0.48] 
V__GW_DELAY Groundwater delay (days) [0,500] 
V__REVAPMN Threshold depth of water in the shallow aquifer for ‘revap’ or percolation to the deep aquifer to occur (mm H2O) [0,500] 
V__GWQMN Threshold depth of water in the shallow aquifer required for return flow to occur (mm) [0,5000] 
V__HRU_SLP Average slope steepness (m/m) [0.01,0.6] 
V__SURLAG Surface runoff lag coefficient [0.05,24] 
V__EPCO Plant uptake compensation factor [0.01,1] 
Parameter namesDescriptionInitial ranges
V__ALPHA_BF Baseflow alpha factor (days) [0.1,1] 
V__CN2 Initial SCS runoff curve number for moisture condition II [30,98] 
V__CH_K2 Effective hydraulic conductivity in main channel alluvium (mm/hr) [0.025,500] 
V__SMFMX Melt factor for snow on June 21 (mm H2O/°C-day) [−5,5] 
V__TIMP Snow pack temperature lag factor [0.01,1] 
V__RCHRG_DP Deep aquifer percolation fraction [0,1] 
V__SMTMP Snow melt base temperature (°C) [−5,5] 
V__ESCO Soil evaporation compensation factor [0.01,1] 
V__SMFMN Melt factor for snow on December 21 (mm H2O/°C-day) [1.4,6.9] 
R__SOL_K Soil hydraulic conductivity (mm/h) [−0.5,0.5] 
R__SOL_AWC Available water capacity of the soil layer [−0.5,0.5] 
V__SFTMP Snowfall temperature (°C) [−5,5] 
V__OV_N Manning's ‘n’ value for overland flow [0.008,0.48] 
V__GW_DELAY Groundwater delay (days) [0,500] 
V__REVAPMN Threshold depth of water in the shallow aquifer for ‘revap’ or percolation to the deep aquifer to occur (mm H2O) [0,500] 
V__GWQMN Threshold depth of water in the shallow aquifer required for return flow to occur (mm) [0,5000] 
V__HRU_SLP Average slope steepness (m/m) [0.01,0.6] 
V__SURLAG Surface runoff lag coefficient [0.05,24] 
V__EPCO Plant uptake compensation factor [0.01,1] 

The LH-OAT method is a combination of the Latin Hypercube (LH) sampling method proposed by McKay and the random One factor At a Time (OAT) sensitivity analysis method proposed by Morris (1991) and McKay et al. (2000). The LH-OAT method partitions the parameters of interest into layers and obtains several sets of data through LH sampling, with each parameter being sampled once per layer. By conducting a sensitivity analysis on each group of data, the sensitivity of each parameter in each group of data can be known, and the average sensitivity of each group of data can be calculated. This method can effectively identify the core parameter factors that affect the model results and significantly improve the credibility of the model parameters (Pan et al. 2022). The LH-OAT method is used to determine the sensitivity rankings of the 19 selected parameters in the SWAT model. The most sensitive parameters are then screened to identify which parameters have the most significant impact on the model's outputs, and the findings are presented in Figure 3. From the results, surface runoff-related parameters (e.g., ALPHA_BF, CN2, and CH_K2) are the most sensitive parameters, followed by snow-melting parameters (e.g., SMFMX, TIMP, SMTMP, and SMFMN).
Figure 3

Sensitivity ranking of the SWAT parameters from LH-OAT method.

Figure 3

Sensitivity ranking of the SWAT parameters from LH-OAT method.

Close modal

Parameter distribution

According to the sensitivity assessment, 10 of the most sensitive parameters were chosen. The DREAM algorithm was used for parameterization, and the prior distributions were assumed to be uniform with the initial ranges specified in Table 1. The convergence criterion of R = 1.2 was fulfilled within 34,000 iterations, and the iteration ended. Each chain performed 4,250 iterations with a total time of 151,774 seconds. As a warm-up, the first 50% of samples in each chain are discarded, and the average acceptance rate is 13.5%. After R convergence was attained, 17,000 parameter sets (2,125 parameter sets in each chain) were sampled to generate posterior distributions. Figure 4(a) shows the posterior PDFs of the model parameters. All parameters’ ranges looked to be distinct from their starting values within a reasonably limited span. Ten parameters’ (e.g., ALPHA_BF, CN2, CH_K2, SMFMX, TIMP, RCHRG_DP, SMTMP, ESCO, SMFMN, and SOL_K) posterior PDFs exhibited nearly normal distribution features, suggesting that their posterior PDFs are well defined. Two parameters (e.g., RCHRG_DP and SMTMP) have posterior PDFs with distributions that are negatively skewed and heavily concentrated at their lower limits. Finally, the sensitivity rankings of these 10 parameters based on the LH-OAT method are derived and presented in Figure 5.
Figure 4

Posterior distributions of the parameters in different methods.

Figure 4

Posterior distributions of the parameters in different methods.

Close modal
Figure 5

Sensitivity ranking of the SWAT parameters from LH-OAT method after DREAM.

Figure 5

Sensitivity ranking of the SWAT parameters from LH-OAT method after DREAM.

Close modal

Table 2 displays the correlation coefficients for the calculated parameters. Based on the data, the strongest positive correlations are seen between TIMP and SMFMX, with a value of 0.7. This is because both TIMP and SMFMX influence the rate at which snowmelt runoff is produced. SMFMX is the melt factor for snow on June 21. The Naryn River Basin is in the Northern Hemisphere, according to the file ‘SOIL AND WATER1 ASSESSMENT TOOL INPUT/OUTPUT FILE DOCUMENTATION’, the parameter SMFMX means the maximum melt factor in this study. It has affirmative effects on the generation of snowmelt runoff. TIMP also has a positive influence on snowmelt runoff production. It is therefore desirable to delve further into the interrelationship between the factors.

Table 2

The correlation coefficients between the samples of the parameter estimated from DREAM algorithm

v__ALPHA_BFv__CN2v__CH_K2v__SMFMXv__TIMPv__RCHRG_DPv__SMTMPv__ESCOv__SMFMNr__SOL_K
v__ALPHA_BF 0.257 0.109 −0.368 −0.227 −0.005 0.327 0.067 −0.181 −0.041 
v__CN2  0.075 −0.353 −0.482 −0.052 0.132 0.001 −0.104 −0.067 
v__CH_K2   −0.183 −0.127 −0.011 0.013 0.039 0.041 0.008 
v__SMFMX    0.725 0.075 − 0.52 −0.018 0.214 0.029 
v__TIMP     0.178 −0.249 0.11 0.183 0.021 
v__RCHRG_DP      −0.019 0.101 −0.057 −0.016 
v__SMTMP       0.057 −0.103 −0.019 
v__ESCO        −0.07 −0.059 
v__SMFMN         0.13 
r__SOL_K          
r__SOL_AWC           
v__ALPHA_BFv__CN2v__CH_K2v__SMFMXv__TIMPv__RCHRG_DPv__SMTMPv__ESCOv__SMFMNr__SOL_K
v__ALPHA_BF 0.257 0.109 −0.368 −0.227 −0.005 0.327 0.067 −0.181 −0.041 
v__CN2  0.075 −0.353 −0.482 −0.052 0.132 0.001 −0.104 −0.067 
v__CH_K2   −0.183 −0.127 −0.011 0.013 0.039 0.041 0.008 
v__SMFMX    0.725 0.075 − 0.52 −0.018 0.214 0.029 
v__TIMP     0.178 −0.249 0.11 0.183 0.021 
v__RCHRG_DP      −0.019 0.101 −0.057 −0.016 
v__SMTMP       0.057 −0.103 −0.019 
v__ESCO        −0.07 −0.059 
v__SMFMN         0.13 
r__SOL_K          
r__SOL_AWC           

Bold indicates that the absolute value of the sample correlation coefficient is larger than 0.5.

Figure 6(a) shows simulated back-end discharge when posterior PDFs are highest as well as the 95% confidence intervals (CI) for monthly discharge. The monthly observed data from the Kurgan hydrometric station was used to perform the model calibration and validation. R2 values in calibration and validation are 0.78 and 0.61, and ENS values in calibration and validation are 0.75 and 0.58, respectively. Results show that the SWAT model simulation is fairly preferred in the Naryn River Basin. Inadequacies in the model structure, incorrect input data, or inaccurate calibration data might all contribute to the discrepancies between the observed and predicted discharge. In reality, the inaccuracy of input and calibration data is born out of the inevitable random mistakes of hydro-meteorological data, and a hydrological model (like SWAT) has limitations in its ability to represent hydrological processes due to a structural mistake (Leta et al. 2015).
Figure 6

The prediction intervals for streamflow with different methods.

Figure 6

The prediction intervals for streamflow with different methods.

Close modal

Comparison with DREAM

In this study, parameter uncertainty based on the SWAT model is also analyzed using the untransformed DREAM method. Figure 4(a) and 4(b) show the probability density histograms of the posterior distributions of the model parameters for the transformed DREAM method and the untransformed case, respectively, and the results show that there are big dissimilarities in the posterior distributions of the parameters considering the residual variance. Figure 4(b) shows that when applying the traditional DREAM method, the model parameters are distributed in a larger parameter interval. This is because the BCT used a similar logarithmic transformation to both observed and simulated data. It could greatly increase the data's normalcy, with the obvious result being that the bigger numbers make a bigger difference. This also made the chain of DREAM approaches need additional iterations to determine the optimal parameter distribution. The transformed DREAM method considering residual variance has a smaller distribution of parameter posteriors and more concentrated values of parameter posteriors than the untransformed DREAM method. Liu et al. (2017) used the DREAM algorithm to approximate the posterior distribution of model parameters through Bayesian inference. The results indicate that the posterior parameter distribution has a large range.

Figure 6 shows the ranges of model parameters and input uncertainty in discharge simulations with and without residual variance, respectively. Using the DREAM method, to calibrate and validate, PBIAS values are 11.8 and 11.7. Using the transformed DREAM method, the values of PBIAS for calibration and validation are −2 and −2.4. The results show that the range of uncertainty in the discharge simulations due to model input and parameter uncertainty is reduced considering the heteroskedasticity case, which may be caused by the reduction in the range of model parameters due to the introduction of the heteroskedasticity model. The BCT can improve the accuracy of the DREAM method for hydrological model uncertainty prediction and obtain more accurate parameter values and more representative uncertainty prediction intervals.

Errors in hydrologic models tend to be heteroskedastic, instead of homoskedastic, since their magnitude grows in proportion to the size of the observed flow (Joseph & Guillaume 2013). Figure 7(a) shows a plot of flow residuals for the DREAM method, with the most residuals for observations between 0 and 500 m3/s. Typically, in homoskedastic cases, this region displays the biggest vertical distribution of residuals. Clearly, the graph does not show this outcome. Therefore, data transformations are necessary to guarantee post-transformation residual homoscedasticity before employing the likelihood function. The residuals of the transformed data of the observed streamflow are shown in Figure 7(b). The distribution of residuals varies along the x-axis; as it is greater, the residuals’ vertical distribution should be higher. The observed variations in the flow do not substantially affect the residuals’ variance.
Figure 7

Streamflow residuals.

Figure 7

Streamflow residuals.

Close modal

Effects of individual parameters

Based on the simulated DREAM algorithm obtained after BCT, to delve further into the one-at-a-time and interaction impacts of parameters on model responses, the TFA experiment was carried out, including non-melting period peak flow (NPF), snow-melting period peak flow (SPF), and average annual flow (AAF). Based on the sensitivity ranking of the SWAT parameters from LH-OAT method after DREAM, four parameters (i.e., TIMP, SMFMX, CN2, and CH_K2) were chosen as the guiding principles for the design. The functions played by the three parameters SMFMX, SMFMN, and SMTMP in the SWAT model are substantially overlapping, thus only SMFMX, the most sensitive of the three, is selected. In Table 3, it can be seen that there are four parameters with three distinct values, where the 2.5th, 50th, and 97.5th percentiles of posterior PDF are represented by the levels L, M, and H, respectively. There are 34 treatment combinations in this study and Table 4 represents the 34 possible treatment permutations with their respective serial numbers. After running a total of 34 treatments, the simulation results for each combination are shown in Figure 8.
Table 3

The design of four parameters of three levels

ParameterLevel
LowMediumHigh
v__CN2.mgt 97.740 97.750 97.770 
v__CH_K2.rte 200.000 200.100 200.379 
v__SMFMX.bsn 8.531 8.962 8.999 
v__TIMP.bsn 0.529 0.748 0.750 
ParameterLevel
LowMediumHigh
v__CN2.mgt 97.740 97.750 97.770 
v__CH_K2.rte 200.000 200.100 200.379 
v__SMFMX.bsn 8.531 8.962 8.999 
v__TIMP.bsn 0.529 0.748 0.750 
Table 4

Serial number of 34 treatment combinations (L: low level; M: medium level; H: high level)

Serial number12345678910111213141581
v__CN2 … 
v__CH_K2 … 
v__SMFMX … 
v__TIMP … 
Serial number12345678910111213141581
v__CN2 … 
v__CH_K2 … 
v__SMFMX … 
v__TIMP … 
Figure 8

Simulation results for three responses values under different combinations of factor levels.

Figure 8

Simulation results for three responses values under different combinations of factor levels.

Close modal
Figure 9 shows the major effects plot of the four parameters for the three degrees of variation. The parameter CN2 represents the potential for surface runoff and is affected by soil permeability, land use, and initial soil water condition (Liu et al. 2017). The positive slope of CN2 suggests that it contributes positively to NPF. The reason for this is the study watershed's hefty wet-season rainfall, which might lead to the infiltration of surplus surface flow and a consequently high NPF. The parameter contributions shown in Figure 10 agree with the findings of individual impacts shown in Figure 9. With a contribution of 88.22% to NPF, CN2 clearly stands out as the most significant element in this plot. This means that, during the non-snowmelt season, precipitation is the main source of runoff in the Naryn River. Also, in NPF, SMFMX's 7.18% contribution rate makes it the second most important metric. Additionally, with contributions of 76.69 and 53.70%, respectively, TIMP had the biggest impact on SPF and AAF. This means that, during the non-snowmelt season, precipitation is the main source of runoff in the Naryn River. Among the three responses, CH_K2 had a negligible impact.
Figure 9

The main effect of each parameter on responses variable.

Figure 9

The main effect of each parameter on responses variable.

Close modal
Figure 10

Contributions of parameters to modeling responses.

Figure 10

Contributions of parameters to modeling responses.

Close modal

The parameter CH_K2 is effective hydraulic conductivity in main channel alluvium. It can affect the mutual recharge between groundwater and river (Zhang et al. 2016). Soil permeability coefficient is positively correlated with hydraulic conductivity, the higher the soil hydraulic conductivity, the more rainfall infiltration, and the smaller the runoff generated on the surface. The larger the value of CH_K2, the more water will permeate into groundwater and reduce the river discharge. In this study, it has a weak negative effect on SPF but has an insignificant effect on NPF and AAF. It means that the infiltration water in this area is relatively small when the water is abundant.

The parameter SMFMX can account for the impact of snowpack density on snowmelt like SMFMN. The SWAT model calculates the amount of snowmelt water in the watershed using the degree-day factor method, which works under the assumption that the potential snowmelt rate follows a date-dependent sinusoidal function, with the highest expected on June 21 and the lowest expected on December 21 (Liu et al. 2020a). The parameter has a positive slope for all responses and has a positive impact. This means that snowmelt contributes significantly to runoff in the basin.

The delay factor TIMP regulates how much the previous day's snow pile temperature affects the present snow temperature. The average daily temperature has a greater impact on snow pile temperature as TIMP gets closer to 1. Assuming the user sets a temperature threshold, the snow pile will only melt after it reaches or passes that point (Abbas et al. 2019). This parameter has a significant effect on all responses; it means that the amount of snow that melts daily fluctuates significantly from 1 day to the next. This extrapolation implies that the temperature of the basin varies significantly. These results demonstrate the significance of soil conditions and snowmelt, as represented by the model, for the study basin in response to the model. Studies on soil quality and snowmelt should be expanded if the model performance is to be enhanced.

Effects of interactive parameters

Figures 11 and 12 display the four-parameter, three-level interactive effecting plots. These plots illustrate the impact of one parameter on the hydrologic response (i.e., NPF, SPF, and AAF) when one parameter is influenced by another one. There is no interaction between the two parameters if the lines are parallel to one another. The higher the dissimilarity in slopes, the greater the impact of interactions on hydrological responses. The TIMP* SMFMX interaction plot is shown in Figure 11(a) (on the bottom right), where blue, red, and green lines, respectively, denote low, medium, and high levels of SMFMX. The degree to which the NPF shifts vary throughout the three SMFMX levels is directly related to the level of the corresponding TIMP, indicating that the peak flow is affected by the interaction between SMFMX and TIMP and that the two parameters are mutually reliant. TIMP's negative impact on NPF is more pronounced when SMFMX is low; on the other hand, TIMP's influence on NPF is negligible at high SMFMX. The primary explanation is that SMFMX and TIMP each play a significant role in snow melting. With NPF, the larger value of SMFMX indicates a small snow-melting factor on that day, and it can decrease the effect of snow pile temperature delay factor represented by TIMP as a multiplier. The degree-day factor method assumes that the snowmelt factor varies seasonally, reaching the maximum and minimum values at the summer solstice and winter solstice, respectively (Liu et al. 2020a). In the non-melting period, the sine value is less than zero. Therefore, as the SMFMX increases, the snowmelt factor decreases. And melting snow totals for the current day is a function of the product of snow-melting factor and the negative snow pile temperature delay factor. So, a high value of SMFMX can weaken the influence of TIMP on snow melting. The findings show that SMFMX and TIMP's interaction significantly affects NPF, and that low levels of both factors provide the largest peak flow. It is necessary to take into account both the SMFMX and TIMP values concurrently in order to obtain the best possible model results.
Figure 11

Interactive effects plot for (a) non-melting period peak flow and (b) snow-melting period peak flow. Please refer to the online version of this paper to see this figure in colour: http://dx.doi.org/10.2166/wcc.2023.014.

Figure 11

Interactive effects plot for (a) non-melting period peak flow and (b) snow-melting period peak flow. Please refer to the online version of this paper to see this figure in colour: http://dx.doi.org/10.2166/wcc.2023.014.

Close modal
Figure 12

Interaction plot matrix for AAF.

Figure 12

Interaction plot matrix for AAF.

Close modal

In this study, a BBC-TFA method has been developed and applied to assess the uncertainty impacts of hydrological parameters on runoff in the Naryn River Basin (Central Asia). The BBC-TFA method has shown high accuracy in runoff simulation. The uncertainties of four sensitive parameters, such as snowpack temperature lag factor (TIMP), effective hydraulic conductivity in main channel alluvium (CH_K2), melt factor for snow on June 21 (SMFMX), and initial soil conservation service (SCS) runoff curve number for moisture condition II (CN2), have been investigated, and their effects and interactions on different hydrological responses have also been explored. SWAT provides the SCS curve number procedure (USDA SCS 1972) for estimating surface runoff, the SCS curve number procedure is an empirical model that came into common use in the 1950s.

Several findings are (i) the parameter CN2 has a positive effect on NPF, implying that the Naryn River Basin is mountainous (with steep topography and concentrated summer rainfall) and tends to trigger an infiltration-excess runoff; (ii) the parameter CH_K2 affects the mutual recharge between groundwater and rivers. The negative effect on all responses in this study is weak; this disclose that, in the Naryn River Basin, only a limited quantity of surface runoff infiltrates as groundwater; (iii) the parameter TIMP has a significant effect on all responses, which means that the temperature of the basin varies significantly; (iv) individual and interactive effects of parameter SMFMX and TIMP have a significant effect on the hydrological response, implying that snowmelt is a significant contributor to runoff in the Naryn River Basin; (v) the BCT can effectively eliminate the residual variance of the model output values from the observed values and obtain a lower range of uncertainty prediction intervals, enhancing the correlation between input variables and output variables; (vi) CN2 has the highest contribution to NPF with a rate of 88.22%, it means that the Naryn River Basin is at risk of flooding, but preparations can be made for water storage; TIMP has the highest contribution to both SPF and AAF with rates of 79.69 and 53.70%, respectively; it is indicated that the Naryn River has abundant glacier resources that can be appropriately developed and/or protected. Based on the above findings, the relationships between important parameters and the Naryn River Basin can be clarified, thus improving the simulation/prediction capability of the hydrological model for water resources in the Naryn River Basin.

This study is the first parametric uncertainty analysis of the hydrological modeling of the Naryn River Basin by the BBC-TFA method, and there are some limitations that can be further improved. First, BBC-TFA was only applied to the Naryn River Basin, and its application to more regions, especially for data-scarce areas, could be explored. Second, the Naryn River Basin is located in the Central Asian mountains, where the low density of observation points and data scarcity can increase the uncertainty of model performance and results. Therefore, to improve the simulation accuracy of the model, satellite remote sensing inversion can be used to increase the runoff sequence in future work. Third, this study used BCT to process data, resulting in a small range of posterior parameter distribution. In the future research work, new data preprocessing techniques can be attempted to eliminate homoscedasticity without changing the distribution of the data itself (Ikram et al. 2022b). Future research might take into account these extensions to enhance parameter and hydrologic modeling uncertainty estimations.

This research was supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (No. XDA20060302), the Fund for Innovative Research Group of the National Natural Science Foundation of China (52221003) and the Start-up Project for Advanced Talent of Xiamen University of Technology in the First Half of 2022 (YKJ22011R). The authors are grateful to the editors and the anonymous reviewers for their insightful comments and suggestions.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Abbas
T.
,
Hussain
F.
,
Nabi
G.
,
Boota
M. W.
&
Wu
R. S.
2019
Uncertainty evaluation of SWAT model for snowmelt runoff in a Himalayan watershed
.
Terrestrial, Atmospheric & Oceanic Sciences
30
(
2
).
https://doi.org/10.3319/TAO.2018.10.08.01
Abbaspour
K. C.
2015
SWAT calibration and uncertainty programs
.
A User Manual
103
,
17
66
.
Abbaspour
K. C.
,
Yang
J.
,
Maximov
I.
,
Siber
R.
,
Bogner
K.
,
Mieleitner
J.
,
Zobrist
J.
&
Srinivasan
R.
2007
Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT
.
Journal of Hydrology
333
(
2–4
),
413
430
.
https://doi.org/10.1016/j.jhydrol.2006.09.014
.
Adnan
R. M.
,
Mostafa
R. R.
,
Kisi
O.
,
Yaseen
Z. M.
,
Shahid
S.
&
Zounemat-Kermani
M.
2021
Improving streamflow prediction using a new hybrid ELM model combined with hybrid particle swarm optimization and grey wolf optimization
.
Knowledge-Based Systems
230
,
107379
.
https://doi.org/10.1016/j.knosys.2021.107379
.
Adnan
R. M.
,
Mostafa
R. R.
,
Elbeltagi
A.
,
Yaseen
Z. M.
,
Shahid
S.
&
Kisi
O.
2022
Development of new machine learning model for streamflow prediction: case studies in Pakistan
.
Stochastic Environmental Research and Risk Assessment
1
35
.
https://doi.org/10.1007/s00477-021-02111-z
.
Ahmed
K.
,
Shahid
S.
,
Sachindra
D. A.
,
Nawaz
N.
&
Chung
E. S.
2019
Fidelity assessment of general circulation model simulated precipitation and temperature over Pakistan using a feature selection method
.
Journal of Hydrology
573
,
281
298
.
https://doi.org/10.1016/j.jhydrol.2019.03.092
.
Ansell
L.
&
Valle
L. D.
2021
Social media integration of flood data: A vine copula-based approach. arXiv preprint arXiv:2104.01869. http://doi.org/10.3808/jei.202200471
.
Arnold
J. G.
,
Moriasi
D. N.
,
Gassman
P. W.
,
Abbaspour
K. C.
,
White
M. J.
,
Srinivasan
R.
,
Santhi
C.
,
Harmel
R. D.
,
van Griensven
A.
,
Van Liew
M. W.
,
Kannan
N.
&
Jha
M. K.
2012
SWAT: Model use, calibration, and validation
.
Transactions of the ASABE, American Society of Agricultural and Biological Engineers
55
(
4
),
1491
1508
.
https://doi.org/10.13031/2013.42256
.
Arnold
J. G.
,
Kiniry
J. R.
,
Srinivasan
R.
,
Williams
J. R.
,
Haney
E. B.
&
Neitsch
S. L.
2013
SWAT 2012 Input/Output Documentation
.
Texas Water Resources Institute
. .
Baig
F.
,
Sherif
M.
&
Faiz
M. A.
2022
Quantification of precipitation and evapotranspiration uncertainty in rainfall-runoff modeling
.
Hydrology
9
(
3
),
51
.
https://doi.org/10.3390/hydrology9030051
.
Box
G. P.
&
Cox
D. R.
1964
An analysis of transformations
.
Journal of the Royal Statistical Society
26
(
21
),
1
43
.
https://doi.org/10.1111/j.2517-6161.1964.tb00553.x
.
Cheng
Q. B.
,
Chen
X.
,
Xu
C. Y.
,
Reinhardt-Imjela
C.
&
Schulte
A.
2014
Improvement and comparison of likelihood functions for model calibration and parameter uncertainty analysis within a Markov chain Monte Carlo scheme
.
Journal of Hydrology
519
,
2202
2214
.
https://doi.org/10.1016/j.jhydrol.2014.10.008
.
Cheng
Q. B.
,
Chen
X.
,
Xu
C. Y.
,
Zhang
Z. C.
,
Reinhardt-Imjela
C.
&
Schulte
A.
2018
Using maximum likelihood to derive various distance-based goodness-of-fit indicators for hydrologic modeling assessment
.
Stochastic Environmental Research and Risk Assessment
32
,
949
966
.
https://doi.org/10.1007/s00477-017-1507-8
.
Cheng
M.
,
Fang
F.
,
Kinouchi
T.
,
Navon
I. M.
&
Pain
C. C.
2020
Long lead-time daily and monthly streamflow forecasting using machine learning methods
.
Journal of Hydrology
590
,
125376
.
https://doi.org/10.1016/j.jhydrol.2020.125376
.
Chiphang
N.
,
Bandyopadhyay
A.
&
Bhadra
A.
2020
Assessing the effects of snowmelt dynamics on streamflow and water balance components in an Eastern Himalayan river basin using SWAT model
.
Environmental Modeling & Assessment
25
,
861
883
.
https://doi.org/10.1007/s10666-020-09716-8
.
Diao
W.
,
Peng
P.
,
Zhang
C.
,
Yang
S.
&
Zhang
X.
2021
Multi-objective optimal operation of reservoir group in Jialing River based on DREAM algorithm
.
Water Supply
21
(
5
),
2518
2531
.
https://doi.org/10.2166/ws.2021.064
.
Didovets
I.
,
Lobanova
A.
,
Krysanova
V.
,
Menz
C.
,
Babagalieva
Z.
,
Nurbatsina
A.
,
Gavrilenko
N.
,
Khamidov
V.
,
Umirbekov
A.
,
Qodirov
S.
,
Muhyyew
D.
&
Hattermann
F. F.
2021
Central Asian rivers under climate change: impacts assessment in eight representative catchments
.
Journal of Hydrology: Regional Studies
34
,
100779
.
https://doi.org/10.1016/j.ejrh.2021.100779
.
Duan
Y.
,
Liu
T.
,
Meng
F.
,
Luo
M.
,
Frankl
A.
,
De Maeyer
P.
,
Bao
A.
&
Kurban
A.
2018
Inclusion of modified snow melting and flood processes in the swat model
.
Water
10
(
12
),
1715
.
https://doi.org/10.3390/w10121715
.
Farsi
N.
&
Mahjouri
N.
2019
Evaluating the contribution of the climate change and human activities to runoff change under uncertainty
.
Journal of Hydrology
574
,
872
891
.
https://doi.org/10.1016/j.jhydrol.2019.04.028
.
Gan
R.
,
Luo
Y.
,
Zuo
Q.
&
Sun
L.
2015
Effects of projected climate change on the glacier and runoff generation in the Naryn River Basin, Central Asia
.
Journal of Hydrology
523
,
240
251
.
https://doi.org/10.1016/j.jhydrol.2015.01.057
.
Ghimire
S.
,
Choudhary
A.
&
Dimri
A. P.
2018
Assessment of the performance of CORDEX-South Asia experiments for monsoonal precipitation over the Himalayan region during present climate: part I
.
Climate Dynamics
50
,
2311
2334
.
https://doi.org/10.1007/s00382-015-2747-2
.
Guo
T.
,
Liu
Y.
,
Shao
G.
,
Engel
B. A.
,
Sharma
A.
,
Marshall
L. A.
,
Flanagan
D. C.
,
Cibin
R.
,
Wallace
C. W.
,
Zhao
K.
,
Ren
D.
,
Mercado
J. V.
&
Aboelnour
M. A.
2022
Improving probabilistic monthly water quantity and quality predictions using a simplified residual-based modeling approach
.
Environmental Modelling & Software
156
,
105499
.
https://doi.org/10.1016/j.envsoft.2022.105499
.
Gupta
H. V.
,
Sorooshian
S.
&
Yapo
P. O.
1999
Status of automatic calibration for hydrologic models: comparison with multilevel expert calibration
.
Journal of Hydrologic Engineering
4
(
2
),
135
143
.
https://doi.org/10.1061/(ASCE)1084-0699(1999)4:2(135)
.
Han
H.
&
Morrison
R. R.
2022
Improved runoff forecasting performance through error predictions using a deep-learning approach
.
Journal of Hydrology
608
,
127653
.
https://doi.org/10.1016/j.jhydrol.2022.127653
.
Ikram
R. M. A.
,
Ewees
A. A.
,
Parmar
K. S.
,
Yaseen
Z. M.
,
Shahid
S.
&
Kisi
O.
2022a
The viability of extended marine predators algorithm-based artificial neural networks for streamflow prediction
.
Applied Soft Computing
131
,
109739
.
https://doi.org/10.1016/j.asoc.2022.109739
.
Ikram
R. M. A.
,
Hazarika
B. B.
,
Gupta
D.
,
Heddam
S.
&
Kisi
O.
2022b
Streamflow prediction in mountainous region using new machine learning and data preprocessing methods: a case study
.
Neural Computing and Applications
1
18
.
https://doi.org/10.1007/s00521-022-08163-8
.
Jia
Q. M.
,
Li
Y. P.
,
Li
Y. F.
&
Huang
G. H.
2020
Analyzing variation of inflow from the Syr Darya to the Aral Sea: a Bayesian-neural-network-based factorial analysis method
.
Journal of Hydrology
587
,
124976
.
https://doi.org/10.1016/j.jhydrol.2020.124976
.
Joseph
J. F.
2011
Preliminaries to Watershed Instrumentation System Design
.
PhD dissertation. The University of Texas at San Antonio
,
Department of Civil and Environmental Engineering, San Antonio, TX, USA. https://www.proquest.com/openview/08f0d25e7d40a0b2072c940442b94893/1?pq-origsite=gscholar&cbl=18750
.
Joseph
J. F.
&
Guillaume
J. H.
2013
Using a parallelized MCMC algorithm in R to identify appropriate likelihood functions for SWAT
.
Environmental Modelling & Software
46
,
292
298
.
https://doi.org/10.1016/j.envsoft.2013.03.012
.
Kan
G.
,
He
X.
,
Ding
L.
,
Li
J.
,
Liang
K.
&
Hong
Y.
2017
Study on applicability of conceptual hydrological models for flood forecasting in humid, semi-humid semi-arid and arid basins in China
.
Water
9
(
10
),
719
.
https://doi.org/10.3390/w9100719
.
Kriegel
D.
,
Mayer
C.
,
Hagg
W.
,
Vorogushyn
S.
,
Duethmann
D.
,
Gafurov
A.
&
Farinotti
D.
2013
Changes in glacierisation, climate and runoff in the second half of the 20th century in the Naryn basin, Central Asia
.
Global and Planetary Change
110
,
51
61
.
https://doi.org/10.1016/j.gloplacha.2013.05.014
.
Leta
O. T.
,
Nossent
J.
,
Velez
C.
,
Shrestha
N. K.
,
Van Griensven
A.
&
Bauwens
W.
2015
Assessment of the different sources of uncertainty in a SWAT model of the River Senne (Belgium)
.
Environmental Modelling & Software
68
,
129
146
.
https://doi.org/10.1016/j.envsoft.2015.02.010
.
Liu
Y. R.
,
Li
Y. P.
,
Huang
G. H.
,
Zhang
J. L.
&
Fan
Y. R.
2017
A Bayesian-based multilevel factorial analysis method for analyzing parameter uncertainty of hydrological model
.
Journal of Hydrology
553
,
750
762
.
https://doi.org/10.1016/j.jhydrol.2017.08.048
.
Liu
H. X.
,
Li
Y. P.
&
Yu
L.
2019
Urban agglomeration (Guangzhou-Foshan-Zhaoqing) ecosystem management under uncertainty: a factorial fuzzy chance-constrained programming method
.
Environmental Research
173
,
97
111
.
https://doi.org/10.1016/j.envres.2019.03.018
.
Liu
Y. R.
,
Li
Y. P.
,
Ma
Y. A.
,
Jia
Q. M.
&
Su
Y. Y.
2020b
Development of a Bayesian-copula-based frequency analysis method for hydrological risk assessment – the Naryn River in Central Asia
.
Journal of Hydrology
580
,
124349
.
https://doi.org/10.1016/j.jhydrol.2019.124349
.
Liu
Y.
,
Fernández-Ortega
J.
,
Mudarra
M.
&
Hartmann
A.
2021
Pitfalls and a feasible solution for using KGE as an informal likelihood function in MCMC methods: DREAM (ZS) as an example
.
Hydrology and Earth System Sciences
26
(
20
),
5341
5355
.
https://doi.org/10.5194/hess-26-5341-2022
.
Liu
Y. L.
,
Du
J. Z.
,
Wang
Q.
,
Yang
W.
&
Cui
B. S.
2022
Toward an assessment of runoff and thermal connectivity in A river-lake system within an urban environment
.
Journal of Environmental Informatics
40
(
2
).
http://doi.org/10.3808/jei.202200482
McKay
M. D.
,
Beckman
R. J.
&
Conover
W. J.
2000
A comparison of three methods for selecting values of input variables in the analysis of output from a computer code
.
Technometrics
42
(
1
),
55
61
.
https://doi.org/10.1080/00401706.2000.10485979
.
Mengistu
A. G.
,
van Rensburg
L. D.
&
Woyessa
Y. E.
2019
Techniques for calibration and validation of SWAT model in data scarce arid and semi-arid catchments in South Africa
.
Journal of Hydrology: Regional Studies
25
,
100621
.
https://doi.org/10.1016/j.ejrh.2019.100621
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
2007
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
.
Transactions of the ASABE
50
(
3
),
885
900
.
https://doi.org/10.13031/2013.23153
.
Morris
M. D.
1991
Factorial sampling plans for preliminary computational experiments
.
Technometrics
33
(
2
),
161
174
.
https://doi.org/10.1080/00401706.1991.10484804
.
Pan
J.
,
Zhang
W.
,
Liu
Z.
,
Shi
X.
&
Chen
Z.
2022
Application of improved multi-coupling model based on LH-OAT method in water environment simulation
.
Desalination and Water Treatment
269
,
238
248
.
https://doi.org/10.5004/dwt.2022.28728
.
Polong
F.
,
Deng
K.
,
Pham
Q. B.
,
Linh
N. T. T.
,
Abba
S. I.
,
Ahmed
A. N.
,
Anh
D. T.
,
Khedher
K. M.
&
El-Shafie
A.
2023
Separation and attribution of impacts of changes in land use and climate on hydrological processes
.
Theoretical and Applied Climatology
1
17
.
https://doi.org/10.1007/s00704-022-04351-7
.
Qiao
C.
,
Cai
G.
,
Liu
Y.
,
Li
J.
&
Chen
F.
2022
Study of the flood frequency based on normal transformation in arid inland region: a case study of Manas River in North-Western China
.
Mobile Information Systems
2022
.
https://doi.org/10.1155/2022/5229348
Schöne
T.
,
Dusik
E.
,
Illigner
J.
&
Klein
I.
2018
Water in Central Asia: reservoir monitoring with radar altimetry along the Naryn and Syr Darya Rivers. In: International Symposium on Earth and Environmental Sciences for Future Generations: Proceedings of the IAG General Assembly, Prague, Czech Republic, June 22-July 2, 2015 (pp. 349–357). https://doi.org/10.1007/1345_2017_265
.
Schuster-Wallace
C. J.
,
Dickson-Anderson
S. E.
,
Papalexiou
S. M.
&
Ganzouri
A. E.
2022
Design and application of the tank simulation model (TSM): assessing the ability of rainwater harvesting to meet domestic water demand
.
Journal of Environmental Informatics
40
(
1
).
http:// doi.org/10.3808/jei.202200477
Shannon
S.
,
Payne
A.
,
Freer
J.
,
Coxon
G.
,
Kauzlaric
M.
,
Kriegel
D.
&
Harrison
S.
2023
A snow and glacier hydrological model for large catchments–case study for the Naryn River, central Asia
.
Hydrology and Earth System Sciences
27
(
2
),
453
480
.
https://doi.org/10.5194/hess-2022-51
.
Sidle
R. C.
2021
Strategies for smarter catchment hydrology models: incorporating scaling and better process representation
.
Geoscience Letters
8
(
1
),
24
.
https://doi.org/10.1186/s40562-021-00193-9
.
Tasdighi
A.
,
Arabi
M.
&
Harmel
D.
2018
A probabilistic appraisal of rainfall-runoff modeling approaches within SWAT in mixed land use watersheds
.
Journal of Hydrology
564
,
476
489
.
https://doi.org/10.1016/j.jhydrol.2018.07.035
.
Triana
J. S. A.
,
Chu
M. L.
,
Guzman
J. A.
,
Moriasi
D. N.
&
Steiner
J. L.
2019
Beyond model metrics: the perils of calibrating hydrologic models
.
Journal of Hydrology
578
,
124032
.
https://doi.org/10.1016/j.jhydrol.2019.124032
.
USDA Soil Conservation Service
1972
A hydrological modelling-based approach for vulnerable area identification under changing climate scenarios. National Engineering Handbook
.
U.S. Printing Office, Washington, D.C Section 4 Hydrology. Chapters 4 -10. https://doi.org/10.2166/wcc.2020.202
.
Uusitalo
L.
,
Lehikoinen
A.
,
Helle
I.
&
Myrberg
K.
2015
An overview of methods to evaluate uncertainty of deterministic models in decision support
.
Environmental Modelling & Software
63
,
24
31
.
https://doi.org/10.1016/j.envsoft.2014.09.017
.
Van Griensven
A.
&
Meixner
T.
2003
Sensitivity, optimisation and uncertainty analysis for the model parameters of SWAT
. In:
Proceedings of 2nd International SWAT Conference
, pp.
162
167
.
Vrugt
J. A.
,
Ter Braak
C. J.
,
Clark
M. P.
,
Hyman
J. M.
&
Robinson
B. A.
2008
Treatment of input uncertainty in hydrologic modeling: doing hydrology backward with Markov chain Monte Carlo simulation
.
Water Resources Research
44
(
12
).
https://doi.org/10.1029/2007WR006720
Wang
H.
,
Li
Y. P.
,
Liu
Y. R.
,
Huang
G. H.
,
Li
Y. F.
&
Jia
Q. M.
2021
Analyzing streamflow variation in the data-sparse mountainous regions: an integrated CCA-RF-FA framework
.
Journal of Hydrology
596
,
126056
.
https://doi.org/10.1016/j.jhydrol.2021.126056
.
Wang
M.
,
Zhang
Y.
,
Lu
Y.
,
Gao
L.
&
Wang
L.
2022
Attribution analysis of streamflow changes based on large-scale hydrological modeling with uncertainties
.
Water Resources Management
1
18
.
https://doi.org/10.1007/s11269-022-03396-7
.
Wu
J. S.
,
Li
Y. P.
,
Sun
J.
,
Gao
P. P.
,
Huang
G. H.
&
Liu
J.
2022
Identifying the runoff variation in the Naryn River Basin under multiple climate and land-use change scenarios
.
Journal of Water and Climate Change
13
(
2
),
574
592
.
https://doi.org/10.2166/wcc.2021.422
.
Xu
X.
,
Zeng
X.
,
Li
Y.
,
Wang
C.
,
Yu
L.
,
Huang
G.
,
Zhang
J.
,
Feng
J.
&
Han
X.
2022
Multi-Watershed nonpoint source pollution management through coupling Bayesian-based simulation and mechanism-based effluent trading optimization
.
Stochastic Environmental Research and Risk Assessment
1
39
.
https://doi.org/10.1007/s00477-021-02130-w
.
Yang
J.
,
Reichert
P.
,
Abbaspour
K. C.
,
Xia
J.
&
Yang
H.
2008
Comparing uncertainty analysis techniques for a SWAT application to the Chaohe Basin in China
.
Journal of Hydrology
358
(
1–2
),
1
23
.
https://doi.org/10.1016/j.jhydrol.2008.05.012
.
Yuan
X.
,
Chen
C.
,
Lei
X.
,
Yuan
Y.
&
Muhammad Adnan
R.
2018
Monthly runoff forecasting based on LSTM–ALO model
.
Stochastic Environmental Research and Risk Assessment
32
,
2199
2212
.
https://doi.org/10.1007/s00477-018-1560-y
.
Zhang
J.
,
Li
Y.
,
Huang
G.
,
Chen
X.
&
Bao
A.
2016
Assessment of parameter uncertainty in hydrological model using a Markov-Chain-Monte-Carlo-based multilevel-factorial-analysis method
.
Journal of Hydrology
538
,
471
486
.
https://doi.org/10.1016/j.jhydrol.2016.04.044
.
Zhang
J. L.
,
Wang
X.
,
Sun
W. N.
,
Li
Y. P.
,
Liu
Z. R.
,
Liu
Y. R.
&
Huang
G. H.
2020
Application of fiducial method for streamflow prediction under small sample cases in Xiangxihe watershed, China
.
Journal of Hydrology
586
,
124866
.
https://doi.org/10.1016/j.jhydrol.2020.124866
.
Zhang
F.
,
Kang
Y.
,
Cheng
X.
,
Chen
P.
&
Song
S.
2022
A hybrid model integrating Elman neural network with variational mode decomposition and Box–Cox transformation for monthly runoff time series prediction
.
Water Resources Management
36
(
10
),
3673
3697
.
https://doi.org/10.1007/s11269-022-03220-2
.
Zheng
G.
,
Bao
A.
,
Li
J.
,
Zhang
G.
,
Xie
H.
,
Guo
H.
,
Jiang
L.
,
Chen
T.
,
Chang
C.
&
Chen
W.
2019
Sustained growth of high mountain lakes in the headwaters of the Syr Darya River, Central Asia
.
Global and Planetary Change
176
,
84
99
.
https://doi.org/10.1016/10.1016/j.gloplacha.2019.03.004
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).