In this study, a physics-informed machine learning-based surrogate model (SM) for the variable infiltration capacity (VIC) model was developed to improve simulation efficiency in the Yarlung Tsangpo River basin. The approach combines the empirical orthogonal function decomposition of low-fidelity VIC models to extract spatial and temporal features, with machine learning techniques applied to refine temporal feature series. This allows for accurate reconstruction of high-fidelity spatial simulations from the results of the low-fidelity model. Using the SM built from the 1.0°-resolution VIC model as an example, the study highlights the challenges and solutions associated with low-fidelity simulations. The SM significantly improves accuracy, achieving a Kling–Gupta efficiency of 0.88, an Nash–Sutcliffe efficiency of 0.97, and a PBIAS value of −6.21% with reduced computational demands. Additionally, different machine learning methods impact the performance of the SM, with the support vector machine regression model performing best in these methods. SMs from varying low-fidelity resolutions maintain similar accuracy, but higher resolutions notably enhance computational efficiency, reducing time by 86.31% when compared to the high-fidelity VIC model. These findings demonstrate the potential of the SM to enhance VIC model simulations while reducing computational requirements.

  • Developed a physics-informed machine learning-based surrogate model to enhance the efficiency and accuracy of variable infiltration capacity (VIC) streamflow simulations.

  • Achieved significant reduction in computational time while retaining key spatial features of the original VIC model.

  • The model effectively balances accuracy and computational efficiency, making it suitable for large-scale hydrological studies.

The study of hydrological systems is critical for understanding the impacts of climate change on water resources, assessing flood risks, and developing strategies for sustainable water management (Gu et al. 2020a). Distributed models have gained significant attention regarding these topics due to their ability to represent spatial variability in environmental processes, which are designed to simulate the spatial and temporal distribution of hydrological variables, such as runoff, evapotranspiration and soil moisture, across large geographical regions (Wang et al. 2024). For example, Vogeti et al. (2023) used the Soil Water Assessment Tool, Hydrologic Engineering Center–Hydrologic Modelling System, and Hydrologic Simulation Program Fortran to explore climate change impact on streamflow in the Lower Godavari Basin, India; Kovalets et al. (2015) forecasted the extreme floods in mountainous areas with the distributed hydrology soil vegetation model and the weather research and forecasting modeling system. The variable infiltration capacity (VIC) model is one of the popular distributed hydrological models, which represents various processes in the terrestrial water cycle, including runoff generation, soil moisture storage, and energy balance (Liang et al. 1994, 1996). The VIC has been widely adopted in water-related fields, such as climate change impacts on hydrology (Lu et al. 2013), runoff forecasts (Sinha & Sankarasubramanian 2013), and water resources assessment (Sridhar et al. 2019), because of its ability to simulate complex processes using a wide range of parameters, allowing for more accurate simulations. However, the computational demands of the VIC model pose substantial challenges (Tsai et al. 2021).

To alleviate this computational burden, surrogate models (SMs) have emerged as a promising solution. SMs approximate the behavior of more complex models, allowing for faster simulations while maintaining an acceptable accuracy (Maier et al. 2014). Among the different types of SMs, artificial neural networks (ANNs) (Behzadian et al. 2009; Zhou et al. 2021), Gaussian process regression (GPR) models (Gong et al. 2016; Zhang et al. 2016), and low-fidelity models (Razavi & Tolson 2013) have shown great promise in enhancing computational efficiency in hydrological studies. However, traditional SMs such as ANNs and GPRs often struggle to balance computational efficiency with the ability to capture the complex, nonlinear dynamics and spatial-temporal variability inherent in hydrological systems (Razavi et al. 2012). The effectiveness of SMs has been demonstrated in numerous hydrological studies, where they have been used for applications such as streamflow forecasting (Gu et al. 2024a) and flood risk modeling (Zhou et al. 2021; Mosimann et al. 2024). However, these studies typically focus on temporal variations while neglecting spatial correlations. This characteristic leads to a lumped representation in SMs, weakening the advantage of distributed hydrological models in spatial simulation (Gu et al. 2020b).

In recent years, physics-informed machine learning (PIML) has gained attention as a novel approach that can integrate physical knowledge into the training of data-driven models (Karniadakis et al. 2021). Generally speaking, traditional black box models perform well in both simulation efficiency and data fitting (Mishra et al. 2024; Vogeti et al. 2024). However, some studies have also pointed out that the predictions of black box models can be unrealistic or inconsistent with physical laws due to errors from extrapolating beyond the data or biases in the observations, which can lead to poor generalization (Razavi 2021; Yaseen 2023). To settle this issue, PIML teaches machine learning models the laws of physics by incorporating fundamental physical principles and domain knowledge, which in turn provides informative priors, offering strong theoretical constraints and inductive biases on top of the observational data (Karniadakis et al. 2021). This unique feature allows PIML to bridge the gap between high-fidelity physical models and purely data-driven approaches, offering a balance between accuracy and computational efficiency (Fraehr et al. 2023a). By integrating physical constraints, PIML can reduce the need for extensive training data, making them suitable for complex hydrological models like VIC. A hybrid approach involving a low-fidelity model, spatial analysis, and Gaussian processing regression is one kind of PIML, which uses simplified physical models to provide initial approximations and uses the machine learning algorithm to refine these approximations and complete high-fidelity simulation (Fraehr et al. 2024). The dimension reduction techniques, such as empirical orthogonal function (EOF) analysis, have been introduced to overcome these limitations of spatial simulation in the SM (Hu et al. 2019; Donnelly et al. 2022).

This study aims to enhance hydrological modeling by developing a PIML-based SM to boost the VIC model's efficiency in streamflow simulation. The specific objectives are: (1) to decrease computational demands of the VIC model through PIML techniques; (2) to improve the SM's accuracy while minimizing fidelity loss compared to the high-fidelity VIC outputs; and (3) to retain as much spatial detail from the high-fidelity VIC model as possible.

Hydrological model

The VIC is a grid-based distributed hydrological model designed for large-scale catchments (Liang et al. 1994, 1996). In the VIC model, each grid cell operates independently without considering the lateral communication between cells. The VIC model does not have a built-in routing module. Hence, a source-to-sink routing model to solve linearized Saint-Venant equations (Lohmann et al. 1996, 1998) is utilized to simulate the movement of runoff and baseflow from each grid cell to a river channel network.

In this study, we utilize version 4.2.d of the VIC model, running at a 6-h time step for both water and energy balance components, with a spatial resolution of 0.25, 0.50, 0.75, and 1.00° (denoted as VIC25, VIC50, VIC75, and VIC100, respectively). The model includes snow (Bowling et al. 2004; Andreadis et al. 2009) and frozen soil (Cherkauer & Lettenmaier 1999; Cherkauer et al. 2003) algorithms, allowing it to account for seasonal variations in cold climates and their impact on surface water and energy fluxes. A glacier melt module based on a temperature-index approach is also added into the VIC model, which assumes that meltwater originates from the snowpack, and this portion of meltwater is excluded from the snow water equivalent (Gu et al. 2023). The selection of this simplified glacier module is mainly due to the limitations in available glacier observation data.
(1)
where fi is the glacier area fraction of cell i; Df is the degree-day factor for glacier (mm°C−1day−1); Ti is the daily mean air temperature; and Tm is the temperature at which the glacier begins to melt.

Surrogate model

Low-fidelity, spatiotemporal decomposition, and post-processing model

As shown in Figure 1, this study uses the low-fidelity, spatiotemporal decomposition, and post-processing model (LSP) (Fraehr et al. 2023b) to build an SM for VIC. First, high-fidelity and low-fidelity VIC models are built to estimate the temporal and spatial variation patterns of runoff. Second, these spatial field variables simulated by the high-fidelity model are decomposed into spatial variation and temporal variation principal components (PCs) using EOF analysis. Third, with the spatial variation components from the high-fidelity model, the temporal variation components of the low-fidelity model can be acquired. Then, the machine learning model is used as the post-processing algorithm to correct the bias between the temporal variation components of the low-fidelity model and those of the high-fidelity model. Finally, the corrected temporal variation components and the spatial variation components are used to recompose high-fidelity spatial field variables of runoff.
Figure 1

Framework of the LSP, where EOF is the abbreviation for EOF and PC is the abbreviation for PC.

Figure 1

Framework of the LSP, where EOF is the abbreviation for EOF and PC is the abbreviation for PC.

Close modal

EOF analysis

The EOF analysis (Marukatat 2023), also known as principal component analysis in the context of spatial data, is a statistical method used to identify and analyze patterns in high-dimensional datasets. It is particularly effective in reducing dimensionality while preserving the essential characteristics of the data. The main steps of EOF are as follows:

  • (1) Standardization. Remove the trend and periodic components from the data, and then standardize the data into a mean value of 0 and a standard deviation of 1.

  • (2) Covariance matrix computation. The covariance matrix is calculated with the formula:
    (2)
Where A is the standardized data matrix of size n × p, n is the number of time steps, and p is the number of grid cells; C is the covariance matrix of A.
  • (3) Computation of eigenvalues and eigenvectors of covariance matrix. The characteristic equation can be written as follows:
    (3)
Where λk is the kth eigenvalue, which is in descending order; νk is the eigenvector associated with λk.
  • (4) Select principal modes. The percent of variance explained is calculated as follows:
    (4)

The percent of variance explained gives the percentage of the total variance that is explained by each mode. With the percent and cumulative percent of variance explained, the principal modes are selected.

  • (5) PC calculation. Transform the data matrix with eigenvectors to obtain the PCs as follows:
    (5)

PCk represents the temporal variation of the kth EOF mode, and νk represents the spatial variation.

In this study, we performed EOF decomposition on the high-fidelity model, resulting in and . Using the data matrix from the low-fidelity model Al and the Vh from the high-fidelity model, the variable PCl can be calculated. Then, the machine learning model is used as the post-processing algorithm to correct PCl, using PCh as the reference. Finally, the data are reconstructed with the following equation:
(6)
where m is the number of EOF modes used for reconstruction; is the corrected PC of the kth EOF mode from the low-fidelity model.

Machine learning model

  • (1) Gaussian process regression

GPR is a Bayesian regression technique that provides a probabilistic approach to modeling complex relationships in data (Williams & Rasmussen 2006). GPR operates by defining a prior distribution over functions and updating this distribution based on observed data, which allows it to capture uncertainty in predictions. Unlike traditional regression methods, GPR can automatically account for noise in the data and provides confidence intervals for its predictions (Schölkopf et al. 2001). Key advantages of GPR include: (1) the ability to model complex, nonlinear relationships without the need for specific parametric forms; (2) a natural incorporation of uncertainty estimation, allowing for more informed decision-making; (3) flexibility in choosing kernel functions to tailor the model to specific datasets; and (4) suitability for small- to medium-sized datasets where the overhead of computation remains manageable.

  • (2) Backpropagation (BP) neural network

The BP neural network model is a widely used supervised learning algorithm in the field of ANNs (Rumelhart et al. 1986). The BP model is composed of the input layer, the hidden layers, and the output layer. The BP algorithm is used to train the weights in the network and correct model errors, which includes forward-pass and backward-pass phases. In the former phase, the input data are fed into the network, and the output is calculated. In the latter phase, the error calculated from the output is propagated back through the network to update the weights and biases. Unlike simpler algorithms, BP can learn complex nonlinear relationships between inputs and outputs by using multiple layers of neurons. When acquired sufficient data and appropriate network complexity, the BP model can approximate almost any continuous function. Its flexible structure to allow customizing internal layer types and sizes makes the model suitable for a wide array of applications.

  • (3) Support vector machine (SVM) regression model

SVM is a powerful supervised learning technique used to predict continuous outcomes. It builds on the concepts of SVM for classification but focuses on fitting a function that can predict values based on input data (Cortes & Vapnik 1995). The model works by finding the best hyperplane that fits the data, while allowing for a certain amount of error defined by a tolerance margin. SVM can handle both linear and nonlinear relationships thanks to the use of different kernel functions.

  • (4) Random forest (RF)

RF refers to an ensemble learning method that uses multiple trees to train and predict samples. RF utilizes bootstrapping to generate diverse subsets from the original dataset to build several models and integrate them rather than building a single global model from raw data, which helps mitigate overfitting to some extent (Mariano & Monica 2021). Additionally, decision trees of RF can be trained in parallel, accelerating the algorithm's training process through parallel computing, while also reducing data dimensionality by randomly selecting features.

Accuracy evaluation index

To evaluate model performance, several widely used evaluation metrics are utilized in our study, which are the Nash–Sutcliffe efficiency (NSE) (Nash & Sutcliffe 1970), relative percentage bias (PBIAS), and Kling–Gupta efficiency (KGE) (Gupta et al. 2009; Kling et al. 2012). The formulas of NSE and PBIAS are defined as follows:
(7)
(8)
where n is the length of the time series; Qo(i) and Qs(i) are the observed and simulated values of the ith day, respectively; is the mean value of the observation. The KGE consists of three parametric components, i.e. the correlation coefficient of the observed and simulated streamflow, the variance bias and the total volume bias. The formula is shown as follows:
(9)
where r is the correlation coefficient of two compared time series; μs and σs are the mean values and the standard deviation of the simulated values; μo and σo are the mean values and the standard deviation of the observation.

These three parameters are the most commonly used evaluation criteria in the assessment of hydrological models. NSE is sensitive to extreme values or outliers, which can be used to evaluate overall model efficiency and how well the model captures the observed data's variability (Moriasi et al. 2007). PBIAS is important for quantifying the overall tendency of the model to overestimate or underestimate the observed values, providing insight into the model's systematic bias. KGE is a more balanced metric used for a more comprehensive assessment by evaluating correlation, variability, and bias separately, helping to identify specific weaknesses in the model (Althoff & Rodrigues 2021). In this study, these three evaluation criteria were used simultaneously to conduct a comprehensive assessment of the model.

Model setting

In this study, we established four VIC models with different resolutions of 0.25, 0.50, 0.75, and 1.00°, which are denoted as VIC25, VIC50, VIC75, and VIC100, respectively. Among these, VIC25 is referred to as the high-fidelity model, while VIC50, VIC75, and VIC100 are considered low-fidelity models. The parameters of the VIC models are calibrated with the ε-Dominance Non-Dominated Sorted Genetic Algorithm II (ε-NSGA II) (Kollat & Reed 2006; Liu et al. 2017). The details of the algorithm can be found in Gu et al. (2020a). The LSP models constructed with the low-fidelity models, i.e. VIC50, VIC75, and VIC100, are referred to as SM50, SM75, and SM100, respectively. Due to the ability to model complex, nonlinear relationships without the need for specific parametric forms, the GPR model is employed by default for post-processing in the LSP models. In this study, the GPR model utilizes the squared exponential kernel function, while the SVM model uses the linear kernel function, which is found to be most effective in the post-processing for PC. In the RF model, the number of decision trees in the bagged ensemble and the minimum number of leaf node observations are set as 200 and 5, respectively. The three-layer BP model is employed. The number of nodes in the hidden layer is calculated in accordance with the empirical formula proposed by Li & Liu (2009), and set as 25 in this study. The default values for other parameters suggested by Beale et al. (2010) are employed in the training of these machine learning models.

Study area

The proposed SM has been applied to the Yarlung Tsangpo River (YTR) basin to evaluate its role, efficiency, and performance in hydrological modeling. The YTR is the longest plateau river in China, with its source at the Jiemayangzong Glacier, which lies at an elevation of 5,590 m above sea level on the Tibetan Plateau (see Figure 2). This river flows through diverse and complex terrain, contributing to its unique hydrological characteristics. The drainage area above the Nuxia (NX) hydrological station is approximately 2 × 105 km², with an average altitude of over 4,000 m, making it a key area for studying high-altitude hydrological processes. The basin is also characterized by extensive perennial snow and glaciers, particularly in its upper and lower reaches and in the southern regions, which lie near the Himalayas (Liu et al. 2019). These frozen water reserves play a crucial role in maintaining river flow (Gu et al. 2024b).
Figure 2

Location of the YTR basin, the hydrological station, and the glacier distribution.

Figure 2

Location of the YTR basin, the hydrological station, and the glacier distribution.

Close modal

The YTR basin experiences an average annual precipitation of around 560 mm, which is relatively stable from year to year. Most of this rainfall occurs during the summer monsoon season, between June and September, influenced by the southwest monsoon (Su et al. 2016; Gu et al. 2023). The average temperature in the basin is about 6.27 °C, with peak temperatures observed in June and July and the lowest in January. This concentrated precipitation pattern, combined with seasonal temperature variations, plays a significant role in the hydrology of the basin, where a large amount of the glacier and snowmelt water is always accompanied by a large amount of precipitation. The hydrological behavior of the YTR basin is primarily driven by a combination of factors, including monsoon-controlled precipitation, snow and glacier meltwater, providing a complex system for the study of hydrological models (You et al. 2007; Chen et al. 2017).

Data

The China Meteorological Forcing Dataset (CMFD) is a gridded near-surface meteorological dataset (Yang & He 2019; He et al. 2020). In this study, the forcing data of the VIC model, including daily precipitation, temperature, wind speed, relative humidity, and radiation, is provided by CMFD. These forcing data range from 1999 to 2018. The daily streamflow of the NX station from 2000 to 2015 is utilized for model calibration and validation. Here, the calibration and validation periods for VIC are 2000–2010 and 2011–2015, respectively. As for the SM, an additional period from 2016 to 2018 is used for testing to show the robustness of the proposed SM. Meanwhile, the training period for the surrogate is from 2000 to 2015.

The terrain data of the YTR basin are from the Shuttle Radar Topography Mission digital elevation model (SRTM DEM). The SRTM DEM is a global elevation dataset that provides high-resolution topographic data, which is widely used in hydrological fields (Sharma & Tiwari 2014). The soil and land cover data for building the VIC model are from the Harmonized World Soil Database (HWSD) (Fischer et al. 2008) and the land cover dataset from the Environmental and Ecological Science Data Center for West China (WESTDC) (Ran et al. 2010). The soil data of HWSD in China are provided by the Nanjing Institute of Soil Science from the second national land survey. The distribution and area of glaciers in the YTR basin are derived from the Second Glacier Inventory of China (Guo et al. 2015).

Results of the VIC model

First, ε-NSGA II is used to calibrate parameters for the high-fidelity model with a spatial resolution of 0.25°. Figure 3 illustrates the daily streamflow of NX simulated by the high-fidelity model. It shows that the hydrograph has a good efficiency with an NSE value of 0.87, a KGE value of 0.91, and a PBIAS value of −1.11% in the calibration period, and 0.86, 0.88, and −0.27% in the validation period. Then, the model parameters are applied in the low-fidelity models with a spatial resolution of 0.50, 0.75, and 1.00°, respectively. For ease of calculation, large grid cells will be divided into several 0.25° × 0.25° cells for storage. In this process, the smaller grid cells within the same larger grid cell share the same data without interpolation. Table 1 shows the performance of VIC with different resolutions. It can be observed that as the spatial resolution decreases, the model's performance also declines. Among the three metrics, the difference in KGE is the most noticeable. When using VIC25 as the baseline, the KGE and NSE values of VIC50 and VIC75 are almost identical, while the PBIAS of VIC50 is relatively better than that of VIC75. Whether using VIC25 or the observation as the baseline, the performance of VIC100 is the worst, showing the largest discrepancy compared to VIC25.
Table 1

Comparison of performance across the models with different resolutions

MetricModelCalibrationValidationTrainingTesting
KGE VIC25 0.91 0.88 – – 
VIC50 0.81 0.79 0.80 0.79 
VIC75 0.77 0.75 0.81 0.78 
VIC100 0.65 0.63 0.64 0.62 
NSE VIC25 0.87 0.86 – – 
VIC50 0.85 0.84 0.96 0.96 
VIC75 0.81 0.79 0.96 0.95 
VIC100 0.75 0.73 0.88 0.86 
PBIAS VIC25 −1.11% −0.27% – – 
VIC50 18.26% 19.83% 17.50% 20.16% 
VIC75 20.32% 21.69% 18.80% 22.02% 
VIC100 33.86% 35.93% 32.27% 36.30% 
MetricModelCalibrationValidationTrainingTesting
KGE VIC25 0.91 0.88 – – 
VIC50 0.81 0.79 0.80 0.79 
VIC75 0.77 0.75 0.81 0.78 
VIC100 0.65 0.63 0.64 0.62 
NSE VIC25 0.87 0.86 – – 
VIC50 0.85 0.84 0.96 0.96 
VIC75 0.81 0.79 0.96 0.95 
VIC100 0.75 0.73 0.88 0.86 
PBIAS VIC25 −1.11% −0.27% – – 
VIC50 18.26% 19.83% 17.50% 20.16% 
VIC75 20.32% 21.69% 18.80% 22.02% 
VIC100 33.86% 35.93% 32.27% 36.30% 

Note: VIC25, VIC50, VIC75, and VIC100 represent the models with spatial resolutions of 0.25, 0.50, 0.75, and 1.00°. Items ‘Calibration’ and ‘Validation’ represent the performance between the VIC-simulated streamflow and the observations at NX station in the calibration and validation periods. Items ‘Training’ and ‘Testing’ represent the performance between the streamflow simulated by low-fidelity VIC models and the VIC25-simulated streamflow at NX station in the training and testing periods.

Figure 3

Daily streamflow hydrograph of VIC for the calibration and validation periods of the NX station, where VIC25, VIC50, VIC75, and VIC100 represent the models with spatial resolutions of 0.25, 0.50, 0.75, and 1.00°.

Figure 3

Daily streamflow hydrograph of VIC for the calibration and validation periods of the NX station, where VIC25, VIC50, VIC75, and VIC100 represent the models with spatial resolutions of 0.25, 0.50, 0.75, and 1.00°.

Close modal
Figure 4 shows the spatial maps of NSE, KGE, and PBIAS between the VIC25-simulated runoff and the low-fidelity VIC-simulated runoff during the training and testing periods. The performance of the same model shows no significant difference between the training period and the testing period. Furthermore, the spatial maps also show that as the simulation resolution decreases, the model's performance becomes worse. Notably, in the YTR basin, areas with large runoff deviations are mainly located in the upstream and downstream regions. These areas tend to expand as the model's spatial resolution decreases. Meanwhile, the simulation accuracy in the midstream region remains largely consistent with that of VIC25. This suggests that the deviations in the upstream and downstream regions could be a major cause of the runoff discrepancies observed at the NX station, the outlet of the YTR basin in this study.
Figure 4

Spatial maps of KGE, NSE, and PBIAS between the runoff simulated by the high-fidelity model and the runoff simulated by the low-fidelity model during the training and testing period, where VIC50, VIC75, and VIC100 represent the metrics between the model with spatial resolutions of 0.25° and the models with spatial resolutions of 0.50, 0.75, and 1.00°, respectively.

Figure 4

Spatial maps of KGE, NSE, and PBIAS between the runoff simulated by the high-fidelity model and the runoff simulated by the low-fidelity model during the training and testing period, where VIC50, VIC75, and VIC100 represent the metrics between the model with spatial resolutions of 0.25° and the models with spatial resolutions of 0.50, 0.75, and 1.00°, respectively.

Close modal

Selection of principal modes

First, EOF decomposition is performed on the results of the VIC25 model. Based on the number of grid cells, the EOF decomposition can produce up to 322 modes. Then, the cumulative percent variance explained by these 322 modes is calculated, as shown in Figure 5. It is found that the first 16 modes account for 75% of variance, while the remaining 306 modes explain the remaining 25%, which is usually considered as noise.
Figure 5

Cumulative percent of variance explained by each mode.

Figure 5

Cumulative percent of variance explained by each mode.

Close modal
To further determine the optimal number of EOF modes, VIC100, which has the largest difference from the high-fidelity model (VIC25), is used as a low-fidelity model to construct the SM (SM100). Figure 6 shows the KGE, NSE, and PBIAS values for streamflow at the NX station between SM100 and VIC25 under different numbers of EOF modes in the training and testing period. It can be observed that as the number of EOF modes increases, the performance of the SM improves across all three parameters, KGE, NSE, and PBIAS. The most significant improvement occurs when the number of modes increases from 1 to 2, indicating that even a small increase in mode complexity can substantially enhance the model's accuracy. As more modes are added, the KGE and NSE values trend closer to 1, indicating improved correlation with the high-fidelity model, while the PBIAS values move closer to 0, reflecting reduced systematic error. However, after the number of modes reaches around 16, the improvements in the evaluation metrics become less pronounced, suggesting that adding more modes beyond this point yields diminishing returns and that the model's performance has reached a stable state. This indicates an optimal balance between model complexity and simulation accuracy is around 16 modes, which is consistent with the results of the percent variance explained.
Figure 6

KGE, NSE, and PBIAS between the streamflow simulated by the high-fidelity model at the NX station and the streamflow simulated by the SM with different modes in (a)–(c) the training period and (d)–(f) the testing period.

Figure 6

KGE, NSE, and PBIAS between the streamflow simulated by the high-fidelity model at the NX station and the streamflow simulated by the SM with different modes in (a)–(c) the training period and (d)–(f) the testing period.

Close modal
The results during the training period and the testing period are relatively consistent, with the testing period showing slightly lower performance. Therefore, the analysis here focuses on the testing period. Figure 7 shows the KGE, NSE, and PBIAS values for runoff at each grid cell between SM100 and VIC25 under different numbers of EOF modes in the testing period. Figure 7 further supports the above conclusion, showing that the most notable improvement occurs when the number of modes increases from 1 to 2. This highlights a significant gain in the SM's performance with the initial increase in complexity. As the number of modes continues to increase, the improvement is gradually slowing down, and the SM's performance reaches a stable state of around 16 modes. This suggests that while initial increases in mode number greatly enhance model accuracy, adding more modes beyond this point provides minimal additional benefit. Therefore, the study selects 16 as the optimal number of modes. For consistency and to simplify the calculations, the same number of modes is also used to construct the other two SMs, i.e. SM50 and SM75.
Figure 7

(a) KGE, (b) NSE, and (c) PBIAS between the runoff of grid cells simulated by the high-fidelity model and the runoff simulated by the SM with different modes.

Figure 7

(a) KGE, (b) NSE, and (c) PBIAS between the runoff of grid cells simulated by the high-fidelity model and the runoff simulated by the SM with different modes.

Close modal

Performance of SMs constructed with different spatial resolutions

Figure 8(a) illustrates the streamflow time series of the different SMs (SM50, SM75, and SM100) compared with the high-fidelity model (VIC25). During the training period, the streamflow time series of all four models are nearly identical, indicating effective learning from the training data. In the testing period, when using the original model as a benchmark, the performance of the SMs is notable, with the KGE values of 0.90, 0.85, and 0.89; the NSE values of 0.97 for all three SMs; and the PBIAS values of −5.03, −5.68, and −6.21%, respectively. These metrics indicate that the SMs perform well, exhibiting a high degree of similarity to the high-fidelity model's results, thus validating their effectiveness as surrogates in hydrological modeling.
Figure 8

(a) Daily streamflow hydrograph and (b), (c) empirical cumulative distribution function (CDF) plots of the simulated streamflow at the NX station between the SMs (SM50, SM75, and SM100) and the high-fidelity model (VIC25), where VIC25, SM50, SM75, and SM100 represent the VIC model with spatial resolutions of 0.25° and the SMs constructed with spatial resolutions of 0.50, 0.75, and 1.00°, respectively.

Figure 8

(a) Daily streamflow hydrograph and (b), (c) empirical cumulative distribution function (CDF) plots of the simulated streamflow at the NX station between the SMs (SM50, SM75, and SM100) and the high-fidelity model (VIC25), where VIC25, SM50, SM75, and SM100 represent the VIC model with spatial resolutions of 0.25° and the SMs constructed with spatial resolutions of 0.50, 0.75, and 1.00°, respectively.

Close modal

Figure 8(b) further confirms that the streamflow time series during the training period are nearly identical across all models. However, Figure 8(a) and 8(c) reveal that the peak flows produced by the SMs are slightly lower than those of the high-fidelity model during the testing period. This suggests that while the SMs effectively replicate the overall streamflow dynamics, they may underpredict peak streamflow events. Additionally, the simulation capabilities of the three SMs are consistent with each other, indicating that they can reliably serve as surrogates for the high-fidelity model.

Figure 9 illustrates the spatial performance of the SMs at different grid cells, using the high-fidelity model as the reference, which reveals that the three SMs exhibit a high spatial consistency, with no significant differences in their simulation accuracy across the study area. Compared to Figure 4, which shows the initial performance of the low-fidelity models, Figure 9 indicates a noticeable improvement in the performance of the SMs. This enhancement highlights the effectiveness of the adjustments made to the SMs, leading to better alignment with the original model. This improvement highlights the effectiveness of the SMs, thereby getting closer to the high-fidelity model.
Figure 9

Spatial maps of KGE, NSE, and PBIAS between the runoff simulated by the high-fidelity model and the runoff simulated by the SMs during the training and testing period, where SM50, SM75, and SM100 represent the metrics between the model with spatial resolutions of 0.25° and the SMs constructed with spatial resolutions of 0.50, 0.75, and 1.00°, respectively.

Figure 9

Spatial maps of KGE, NSE, and PBIAS between the runoff simulated by the high-fidelity model and the runoff simulated by the SMs during the training and testing period, where SM50, SM75, and SM100 represent the metrics between the model with spatial resolutions of 0.25° and the SMs constructed with spatial resolutions of 0.50, 0.75, and 1.00°, respectively.

Close modal

Furthermore, Figure 9 confirms again that the SMs provide nearly identical performance to the high-fidelity model during the training period. However, during the testing period, the SMs tend to underestimate runoff in the upper reaches of the YTR basin. This underestimation is likely a contributing factor to the observed lower streamflow estimates at the NX station. Despite this, the overall consistency between the SMs and the high-fidelity model supports their utility as computationally efficient surrogates for simulating hydrological processes.

Comparison of SMs with different machine learning methods

This section compares the ability of four machine learning methods, i.e. GPR, BP, SVM, and RF, to improve the performance of the low-fidelity model. Using the high-fidelity model (VIC25) as a benchmark, Table 2 presents the performance of streamflow at the NX station simulated by the SM (SM100) with these four machine learning methods during both training and testing periods. The results indicate that the SVM model performed best during the testing period, achieving a KGE value of 0.98, an NSE value of 0.99, and a PBIAS value of 0.14%. This suggests that SVM is highly effective at enhancing the SM's accuracy. The BP model follows closely, with a KGE value of 0.94, an NSE value of 0.99, and a PBIAS value of −0.54%, showing solid performance in error metrics. The RF and GPR models exhibit comparatively weaker performance. Figure 10 illustrates the performance of different models in simulating grid cell runoff during the testing period. It is evident from the figure that SVM and BP models outperform GPR and RF in terms of spatial accuracy. The spatial distributions of the SVM and BP models are closer to VIC25, showing a slightly better ability to capture the runoff variations across different grid cells than GPR and RF. Despite a slight difference in model accuracy, all four models demonstrate satisfactory performance overall, successfully improving the accuracy of the low-fidelity models when compared to the baseline.
Table 2

Performance between the streamflow simulated by the high-fidelity model and the streamflow simulated by the SM with machine learning methods at the NX station

MethodTrainingTesting
KGENSEPBIAS (%)KGENSEPBIAS (%)
GPR 1.00 1.00 0.00 0.88 0.97 −6.21 
BP 0.99 0.99 0.12 0.94 0.99 −0.54 
SVM 0.99 0.99 0.37 0.98 0.99 0.14 
RF 1.00 1.00 −0.02 0.92 0.98 −3.44 
MethodTrainingTesting
KGENSEPBIAS (%)KGENSEPBIAS (%)
GPR 1.00 1.00 0.00 0.88 0.97 −6.21 
BP 0.99 0.99 0.12 0.94 0.99 −0.54 
SVM 0.99 0.99 0.37 0.98 0.99 0.14 
RF 1.00 1.00 −0.02 0.92 0.98 −3.44 
Figure 10

Performance between the runoff of grid cells simulated by the high-fidelity model and the runoff simulated by the SM with machine learning methods in the testing period.

Figure 10

Performance between the runoff of grid cells simulated by the high-fidelity model and the runoff simulated by the SM with machine learning methods in the testing period.

Close modal
Figure 11

Distribution of the glacier and six subwatersheds in the YTR, referring to Gu et al. (2020b).

Figure 11

Distribution of the glacier and six subwatersheds in the YTR, referring to Gu et al. (2020b).

Close modal

Efficiency evaluation

Another key point of the SM is the efficiency. In this study, the SM and the VIC models are operated with one processor of an Intel i9-13900HX Central Processing Unit (CPU) on the Ubuntu 22.04 platform. Based on the practical application of the SMs, this study uses the testing period to evaluate the computational efficiency of the models. Table 3 presents the time required to compute runoff for the testing period using different resolution VIC models under the same operating environment. The computational saving index (Razavi et al. 2012) is used to measure the efficiency gains of surrogates compared with VIC. For VIC, the consumed time decreases remarkably with the simulated spatial resolution from 0.25 to 1.00°. For the LSP model, it requires only a few seconds to reconstruct the low-fidelity model into the high-fidelity model. As a result, the computational time of the LSP model and the routing process is almost negligible compared to the simulation time required by the VIC model. It is noteworthy that the time required for model spatial resolution decreases from 25 to 50, 75, and 100° does not reduce in a proportional manner. This is because, at very low spatial resolutions, more grid cells are needed to ensure complete coverage of the basin. Consequently, the computational time does not scale down linearly with the reduction in resolution. Overall, the simulation efficiency of the SMs shows a significant improvement compared to the high-fidelity model, particularly with SM100, which saves 86.31% of the computational time.

Table 3

Comparison of the time consumed by VIC and the SM

ModelConsumed time (s)Computational saving (%)
VICEOFRoutingTotal
VIC25 1,207 – 1,213 – 
SM50 420 12 438 63.89 
SM75 225 12 243 79.97 
SM100 158 12 166 86.31 
ModelConsumed time (s)Computational saving (%)
VICEOFRoutingTotal
VIC25 1,207 – 1,213 – 
SM50 420 12 438 63.89 
SM75 225 12 243 79.97 
SM100 158 12 166 86.31 

Note. VIC’ is the time consumed with VIC; ‘EOF’ is the time consumed with the EOF model; ‘Routing’ is the time consumed for routing the streamflow at the outlet station; ‘Total’ is the total time consumed with SM (VIC + EOF + Routing); the computational saving index δ is calculated with the formula ‘δ = (tvts)/tv × 100%’, where tv is the time consumed by VIC and ts is the total time consumed by SMs.

Analysis of spatial results

Figures 4 and 9 illustrate a notable improvement in the spatial simulation accuracy of the SMs compared to the low-fidelity models. However, certain areas still exhibit lower accuracy in the simulations. Interestingly, these areas correspond to Cluster 2 and Cluster 4, as defined in the study by Gu et al. (2020b), which divided the YTR basin into six subbasins, as shown in Figure 11. Cluster 1 and Cluster 2 are located in transition zones between glacier-covered and non-glacier regions. These zones present unique modeling challenges due to the complex hydrological processes, such as snowmelt, glacier dynamics, and varying precipitation patterns. The low-fidelity models, for their larger spatial resolution, are unable to effectively simulate this information. This leads to the difficulty for the GPR model to fully capture the complex processes from the low-fidelity models. Improving model accuracy in these regions might require the integration of additional hydrogeological data. Additionally, we found that using more complex machine learning models, such as SVM and BP, can improve the spatial simulation accuracy of the SMs. This suggests that using more advanced machine learning methods can help address the challenges of simulating hydrological processes in complex regions.

Limitation and perspective

While the SM improves computational efficiency for low-fidelity models, it still faces challenges in accurately capturing fine-scale spatial details in the high-fidelity model. In some cases, the model may still struggle to preserve the full spatial heterogeneity of the hydrological processes, particularly for areas with highly dynamic or complex topography. The model is primarily tested in the YTR basin, and its performance may vary when applied to different geographical regions with distinct climatic and hydrological characteristics. The transferability of SM to other basins needs further validation to assess its robustness across different hydrological regimes. Future work should focus on applying SM to other basins with varying hydrological regimes and climate conditions to test its generalizability. Additionally, expanding the model to multi-scale applications (e.g. from regional to global scales) could further demonstrate its potential for large-scale hydrological assessments.

This study has not considered global climate models (GCMs) or shared socioeconomic pathways (SSPs) due to the focus on improving the computational efficiency and accuracy of the low-fidelity model. Including GCMs and SSPs will significantly increase the complexity and computational cost of the analysis. In future work, GCMs and SSPs will be considered in conjunction with SM for assessing the impacts of future climate scenarios on streamflow and water resources. This could enable the model to provide valuable insights into how hydrological behavior may change under different climate change trajectories, providing better guidance for long-term water resource management and climate adaptation planning. In addition, the ability of SM to significantly reduce computational time opens up opportunities for its integration with real-time hydrological forecasting systems. By integrating numerical weather forecast data, SM could provide rapid flood predictions and inform flood management decisions.

To improve the efficiency of the VIC model and alleviate the burden of computation, the VIC SM based on a PIML called the LSP model was proposed and explored in the YTR basin. By conducting EOF decomposition on low-fidelity models, the spatial and temporal features were extracted successfully, whereas the spatial features were from the high-fidelity model. The machine learning algorithms were then employed to correct the temporal feature series from the low-fidelity models. By utilizing these spatial and temporal features, high-fidelity spatial simulation was reconstructed. The main results we found are as follows:

  • (1) As the spatial resolution of the VIC model decreases, the capacity of both the spatial runoff simulation and the streamflow simulation declines. Specifically, at a spatial resolution of 1.00°, the model's performance is poorest, with a KGE value of 0.63, an NSE value of 0.73, and a PBIAS value of 35.93% in the validation period, using observation as a benchmark.

  • (2) The LSP model effectively enhances the simulation accuracy of low-fidelity models. SM100 developed from VIC100, benchmarked against SM25, shows notable improvement during the testing period, with a KGE value of 0.88, an NSE value of 0.97, and a PBIAS value of −6.21%. This represents a notable enhancement compared to VIC100.

  • (3) Different machine learning algorithms have varying effects on the LSP model's performance. Compared to GPR and RF, SVM, and BP show improvements in spatial simulation and peak flow simulation at the NX station. Overall, the SMs developed with different machine learning algorithms achieve acceptable simulation accuracy.

  • (4) SMs built from low-fidelity models with different spatial resolutions exhibit similar simulation performance. However, simulation efficiency increases as the spatial resolution of the base low-fidelity model increases. Notably, SM100 can reduce computational time by 86.31% compared to VIC25.

This study is financially supported by the National Natural Science Foundation of China (52209036; 52309038; and 52479028), the National Key Research and Development Program of China (2021YFC3201105), and the Natural Science Foundation of Zhejiang Province (LY22E090010; LMS25E090004). We are grateful to the National Tibetan Plateau Data Center (http://data.tpdc.ac.cn) and the National Cryosphere Desert Data Center (http://www.ncdc.ac.cn) for providing the datasets for modeling.

All relevant data are available from an online repository or repositories: CMFD data are downloaded from https://doi.org/10.11888/AtmosphericPhysics.tpe.249369.file. SRTM DEM data are downloaded from https://earthexplorer.usgs.gov/. Soil data are downloaded from https://www.fao.org/soils-portal/soil-survey/soil-maps-and-databases/harmonized-world-soil-database-v12/en/. WestDC landcover data are downloaded from https://cstr.escience.org.cn/CSTR:11738.11.ncdc.Westdc.2020.678. The Second Glacier Inventory of China can be accessed at https://doi.org/10.3972/glacier.001.2013.db.

The authors declare there is no conflict.

Andreadis
K. M.
,
Storck
P.
&
Lettenmaier
D. P.
(
2009
)
Modeling snow accumulation and ablation processes in forested environments
,
Water Resources Research
,
45
(
5
),
W05429
.
Beale
M. H.
,
Hagan
M. T.
&
Demuth
H. B.
(
2010
)
Neural network toolbox™ user's guide. Natick, MA, The Mathworks Inc
.
Behzadian
K.
,
Kapelan
Z.
,
Savic
D.
&
Ardeshir
A.
(
2009
)
Stochastic sampling design using a multi-objective genetic algorithm and adaptive neural networks
,
Environmental Modelling & Software
,
24
(
4
),
530
541
.
Bowling
L. C.
,
Pomeroy
J. W.
&
Lettenmaier
D. P.
(
2004
)
Parameterization of blowing-snow sublimation in a macroscale hydrology model
,
Journal of Hydrometeorology
,
5
(
5
),
745
762
.
Chen
X.
,
Long
D.
,
Hong
Y.
,
Zeng
C.
&
Yan
D.
(
2017
)
Improved modeling of snow and glacier melting by a progressive two-stage calibration strategy with GRACE and multisource data: how snow and glacier meltwater contributes to the runoff of the upper Brahmaputra River basin?
Water Resources Research
,
53
(
3
),
2431
2466
.
Cherkauer
K. A.
&
Lettenmaier
D. P.
(
1999
)
Hydrologic effects of frozen soils in the upper Mississippi River basin
,
Journal of Geophysical Research: Atmospheres
,
104
(
D16
),
19599
19610
.
Cherkauer
K. A.
,
Bowling
L. C.
&
Lettenmaier
D. P.
(
2003
)
Variable infiltration capacity cold land process model updates
,
Global and Planetary Change
,
38
(
1–2
),
151
159
.
Cortes
C.
&
Vapnik
V.
(
1995
)
Support-vector networks
,
Machine Learning
,
20
,
273
297
.
Donnelly
J.
,
Abolfathi
S.
,
Pearson
J.
,
Chatrabgoun
O.
&
Daneshkhah
A.
(
2022
)
Gaussian process emulation of spatio-temporal outputs of a 2D inland flood model
,
Water Research
,
225
,
119100
.
Fischer
G.
,
Nachtergaele
F.
,
Prieler
S.
,
van Velthuizen
H. T.
,
Verelst
L.
&
Wiberg
D.
(
2008
)
Global Agro-Ecological Zones Assessment Foragriculture (GAEZ 2008)
.
Laxenburg, Austria/Rome, Italy/Cambridge
:
FAO/IIASA
.
Fraehr
N.
,
Wang
Q. J.
,
Wu
W.
&
Nathan
R.
(
2023a
)
Supercharging hydrodynamic inundation models for instant flood insight
,
Nature Water
,
1
(
10
),
835
843
.
Fraehr
N.
,
Wang
Q. J.
,
Wu
W.
&
Nathan
R.
(
2023b
)
Development of a fast and accurate hybrid model for floodplain inundation simulations
,
Water Resources Research
,
59
(
6
),
e2022WR033836
.
Gong
W.
,
Duan
Q.
,
Li
J.
,
Wang
C.
,
Di
Z.
,
Ye
A.
,
Miao
C.
&
Dai
Y.
(
2016
)
Multiobjective adaptive surrogate modeling-based optimization for parameter estimation of large, complex geophysical models
,
Water Resources Research
,
52
(
3
),
1984
2008
.
Gu
H.
,
Xu
Y. P.
,
Ma
D.
,
Xie
J.
,
Liu
L.
&
Bai
Z.
(
2020b
)
A surrogate model for the variable infiltration capacity model using deep learning artificial neural network
,
Journal of Hydrology
,
588
,
125019
.
Gu
H.
,
Xu
Y. P.
,
Liu
L.
,
Xie
J.
,
Wang
L.
,
Pan
S.
&
Guo
Y.
(
2023
)
Seasonal catchment memory of high mountain rivers in the Tibetan plateau
,
Nature Communications
,
14
(
1
),
3173
.
Gu
H.
,
Xu
Y. P.
,
Wang
L.
,
Ma
D.
,
Liang
X.
,
Guo
Y.
&
Liu
L.
(
2024a
)
Seasonal streamflow forecasting by surrogate modeling in the Yarlung Zangbo River basin, China
.
Journal of Hydrology: Regional Studies
,
53
,
101835
.
Gu
H.
,
Liu
L.
,
Xu
Y. P.
,
Ma
D.
,
Xie
J.
&
Yu
X.
(
2024b
)
Estimation of seasonal precipitation memory curves for major rivers in the Tibetan plateau based on GRACE satellites data
,
Journal of Hydrology: Regional Studies
,
55
,
101942
.
Guo
W.
,
Liu
S.
,
Xu
J.
,
Wu
L.
,
Shangguan
D.
,
Yao
X.
,
Wei
J.
,
Bao
W.
,
Yu
P.
&
Liu
Q.
(
2015
)
The second Chinese glacier inventory: data, methods and results
,
Journal of Glaciology
,
61
(
226
),
357
372
.
Gupta
H. V.
,
Kling
H.
,
Yilmaz
K. K.
&
Martinez
G. F.
(
2009
)
Decomposition of the mean squared error and NSE performance criteria: implications for improving hydrological modelling
,
Journal of Hydrology
,
377
(
1–2
),
80
91
.
He
J.
,
Yang
K.
,
Tang
W.
,
Lu
H.
,
Qin
J.
,
Chen
Y.
&
Li
X.
(
2020
)
The first high-resolution meteorological forcing dataset for land process studies over China
,
Scientific Data
,
7
(
1
),
25
.
Hu
R.
,
Fang
F.
,
Pain
C. C.
&
Navon
I. M.
(
2019
)
Rapid spatio-temporal flood prediction and uncertainty quantification using a deep learning method
,
Journal of Hydrology
,
575
,
911
920
.
Karniadakis
G. E.
,
Kevrekidis
I. G.
,
Lu
L.
,
Perdikaris
P.
,
Wang
S.
&
Yang
L.
(
2021
)
Physics-informed machine learning
,
Nature Reviews Physics
,
3
(
6
),
422
440
.
Kling
H.
,
Fuchs
M.
&
Paulin
M.
(
2012
)
Runoff conditions in the upper Danube basin under an ensemble of climate change scenarios
,
Journal of Hydrology
,
424–425
,
264
277
.
Li
F.
&
Liu
C.
(
2009
). '
Application study of BP neural network on stock market prediction
’,
IEEE. 2009 Ninth International Conference on Hybrid Intelligent Systems
.
Shenyang, China
, pp.
174
178
.
Liang
X.
,
Lettenmaier
D. P.
,
Wood
E. F.
&
Burges
S. J.
(
1994
)
A simple hydrologically based model of land surface water and energy fluxes for general circulation models
,
Journal of Geophysical Research Atmospheres
,
99
(
D7
),
14415
14428
.
Liang
X.
,
Lettenmaier
D. P.
&
Wood
E. F.
(
1996
)
One-dimensional statistical dynamic representation of subgrid spatial variability of precipitation in the two-layer variable infiltration capacity model
,
Journal of Geophysical Research Atmospheres
,
101
(
D16
),
21403
21422
.
Lohmann
D.
,
Raschke
E.
,
Nijssen
B.
&
Lettenmaier
D. P.
(
1998
)
Regional scale hydrology: i. formulation of the VIC-2L model coupled to a routing model
,
Hydrological Sciences Journal
,
43
(
1
),
131
141
.
Lu
G.
,
Xiao
H.
,
Wu
Z.
,
Zhang
S.
&
Li
Y.
(
2013
)
Assessing the impacts of future climate change on hydrology in Huang-Huai-Hai region in China using the PRECIS and VIC models
,
Journal of Hydrologic Engineering
,
18
(
9
),
1077
1087
.
Maier
H. R.
,
Kapelan
Z.
,
Kasprzyk
J.
,
Kollat
J.
,
Matott
L. S.
,
Cunha
M. C.
,
Dandy
G. C.
,
Gibbs
M. S.
,
Keedwell
E.
&
Marchi
A.
(
2014
)
Evolutionary algorithms and other metaheuristics in water resources: current status, research challenges and future directions
,
Environmental Modelling & Software
,
62
,
271
299
.
Mariano
C.
&
Monica
B.
(
2021
)
A random forest-based algorithm for data-intensive spatial interpolation in crop yield mapping
,
Computers and Electronics in Agriculture
,
184
,
106094
.
Marukatat
S.
(
2023
)
Tutorial on PCA and approximate PCA and approximate kernel PCA
,
Artificial Intelligence Review
,
56
(
6
),
5445
5477
.
Mishra
B. R.
,
Vogeti
R. K.
,
Jauhari
R.
,
Raju
K. S.
&
Kumar
D. N.
(
2024
)
Boosting algorithms for projecting streamflow in the lower Godavari basin for different climate change scenarios
,
Water Science and TechnologyWater Science and Technology
,
89
(
3
),
613
634
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
(
2007
)
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
,
Transactions of the ASABE
,
50
(
3
),
885
900
.
Mosimann
M.
,
Kauzlaric
M.
,
Schick
S.
,
Martius
O.
&
Zischg
A. P.
(
2024
)
Evaluation of surrogate flood models for the use in impact-based flood warning systems at national scale
,
Environmental Modelling & Software
,
173
,
105936
.
Nash
J. E.
&
Sutcliffe
J. V.
(
1970
)
River flow forecasting through conceptual models part I – a discussion of principles
,
Journal of Hydrology
,
10
(
3
),
282
290
.
Ran
Y.
,
Li
X.
&
Lu
L.
(
2010
)
Evaluation of four remote sensing based land cover products over China
,
International Journal of Remote Sensing
,
31
(
2
),
391
401
.
Razavi
S.
&
Tolson
B. A.
(
2013
)
An efficient framework for hydrologic model calibration on long data periods
,
Water Resources Research
,
49
(
12
),
8418
8431
.
Razavi
S.
,
Tolson
B. A.
&
Burn
D. H.
(
2012
)
Review of surrogate modeling in water resources
,
Water Resources Research
,
48
(
7
),
W07401
.
Rumelhart
D. E.
,
Hinton
G. E.
&
Williams
R. J.
(
1986
)
. Learning representations by back-propagating errors
,
Nature
,
323
(
6088
),
533
536
.
Schölkopf
B.
,
Platt
J. C.
,
Shawe-Taylor
J.
,
Smola
A. J.
&
Williamson
R. C.
(
2001
)
Estimating the support of a high-dimensional distribution
,
Neural Computation
,
13
(
7
),
1443
1471
.
Sharma
A.
&
Tiwari
K. N.
(
2014
)
A comparative appraisal of hydrological behavior of SRTM DEM at catchment level
,
Journal of Hydrology
,
519
,
1394
1404
.
Sridhar
V.
,
Ali
S. A.
&
Lakshmi
V.
(
2019
)
Assessment and validation of total water storage in the chesapeake Bay watershed using GRACE
,
Journal of Hydrology: Regional Studies
,
24
,
100607
.
Su
F.
,
Zhang
L.
,
Ou
T.
,
Chen
D.
,
Yao
T.
,
Tong
K.
&
Qi
Y.
(
2016
)
Hydrological response to future climate changes for the major upstream river basins in the Tibetan plateau
,
Global and Planetary Change
,
136
,
82
95
.
Tsai
W.
,
Feng
D.
,
Pan
M.
,
Beck
H.
,
Lawson
K.
,
Yang
Y.
,
Liu
J.
&
Shen
C.
(
2021
)
From calibration to parameter learning: harnessing the scaling effects of big data in geoscientific modeling
,
Nature Communications
,
12
(
1
),
5988
.
Vogeti
R. K.
,
Raju
K. S.
,
Nagesh Kumar
D.
,
Rajesh
A. M.
,
Somanath Kumar
S. V.
&
Jha
Y. S. K.
(
2023
)
Application of hydrological models in climate change framework for a river basin in India
,
Journal of Water and Climate Change
,
14
(
9
),
3150
3165
.
Vogeti
R.K.
,
Jauhari
R.
,
Mishra
B. R.
,
Raju
K. S.
&
Nagesh Kumar
D.
(
2024
)
Deep learning algorithms and their fuzzy extensions for streamflow prediction in climate change framework
,
Journal of Water and Climate Change
,
15
(
2
),
832
848
.
Wang
C.
,
Jiang
S.
,
Zheng
Y.
,
Han
F.
,
Kumar
R.
,
Rakovec
O.
&
Li
S.
(
2024
)
Distributed hydrological modeling with physics-encoded deep learning: a general framework and its application in the Amazon
,
Water Resources Research
,
60
(
4
),
e2023WR036170
.
Williams
C. K.
&
Rasmussen
C. E.
(
2006
)
Gaussian Processes for Machine Learning, 2
.
Cambridge, MA
:
MIT Press
.
Yang
K.
&
He
J.
(
2019
)
China Meteorological Forcing Dataset (1979–2018)
.
Beijing, China
:
National Tibetan Plateau Data Center
.
Yaseen
Z. M.
(
2023
)
A new benchmark on machine learning methodologies for hydrological processes modelling: a comprehensive review for limitations and future research directions
,
Knowledge-Based Engineering and Sciences
,
4 (3), 65–103
.
You
Q.
,
Kang
S.
,
Wu
Y.
&
Yan
Y.
(
2007
)
Climate change over the Yarlung Zangbo River basin during 1961–2005
,
Journal of Geographical Sciences
,
17
(
4
),
409
420
.
Zhou
Y.
,
Wu
W.
,
Nathan
R.
&
Wang
Q. J.
(
2021
)
A rapid flood inundation modelling framework using deep learning with spatial reduction and reconstruction
,
Environmental Modelling & Software
,
143
,
105112
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).