Accurate hydrological simulations comply with the water (sixth) Sustainable Development Goals (SDGs). The study investigates the utility of ANN and SVR, as well as the post-simulation bias treatment of these simulations at Swat River basin, Pakistan. For this, climate variables were lag adjusted for the first time, then cross-correlated with the flow to identify the most associative delay time. In sensitivity analysis, seven combinations were selected as input with suitable hyperparameters. For SVR, grid search cross-validation determined the optimal set of hyper-parameters, while for ANN, neurons and hidden layers were optimized by trial and error. We ran model by using optimized hyperparameter configurations and input combinations. In comparison to SVRs (Root mean square error (RMSE) 34.2; mean absolute error (MAE) 3.0; CC 0.91) values, respectively, ANN fits the observations better than SVR with (RMSE 11.9; MAE 1.14; CC 0.99). Linear bias-corrected simulations greatly improved ANN performance (RMSE 3.98; MAE 0.625; CC 0.99), while the improvement was slight in the case of SVR (RMSE 35; MAE 0.58; CC 0.92). On seasonal scale, bias-corrected simulations remedy low- and high-flow seasonal discrepancies. Flow duration analysis results reveal deviation at low- and high-flow conditions by models, which were then reconciled by applying bias corrections.

  • The study represents a couple of AI prognostic models with bias scaling on the Swat Basin case study.

  • ANN performs best with higher input parameters, while SVR performs robustly with lower input parameters.

  • Bias scaling of SVR (SVR-BC) improves in depicting peaks.

  • Bias correction of ANN yields better flow series having minimum errors than other models.

Water is necessary for sustaining the biosphere. Humans engineer its uses for food, energy, and sustenance. However, no matter how much resourceful it is, it could be disastrous in the form of floods, GLOFs, cloudbursts, etc. This shows that the timely prediction regarding water systems is valuable for managing necessary supplies for hydropower generation, water supply, irrigation, etc., and helps avoid water-related damages (early floods warning, dam breaks, etc.). Hence, several studies have been conducted on water prediction/forecasting, specifically addressing river flows (Wang et al. 2009; Hadi & Tombul 2018; Adnan et al. 2019). River hydrology has high relevance with the supply of fresh water. Timely prediction is imperative for fulfilling the demand for communities, industries, etc. (Panagopoulos 2021, 2022; Panagopoulos & Giannika 2022).

Typically, there are two ways of forecasting streamflow: conventional modeling and data-driven modeling for predicting streamflow. Conventional models are based on mathematical equations and conceptual representations of the underlying processes that describe the real physical system of the basin (Hassan et al. 2014). For imitating a real scenario, a conventional model needs to incorporate a variety of climatic and physiographic parameters that could be of high data demand. For instance, almost all the models required precipitation and temperature as their primary model input, while other climate parameters also needed are humidity, evapotranspiration, wind speed, solar radiation, etc. Physiographic parameters generally comprise surface roughness, slope, elevation, etc. Fulfilling these data demands is an onerous task for model parameterization; on the other hand, deficient quantities could limit the performance of streamflow prediction. Additionally, the streamflow process is also influenced by spatiotemporal irregularity in the watershed characteristics such as a change in surface roughness with a change in areal land use, monthly, annually, etc. (Ali & Shahbaz 2020). For the sake of simplicity, these models neglect such spatiotemporal characteristics and consider them temporally stationary which could induce uncertainty in model inferences. Conventional modeling for forecasting stream flow includes hydrological models such as HEC-HMS, SWAT, VIC, HBV, and ARMAX models.

Alternatively, data-driven modeling includes different machine-learning approaches. Among several approaches, artificial neural networking (ANN) and support vector machine (SVM) are popularly being used for hydrological applications (Tongal & Booij 2018; Essam et al. 2022). Previously several studies have compared their performances with conventional models and reported better results. For instance, Hsu et al. (1995) showed that the ANN model provided a better representation of the rainfall-runoff relationships than the ARMAX time series approach or the conceptual Sacramento model. Zealand et al. (1999) explored the capability of ANN and confirmed its superiority over conventional models during training and testing periods. Demirel et al. (2009) compared the ANN model and the popular SWAT model; their study suggested that the ANN model is more effective in predicting peak flow than the SWAT model. Kerh & Lee (2006) applied a backpropagation neural networking model and found that the ANN model performs relatively better than that of the conventional Muskingum method. The ANN model is applied to forecast river flow in comparison to an analytic power model. They found that the neural network model performed better than the analytic power model (Karunanithi et al. 1994). Chiang et al. (2022) after conducting a study on the rural watershed in Taiwan concluded that SVR performed better than the hydrologic modeling system (HEC-HMS). Furthermore, in the last decade, SVM which is one of the soft computational techniques has been successfully used in hydrology and proved a better alternative to overcome some of the basic gaps in the application of ANN models (Adnan et al. 2017). Khan & Coulibaly (2006) used SVM models to predict lake water levels in comparison to a multilayer perceptron (MLP) and multiplicative seasonal autoregressive model (SAR). They found that SVM prediction accuracy is better than the other two models for multi-month ahead streamflow prediction. Çimen & Kisi (2009) compared the potential of SVM and ANN in modeling lake-level fluctuations in Turkey. Adnan et al. (2017) conducted a study in which a monthly stream flow prediction model was created using ANN and SVM. SVM was found to be more accurate in forecasting streamflow than ANN. Guo et al. (2011) predicted monthly river discharges. They explored that the SVM outperformed the ANN models.

In this study, the Swat River was selected, and less research was done on this area. Additionally, it is a crucial resource for leveraging the regional economy (Shah et al. 2016) and building hydropower capacity (Sabir et al. 2014). Large agricultural lands consistently suffer from flooding every year. Despite having a favorable location and gradient, the Swat River basin was not previously explored by government authorities for hydropower development until recently. In this regard, hydrological studies are crucial for informing managers regarding flow quantities and formulating water policies accordingly. The region also lacks climate observatories which is the major constraint in understanding the hydrologic processes of the basin. This could affect the model's capacity to comprehend complex processes, for instance, actual snow melting response as several intricate processes (melting rate at high latitude, etc.,) remain hidden and unknown. Additionally, an orographic gradient in high altitude also differs between low spatial instances, making it onerous to capture localized precipitation response. In this way, process-based modeling in these conditions would be susceptible to uncertainty, defective parameterization and bias.

While the process-based model suffers, in this study, the data-driven model is assumed to be more suitable under the given conditions as it could skip underlying physical processes and could directly relate climate variables to flow in an efficient manner. Nevertheless, one notable limitation of data-driven model is that it has inheriting bias toward the quantity of the dataset as they favor data in the normal distributed range (proximity to mean). Due to this flow the predicting model could generate a weaker response against the value that lies in extreme zones (months consist of frequent outliers or extreme values) hence creating annual recurrent bias in the model simulation. Therefore, for the removal of these persistent errors, each month of the flow series is needed to be treated individually. However, almost all previously conducted studies ignore the post-simulation treatment of flow series biases.

This research intends to use ANN and SVR models for predicting the 10-daily streamflow of the Swat basin at the Khawazakhela bridge. It will exploit the machine learning schemes by setting out comprehensive procedures for deriving optimal inferences. The 10-daily flow series was used as it is a moderate temporal resolution commonly used by industry practitioners in applied hydrology. Necessary climate data of 5 years (2008–2012) acquired from Pakistan Meteorological Department (PMD) and the irrigation department KPK were used in this examination. This paper is organized in the following order. The study area is explained in section 2. The SVR and ANN model architecture, statistical evaluators, and bias correction method are discussed in section 3. The Pre-modeling analyses including time lag adjustment, setting up model, and sensitivity analysis are represented in section 4. Outcomes of the study are explained in section 5. Finally, conclusions are drawn in section 6.

The study basin is situated in the northern part of Pakistan from 35.87° N to 34.94° S latitude and 72.165° W to 72.873° E longitude (Figure 1). The watershed has a drainage area of 3,584 km2. The region includes high altitude, rugged terrain, and a snowy basin. The mean elevation above sea level of the Swat basin is recorded as 3,521 meters. The climate of the Swat basin is colder in winter to pleasant in summer. This region receives precipitation from both systems, the southern summer monsoon system and the winterly western Mediterranean system. According to 5-year records of the meteorological stations at Kalam, the Swat basin received mean annual precipitation of 1,176.1 mm. For the same period, the mean temperature of the catchment varied from 4.5 to 17.9 °C. The Flow of the Swat basin is directly affected by glacier melts and monsoonal rain. The recorded average annual flow of the same 5-year period is 80.4 cumec at Khawazakhela gauge station.

Support vector regression

SVM is among the most popular and widely used data-driven approaches for hydrological applications (Parisouj et al. 2020). Support vector regression (SVR) is a sub-type of SVM which is used to describe regression by Support vectors. The SVR estimates function from the datasets (x, y), in which x represents the input vector and y represents the output. The regression function could be mathematically represented in the following equation:
formula
(1)
where ‘‘ is a non-linear function where x is mapped into a feature space, while b and x represent the weight vector and coefficient that have to be estimated from the data, respectively (Kalteh 2013).
formula
(2)
where
formula
(3)
Equation (3) represents the e-insensitive loss function. While in Equation (2), if the constant variable C> 0 then it implies an interdependency between approximated error and weight vector ||w||. ‘e’ is also known as the tube size that is equal to the approximated accuracy from the training model of data points. Both ‘C’ and ‘e’ have to be chosen in advance by the user (Kalteh 2013).
Figure 1

Location map of the Swat River basin.

Figure 1

Location map of the Swat River basin.

Close modal

Artificial neural network

An artificial neural network (ANN) is a non-linear black box statistical approach. It is a computational model that is inspired by the nervous system of humans, and it is composed of a high number of simple but densely interconnected processing elements or artificial neurons (Chen et al. 2013). An artificial neuron or more commonly known as a perceptron in ANN performs two major functions. First, it applies a summation function to calculate the total weighted sum of all the inputs to the neuron and also adds a bias value (constant) to it. Secondly, it applies an activation or transformation function on the net input to produce an output, as shown in Figure 2.
Figure 2

Representation of the statistical model of an artificial neuron.

Figure 2

Representation of the statistical model of an artificial neuron.

Close modal
These artificial neurons are interconnected with each other in a non-linear structure. The structure is composed of mainly three layers: an input layer, one or more hidden layers, and an output layer, as shown in Figure 3. The number of neurons in the input and output layer is equal to the number of inputs and outputs, respectively.
Figure 3

ANN architecture for modeling climate-streamflow relationship with one hidden layer.

Figure 3

ANN architecture for modeling climate-streamflow relationship with one hidden layer.

Close modal
In Figure 3, the links between each neuron of the input and hidden layer have weights and those weights are labeled as ‘‘. Each neuron in the hidden layer calculates the weighted sum of the neurons of the input layer as by the following equation (Ballı & Tarımer 2013).
formula
(4)
The activation function determines the activation level of the neuron by scaling the net input. There are many kinds of activation functions, but in this ANN architecture, a rectified linear unit (ReLU) function is used. The ReLU function is widely used because the tanh and sigmoid functions are more expensive than the ReLU function (Maas et al. n.d.; Xu et al. 2015). The ReLU function is mathematically represented in the following equation .
formula
(5)

Model's performance evaluation

The mean absolute error (MAE) value is calculated by taking the average of absolute error (difference between actual and predicted values). The absolute error is a function that is used to return a positive number. So, the difference between actual and predicted values will always be returned as positive (Adnan et al. 2017):
formula
(6)
where, ‘n’ is the total number of values, ‘ is the predicted value while ‘ is represented as the actual value and the error is ‘ (difference between the actual value and predicted value). The range of MAE value is .
The root mean square error (RMSE) value is the root square of the average of the square of the error (between the actual and predicted values) and is mathematically represented as follows (Adnan et al. 2017):
formula
(7)
where ‘n’ is the total number of values, ‘ is the predicted value while ‘ is represented as the actual value and the error is ‘ (difference between the actual value and predicted values). The range of RMSE value is .
Pearson correlation coefficients are used to measure the relation's strength between two variables. It is a popular type of CC and is defined as a linear relationship between actual and predicted datasets. And it is mathematically represented as follows (Hao et al. 2022):
formula
(8)

where ‘x’ is the actual value, while ‘y’ is the predicted value, while ‘’ and ‘’ are the mean of x and y, respectively. Its values range between −1 and 1, with 1 being the ideal value.

Bias correction

The model inherently contains model bias which could be detrimental to the quality of the simulation. Nowadays, various approaches are being used to treat such simulation biases, varying from structurally complex to simpler ones (linear scaling, Quantile delta mapping, etc.) (Syed et al. 2022; Usman et al. 2022). These approaches typically treat climate variables derived from numerical simulations. Nevertheless, treatment can also be applied to reconcile flow simulations as they also entail analogous procedures. For this study, a simpler linear bias scaling was chosen as it is relatively easier to apply and report to produce good results (Shrestha et al. 2017). Equations are provided below in the context of flow training and testing simulation models (Syed et al. 2022).
formula
(9)
formula
(10)
where V is the flow variable, is the unit time which is 10 daily, M is the long-term mean, * indicates linear bias-corrected. However, subscript refers to trained flow model simulation, stands for the actual flow of the training phase, and is the tested flow model simulation.
A process was followed for the refinement of data as illustrated in the graphical abstract.
Figure 4

Time lag adjustment analysis (left) and time-lagged adjusted series of highly correlated variables (right).

Figure 4

Time lag adjustment analysis (left) and time-lagged adjusted series of highly correlated variables (right).

Close modal

Time lag adjustment:

Data-driven modeling produces a better result when input variables possess a strong associative relationship with the target set. However, in hydrological systems, phenomena such as flow routine, infiltration and, obstructions delay the flow response and obscure the associative relationship. For this time lag adjustment must be carried out to seek adequate correlation. In this study, four variables were considered for analysis, namely, precipitation (P), synthesized accumulated precipitation (AP), minimum temperature (Tn), and maximum temperature (Tx). Variables are allowed to lag for several days and correlation against each daily lag was then computed. Plots of each variable are given in Figure 4.
Figure 5

(a1,b1) represents actual and simulated flow and (a2,b2) showing scatterplot of actual and simulated values SVR and SVR-BC and ANN and ANN-BC models.

Figure 5

(a1,b1) represents actual and simulated flow and (a2,b2) showing scatterplot of actual and simulated values SVR and SVR-BC and ANN and ANN-BC models.

Close modal

The results suggest that precipitation variables depicted a weaker correlation with the flow. However, synthesized AP was found relatively better. A higher correlation of temperature indicates that the catchment regime is majorly governed by snow hydrology as temperature regulates the amount of water in the catchment by freezing and melting snowpacks. Nevertheless, the determined suitable lag times are as follows; For AP lag equals 16 days, P lag equals 16 days, Tx lag equals 17 days, and Tn lag equals 17 days.

Setting up model

Before developing a machine-learning model, input data are required to be organized in the machine-learning-compatible format. The procedure includes normalizing or scaling data and defining training and testing proportions. For this, time-lagged adjusted variables from the previous analysis were selected as input variables and are then further processed. Oftentimes, the data consist of discrepant magnitudes that could be erroneously interpreted by the modeling scheme. To compensate for this constraint, the data need to be scaled, this process is called normalization. For this study, the Min–Max normalization approach was adopted. It allows us to scale the data between 0 and 1. For reserving the appropriate size for training, data were split into 80:20 training and testing ratio.

Sensitivity analysis

Sensitivity analysis was performed to recognize the appropriate number of input combinations and model hyperparameter configurations. For this, firstly seven input combinations were developed considering previously selected suitable time-lagged variables. These combinations are kept same for both ANN and SVR models and were tested against respective model configurations of hyperparameter. In the case of SVR, a grid search cross-validation function was applied to appraise the sensitivity of diverse sets of model parameters. Grid search is an exhaustive method in which the training set is split into user-defined series numbers and independently validated against the user-defined range of model parameters. First, the identification of the most sensitive parameters was made individually. Three hyper parameters namely (regularization parameter (C); {default = 1}, Kernel; {linear, poly, rbf, sigmoid}, epsilon; {default = 0.1}) were found highly influential. Furthermore, selected parameters were given an interval range and incorporated into the grid search function with a cross-validation split equal to three. Models were scored using Pearson correlation. This process was repeated corresponding to each input combination (Pedregosa et al. 2011). While for ANN, the trial-and-error technique was used to select the optimum number of neurons and hidden layers for the given input combinations. For simplicity, two hidden layers were selected and the number of neurons in the hidden layers was increased consecutively till the performance reaches the maximum level. The epoch for the model was selected on the bases of the loss function when its gradient becomes nearly equal to 0. The highly correlated input configurations for each input combination were evaluated and ranked accordingly.

The results of the sensitivity analysis are given in Table 1. From all the iterations performed, only the highest-scoring configurations are mentioned in Table 1 corresponding to each input combination. The analysis suggests that the third number input combination and configuration setting performed best in the case of ANN. While the 14th number would well suit SVR modeling.

Table 1

Final model hyperparameter values with highest Pearson correlation after trial-and-error for each input combination

S no.Input combinationsConfigurationsScores (CC)Ranking
ANN 
(Input layer; neurons), (hidden layers; neurons), (output layer; neuron) 
P16 (1), (10,10), (1) 0.59 
P16, AP22 (2), (10,20), (1) 0.68 
3 P16, AP22, Tx17, Tn17 (4), (20,20), (1) 0.99 1 
AP22, Tx17, Tn17 (3), (20,15), (1) 0.95 
Tx17, Tn17 (2), (10,20), (1) 0.92 
AP22, Tx17 (2), (10,20), (1) 0.80 
AP22, Tn17 (2), (10,20), (1) 0.74 
SVR 
(Number of splits), (svr_c), (svr_epsilon), (svr_kernel) 
P16 (3), (150), (0.3), (rbf) 0.44 
P16, AP22 (3), (30), (0.3), (rbf) 0.50 
10 P16, AP22, Tx17, Tn17 (3), (20), (0.1), (poly) 0.78 
11 AP22, Tx17, Tn17 (3), (150), (0.3), (rbf) 0.88 
12 Tx17, Tn17 (3), (150), (0.3), (rbf) 0.86 
13 AP22, Tx17 (3), (120), (0.3), (rbf) 0.84 
14 AP22, Tn17 (3), (150), (0.2), (rbf) 0.90 1 
S no.Input combinationsConfigurationsScores (CC)Ranking
ANN 
(Input layer; neurons), (hidden layers; neurons), (output layer; neuron) 
P16 (1), (10,10), (1) 0.59 
P16, AP22 (2), (10,20), (1) 0.68 
3 P16, AP22, Tx17, Tn17 (4), (20,20), (1) 0.99 1 
AP22, Tx17, Tn17 (3), (20,15), (1) 0.95 
Tx17, Tn17 (2), (10,20), (1) 0.92 
AP22, Tx17 (2), (10,20), (1) 0.80 
AP22, Tn17 (2), (10,20), (1) 0.74 
SVR 
(Number of splits), (svr_c), (svr_epsilon), (svr_kernel) 
P16 (3), (150), (0.3), (rbf) 0.44 
P16, AP22 (3), (30), (0.3), (rbf) 0.50 
10 P16, AP22, Tx17, Tn17 (3), (20), (0.1), (poly) 0.78 
11 AP22, Tx17, Tn17 (3), (150), (0.3), (rbf) 0.88 
12 Tx17, Tn17 (3), (150), (0.3), (rbf) 0.86 
13 AP22, Tx17 (3), (120), (0.3), (rbf) 0.84 
14 AP22, Tn17 (3), (150), (0.2), (rbf) 0.90 1 

Best among input combinations is highlighted in bold.

The determined two combinations along with their optimized configuration were selected to train the model. Training and testing datasets were independently tested using MAE, RMSE, and CC statistical measures. Details of statistical parameters are provided in Section 3.3.

The results of the model are presented in Table 2. Both models successfully captured the synoptic flow trends; however, the ANN outperformed SVR and produced quality simulations in both the training and testing phases. For both models, Tn and AP are determined to be more critical predictands, this emphasises that physical AP is a more effective input than precipitation itself as it also ingests the routing response of the basin. On the other hand, Tn was found to be a better regulator than Tx, specifically regulating snow melting. Flow series analysis depicted in Figure 5 highlighted discrepancies in SVR simulation that seems to produce delayed or underestimated response. This might be indicative of temporal model bias and weak processing of the training set, which could be either due to insufficient training size of high values for SVR to process or a complicated relationship between input and target set. This is also confirmed by the dispersion of high values in a scatter plot. Nevertheless, as a whole, SVR entails a limited set of input variables and yields reasonable results. On the other hand, ANN harnesses a maximum number of inputs and produced outstanding results. It was successful in imitating observed flow conditions and showed a higher level of collinearity as depicted in Figure 5.
Table 2

Statistical evaluation of raw and bias-corrected ANN and SVR model simulations at training and testing epochs

No.Input combinationModelTraining
Testing
MAERMSECCMAERMSECC
Tn17, AP22 SVR 4.6 32.0 0.90 1.4 36.4 0.93 
P16, AP22, Tx17, Tn17 ANN 0.2 10.8 0.99 2.08 13.0 0.99 
Bias-corrected schemes 
Tn17, AP22 SVR-BC 0.0 33.3 0.92 −1.16 36.7 0.91 
P16, AP22, Tx17, Tn17 ANN-BC 0.0 4.27 0.99 1.25 3.69 0.99 
No.Input combinationModelTraining
Testing
MAERMSECCMAERMSECC
Tn17, AP22 SVR 4.6 32.0 0.90 1.4 36.4 0.93 
P16, AP22, Tx17, Tn17 ANN 0.2 10.8 0.99 2.08 13.0 0.99 
Bias-corrected schemes 
Tn17, AP22 SVR-BC 0.0 33.3 0.92 −1.16 36.7 0.91 
P16, AP22, Tx17, Tn17 ANN-BC 0.0 4.27 0.99 1.25 3.69 0.99 
Figure 6

Mean annual seasonal of the simulation model.

Figure 6

Mean annual seasonal of the simulation model.

Close modal

Furthermore, general model simulations inherently contained model biases which could adversely affect the model simulation. To remedy this, we employed a simple linear bias scaling technique the details of which are provided in section 3.4. The findings show that bias scaling considerably improved ANN model simulations both for training and testing sets. In contrast, SVR couldn't exhibit much improvement except slightly treating underestimations. Overall, the ANN simulation has lower variability of 71.15 compared to the SVR, SVR-BC and ANN-BC simulations, which are 86.25, 79.65 and 85.78, respectively.

Seasonal analysis

The performance of simulations was also tested on individual seasons. An analysis was carried out on four seasons, namely, DJF; (December, January, and February), MAM; (March, April, and May), JJA; (June, July, and August) and SON; (September, October, and November). The mean annual of each season was determined and plotted in Figure 6. During low flow conditions, the DJF model displayed an insensitive response corresponding to actual flow. As can be seen, all simulations underestimated the first year and overestimated he rest of the years. Poor results can be justified either due to the reason that in low flow seasons snow melting and accumulation occur simultaneously; therefore, the model struggles to adjudicate the flow conversion rate accurately or due to the reason that low flow magnitudes lie close to the extreme range of normal distribution. The latter can be further explained as the majority of training data lies in close proximity to the mean hence model finds easy to assimilate these quantities, while in the case of DJF, most flow magnitude lies far away from the mean of the training set. Nevertheless, both reasons highlight the constraints in assimilation and predictive biases by data-driven models. In DJF, SVR and SVR-BC performed relatively better in depicting the actual flow dynamics. From ANN-based simulation, ANN-BC averaged out the extremes and produced a stagnant but relatively better response. In the case of MAM and JJA, ANN and ANN-BC provide a precise imitation of flow magnitude and trends. While SVR exhibited a different degree of deviation. In SON, actual flow dynamics remained stable and, thus, all the models provided decent performance. In summary, all the simulations were found deficient under low flow conditions.
Figure 7

Flow duration curve (FDC) analysis of (a) SVR, (b) ANN, (c) SVR-BC, (d) ANN-BC.

Figure 7

Flow duration curve (FDC) analysis of (a) SVR, (b) ANN, (c) SVR-BC, (d) ANN-BC.

Close modal

Flow duration curve analysis

Flow duration curves (FDCs) possess high importance in the water industry especially in defining project time-based utilities (Castellarin et al. 2007). In this regard, overlaying accuracy of flow simulation becomes greater importance. Practically, FDCs inform discrepancies between flow at different percentile. For this analysis, percentiles were divided into low flow; 91–100%, Intermediate flow; 11–90% and high flow conditions; 0–10%. Actual flow dynamics suggest that up to 60% of the time (low and intermediate-low) flow condition remains almost unchanged. For a sample time period, simulations have shown slght exaggerations except bias-corrected models. Nevertheless, corresponding to the rest of the intermediate observation, simulations displayed little variations. At high flow conditions, ANN-based simulation overlay the observation adequately, while SVR exhibited underestimated quantities. However, improvement in high-flow prediction can be noticed after bias correction as shown in Figure 7.

In this study, we appraise the utility of popular machine learning algorithms SVR and ANN to predict 10-daily flow simulations at the Swat River, Pakistan using climate dataset, precipitation (P), and minimum and maximum temperature (Tn, Tx). The study follows comprehensive procedures for flow simulation enhancement such as time lag adjustment, sensitivity analysis, and bias correction. AP was calculated using raw precipitation, to give, in all four inputs that were used for flow prediction. Optimum time lag adjustment was carried out to account for the delayed response and for improving the associative relationship between flow and input climate variables. According to the analysis, P16, AP22, Tx17, and Tn17 were identified as highly correlated variables. Furthermore to get the most pragmatic configuration and combinations, seven input combinations were developed and tested for seeking the best hyperparameters for individual modeling schemes. In the case of SVR, a cross-validation grid search method is used, in which the performance of different combinations of hyperparameter is tested corresponding to each user-defined data partition. For ANN, a trial-and-error procedure is used to determine the suitable neuron and hidden layer combinations.

For training and testing, 80 and 20% data were kept reserved for modeling purposes, respectively. The results of ANN were able to produce a highly colinear response to observation in both training and testing epochs than that of SVR. To deal with simulation biases, linear bias scaling was applied to derived model simulations. The method was able to significantly improve model representation and minimized ANN's MAE and RMSE values from 0.2 and 10.8 to 0.0 and 4.27, respectively. While SVR linear bias correction technique yielded an enhancement in its CC value from 0.90 to 0.92 and reduced the MAE from 4.6 to 0.0. Although significant improvements were observed in overall flow visual improvements in peak flow were witnessed in the case of SVR. Seasonal Analysis was carried out to better demonstrate the flow conditions in four seasons (DJF, MAM, JJA, and SON). Model simulations were found deficient under low flow conditions except for bias-corrected simulations that were able to reduce some discrepancy. FDCs were developed to analyze water quantity at a certain exceedance probability of time. Flows were divided into three conditions, namely, low flow (91–100%), Intermediate flow (11–90%), and high flow (0–10%). The analysis reveals slight underestimation by ANN at low to intermediate-low flow conditions; however, this was greatly resolved after applying bias correction. The SVR exhibited underestimation at high flow conditions, but improvements could be seen in high flow prediction after bias correction.

The results show that machine-learning modeling could be a valuable tool in applied hydrology. However, their accuracies are subject to numerous factors such as data quality, quantity, robust combination and configuration of hyper-parameter and the type of scheme itself (SVR, ANN, etc.). This study also emphasized on the use of simple bias-correcting techniques for improving model simulations regardless of the modeling framework.

Although the methodology adopted in this study is appropriate and has been justified by peer-reviewed literature, it is important to acknowledge that there are still limitations that need to be addressed. It should be noted that while ANNs offer ease of use, they require an excessive amount of data to train properly, whereas physical-based models do not have such a limitation. Therefore, in situations where data are limited, physical-based models may be a more suitable choice. Additionally, the non-stationarity of hydro-climatology has become more evident in the wake of climate change, which presents another challenge. ANNs are weak in predicting values that are beyond the scope of their training samples, so it is crucial to train the model on longer records and utilize variables that correspond to extreme flow values. Furthermore, as stated earlier in the conclusion, streamflow reflects the effects of climate variables, which can impact predicted values, thereby affecting the validity of forecast models.’

Z.S., Sibtain.S. (S.S.), S.H., and P.M. conceptualized the whole article; S.S., Z.S., and F.K. rendered support in data curation; S.S. and P.M. rendered support in data curation; S.S. and Z.S. investigated the article; S.S. and Z.S. devised the methodology; Saqlain S., S.M.T., and S. S., brought the resources; S.S., S.H. and Z.S. wrote the original draft.

All relevant data are available from an online repository at https://data.mendeley.com/datasets/d4bvcf5n5k.

The authors declare there is no conflict.

Adnan
R. M.
,
Yuan
X.
,
Kisi
O.
&
Yuan
Y.
2017
Streamflow forecasting using artificial neural network and support vector machine models
.
American Academic Scientific Research Journal for Engineering, Technology, and Sciences
29
(
1
),
Article 1
.
Adnan
R. M.
,
Liang
Z.
,
Trajkovic
S.
,
Zounemat-Kermani
M.
,
Li
B.
&
Kisi
O.
2019
Daily streamflow prediction using optimally pruned extreme learning machine
.
Journal of Hydrology
577
,
123981
.
https://doi.org/10.1016/j.jhydrol.2019.123981
.
Ali
S.
&
Shahbaz
M.
2020
Streamflow forecasting by modeling the rainfall–streamflow relationship using artificial neural networks
.
Modeling Earth Systems and Environment
6
(
3
),
1645
1656
.
https://doi.org/10.1007/s40808-020-00780-3
.
Ballı
S.
&
Tarımer
İ.
2013
An application of artificial neural networks for prediction and comparison with statistical methods
.
Electronics and Electrical Engineering
19
(
2
),
101
105
.
https://doi.org/10.5755/j01.eee.19.2.3478
.
Castellarin
A.
,
Camorani
G.
&
Brath
A.
2007
Predicting annual and long-term flow-duration curves in ungauged basins
.
Advances in Water Resources
30
(
4
),
937
953
.
https://doi.org/10.1016/j.advwatres.2006.08.006
.
Chen
S. M.
,
Wang
Y. M.
&
Tsou
I.
2013
Using artificial neural network approach for modelling rainfall–runoff due to typhoon
.
Journal of Earth System Science
122
(
2
),
399
405
.
https://doi.org/10.1007/s12040-013-0289-8
.
Çimen
M.
&
Kisi
O.
2009
Comparison of two different data-driven techniques in modeling lake level fluctuations in Turkey
.
Journal of Hydrology
378
(
3–4
),
253
262
.
https://doi.org/10.1016/j.jhydrol.2009.09.029
.
Demirel
M. C.
,
Venancio
A.
&
Kahya
E.
2009
Flow forecast by SWAT model and ANN in Pracana basin, Portugal
.
Advances in Engineering Software
40
(
7
),
467
473
.
https://doi.org/10.1016/j.advengsoft.2008.08.002
.
Essam
Y.
,
Huang
Y. F.
,
Ng
J. L.
,
Birima
A. H.
,
Ahmed
A. N.
&
El-Shafie
A.
2022
Predicting streamflow in Peninsular Malaysia using support vector machine and deep learning algorithms
.
Scientific Reports
12
(
1
),
3883
.
https://doi.org/10.1038/s41598-022-07693-4
.
Guo
J.
,
Zhou
J.
,
Qin
H.
,
Zou
Q.
&
Li
Q.
2011
Monthly streamflow forecasting based on improved support vector machine model
.
Expert Systems with Applications
38
(
10
),
13073
13081
.
https://doi.org/10.1016/j.eswa.2011.04.114
.
Hadi
S. J.
&
Tombul
M.
2018
Forecasting daily streamflow for basins with different physical characteristics through data-driven methods
.
Water Resources Management
32
(
10
),
3405
3422
.
https://doi.org/10.1007/s11269-018-1998-1
.
Hassan
Z.
,
Shamsudin
S.
&
Harun
S.
2014
Minimum input variances for modelling rainfall-runoff using ANN
.
Jurnal Teknologi
69
(
3
).
https://doi.org/10.11113/jt.v69.3154.
Hsu
K.
,
Gupta
H. V.
&
Sorooshian
S.
1995
Artificial neural network modeling of the rainfall-runoff process
.
Water Resources Research
31
(
10
),
2517
2530
.
https://doi.org/10.1029/95WR01955
.
Karunanithi
N.
,
Grenney
W. J.
,
Whitley
D.
&
Bovee
K.
1994
Neural networks for river flow prediction
.
Journal of Computing in Civil Engineering
8
(
2
),
201
220
.
https://doi.org/10.1061/(ASCE)0887-3801(1994)8:2(201)
.
Kerh
T.
&
Lee
C. S.
2006
Neural networks forecasting of flood discharge at an unmeasured station using river upstream information
.
Advances in Engineering Software
37
(
8
),
533
543
.
https://doi.org/10.1016/j.advengsoft.2005.11.002
.
Khan
M. S.
&
Coulibaly
P.
2006
Application of support vector machine in lake water level prediction
.
Journal of Hydrologic Engineering
11
(
3
),
199
205
.
https://doi.org/10.1061/(ASCE)1084-0699(2006)11:3(199)
.
Maas
A. L.
,
Hannun
A. Y.
&
Ng
A. Y.
n.d.
Rectifier Nonlinearities Improve Neural Network Acoustic Models. 6.
Panagopoulos
A.
2021
Water-energy nexus: desalination technologies and renewable energy sources
.
Environmental Science and Pollution Research
28
(
17
),
21009
21022
.
https://doi.org/10.1007/s11356-021-13332-8
.
Panagopoulos
A.
2022
Brine management (saline water & wastewater effluents): sustainable utilization and resource recovery strategy through Minimal and Zero Liquid Discharge (MLD & ZLD) desalination systems
.
Chemical Engineering and Processing – Process Intensification
176
,
108944
.
https://doi.org/10.1016/j.cep.2022.108944
.
Panagopoulos
A.
&
Giannika
V.
2022
Decarbonized and circular brine management/valorization for water & valuable resource recovery via minimal/zero liquid discharge (MLD/ZLD) strategies
.
Journal of Environmental Management
324
,
116239
.
https://doi.org/10.1016/j.jenvman.2022.116239
.
Parisouj
P.
,
Mohebzadeh
H.
&
Lee
T.
2020
Employing machine learning algorithms for streamflow prediction: a case study of Four River basins with different climatic zones in the United States
.
Water Resources Management
34
(
13
),
4113
4131
.
https://doi.org/10.1007/s11269-020-02659-5
.
Pedregosa
F.
,
Varoquaux
G.
,
Gramfort
A.
,
Michel
V.
,
Thirion
B.
,
Grisel
O.
,
Blondel
M.
,
Prettenhofer
P.
,
Weiss
R.
,
Dubourg
V.
,
Vanderplas
J.
,
Passos
A.
,
Cournapeau
D.
,
Brucher
M.
,
Perrot
M.
&
Duchesnay
É.
2011
Scikit-learn: machine learning in python
.
Journal of Machine Learning Research
12
(
85
),
2825
2830
.
Sabir
M. A.
,
Rehman
S. S.
,
Umar
M.
,
Waseem
A.
,
Farooq
M.
,
Fariduulah
&
Irshad
M.
2014
Assessment of hydro power potential of Swat, Kohistan Himalayas: a solution for energy shortfall in region
.
Water Resources
41
(
5
),
612
618
.
https://doi.org/10.1134/S0097807814050091
.
Shah
S. A.
,
Shah
N. A.
,
Ullah
S.
,
Alam
M. M.
,
Badshah
H.
,
Ullah
S.
&
Mumtaz
A. S.
2016
Documenting the indigenous knowledge on medicinal flora from communities residing near Swat River (Suvastu) and in high mountainous areas in swat-Pakistan
.
Journal of Ethnopharmacology
182
,
67
79
.
https://doi.org/10.1016/j.jep.2016.02.008
.
Shrestha
M.
,
Acharya
S. C.
&
Shrestha
P. K.
2017
Bias correction of climate models for hydrological modelling – Are simple methods still useful?: are simple bias correction methods still useful?
Meteorological Applications
24
(
3
),
531
539
.
https://doi.org/10.1002/met.1655
.
Syed
Z.
,
Ahmad
S.
,
Dahri
Z. H.
,
Azmat
M.
,
Shoaib
M.
,
Inam
A.
,
Qamar
M. U.
,
Hussain
S. Z.
&
Ahmad
S.
2022
Hydroclimatology of the Chitral River in the Indus Basin under changing climate
.
Atmosphere
13
(
2
),
295
.
https://doi.org/10.3390/atmos13020295
.
Tongal
H.
&
Booij
M. J.
2018
Simulation and forecasting of streamflows using machine learning models coupled with base flow separation
.
Journal of Hydrology
564
,
266
282
.
https://doi.org/10.1016/j.jhydrol.2018.07.004
.
Usman
M.
,
Manzanas
R.
,
Ndehedehe
C. E.
,
Ahmad
B.
,
Adeyeri
O. E.
&
Dudzai
C.
2022
On the benefits of bias correction techniques for streamflow simulation in complex terrain catchments: a case-study for the Chitral River Basin in Pakistan
.
Hydrology
9
(
11
),
188
.
https://doi.org/10.3390/hydrology9110188
.
Wang
W.-C.
,
Chau
K.-W.
,
Cheng
C.-T.
&
Qiu
L.
2009
A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series
.
Journal of Hydrology
374
(
3–4
),
294
306
.
https://doi.org/10.1016/j.jhydrol.2009.06.019
.
Xu
B.
,
Wang
N.
,
Chen
T.
&
Li
M.
2015
Empirical evaluation of rectified activations in convolutional network
.
https://doi.org/10.48550/ARXIV.1505.00853
Zealand
C. M.
,
Burn
D. H.
&
Simonovic
S. P.
1999
Short term streamflow forecasting using artificial neural networks
.
Journal of Hydrology
214
(
1–4
),
32
48
.
https://doi.org/10.1016/S0022-1694(98)00242-X
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).