The current research work was carried out to simulate monthly streamflow historical record using Soil and Water Assessment Tool (SWAT) and Artificial Neural Network (ANN) at the Astore Basin, Gilgit-Baltistan, Pakistan. The performance of SWAT and ANN models was assessed during calibration (1985–2005) and validation (2006–2010) periods via statistical indicators such as coefficient of determination (R2), Nash–Sutcliffe efficiency (NSE), percent bias (PBIAS), and root-mean-square error (RMSE). R2, NSE, PBIAS, and RMSE values for SWAT (ANN with Architecture (2,27,1)) models during calibration are 0.80 (0.88), 0.73 (0.82), 15.7 (0.008), and 79.81 (70.34), respectively, while during validation, the corresponding values are 0.71 (0.86), 0.66 (0.95), 17.3 (0.10), and 106.26 (75.92). The results implied that the ANN model is superior to the SWAT model based on the statistical performance indicators. The SWAT results demonstrated an underestimation of the high flow and overestimation of the low flow. Comparatively, the ANN model performed very well in estimating the general and extreme flow conditions. The findings of this research highlighted its potential as a valuable tool for accurate streamflow forecasting and decision-making. The current study recommends that additional machine learning models may be compared with the SWAT model output to improve monthly streamflow predictions in the Astore Basin.

  • SWAT and ANN model performance was assessed via statistical performance indicators.

  • Compromise programming was used to rank ANN architectures.

  • ANN architecture (2,27,1) outperformed SWAT model.

  • SWAT model exhibits limitations in accurately estimating high and low flows.

  • ANN model excels in predicting both general and extreme flow scenarios.

The main purpose of the hydrological cycle is to convert rainfall into runoff (Jimeno-Sáez et al. 2018). This is a complicated process that involves a non-linear relationship between rainfall, runoff, and other hydrological processes as well as watershed features. Hydrometeorological processes (interception, evapotranspiration (ET), percolation, and infiltration) impacts surface runoff (Haruna & Abba 2021). These hydrological processes are also influenced by watershed attributes and weather patterns. Moreover, the spatiotemporal land-use dynamics and climate change produce a significant level of uncertainty (Niedda et al. 2014). Rainfall–runoff modeling is used for water resource planning and extreme events assessment (Jehanzaib et al. 2022). The hydraulic structures design and extreme events forecasting mainly depends upon the rainfall–runoff relationship (Rajurkar et al. 2004).

Hydrological models are developed using the fundamental principle of water cycle (Chen et al. 2012). The structure of the aforementioned models varies from simple lumped models to complicated process-based distributed models. There are numerous hydrological models namely SWAT, Systeme Hydrologique European (SHE) model, Institute of Hydrology Distributed Model (IHDM), Hydrological Simulation Program-FORTRAN (HSPF), and Variable Infiltration Capacity (VIC) which are widely used for hydrological simulations (Jehanzaib et al. 2022). Hydrological model selection is purely based on model simulation capabilities and data availability of the project area (Henriksen et al. 2003). SWAT is one of the most sophisticated semi-distributed hydrological model which assess streamflow at watershed scale (Grusson et al. 2017). SWAT requires a large amount of spatiotemporal hydrometeorological and terrestrial data to evaluate streamflow (Pang et al. 2020). The model parameterization and calibration process are complicated as it is based on large number of parameters and their complex interconnections (Noori & Kalin 2016). SWAT-CUP (SWAT Calibration and Uncertainty Procedures) is a program which excessively used to perform SWAT model calibration, validation, uncertainty analysis and sensitivity analysis (Abbaspour et al. 2015).

It is well-founded knowledge that the SWAT model is excessively used for streamflow assessment (Cibin et al. 2010; Grusson et al. 2017). Haleem et al. (2022) used SWAT to isolate the impacts of climate change and anthropogenic activities on streamflow in the Upper Indus Basin (UIB). Wisal et al. (2020) assessed the suitability of gridded precipitation data for streamflow forecasting using the SWAT model. Li et al. (2010) used the SWAT model to simulate streamflow and sediment load. Setegn et al. (2010) used the SWAT model to examine the impact of regional geography, land use, soil, and climate on hydrology. Tibebe & Bewket (2011) computed monthly streamflow and erosion rate in the Keleta region of Ethiopia using the SWAT model. In mountainous basins, SWAT is frequently employed to investigate how snow affects the various components of water cycle (Shahid et al. 2021).

The literature shows that data-driven models are widely used to simulate hydrological processes at global scale (Rajurkar et al. 2004; Wang et al. 2006). The aforementioned models are considered superior owing to their minimal input meteorological variables requirement, less computation time, and ability to deliver highly precise and accurate results. Artificial intelligence (AI) approaches like the Artificial Neural Network (ANN) have emerged as a viable alternative to traditional hydrological models. ANN's capacity to model non-linear interactions between the input and output parameters without the need for understanding the underlying physical processes is perhaps their strongest feature, making them more suitable than other methods (Noori & Kalin 2016). Systems-related data can be analyzed using empirical approaches like AI, soft computing (SC), data mining (DM), computational intelligence (CI), and machine learning without explicitly accounting for the physical behavior of the objective system (Solomatine et al. 2008).

Machine learning algorithms are excessively used in hydrological modeling (Solomatine et al. 2008; Khan et al. 2023). ANN-based models have been used for rainfall–runoff and sediment simulations (Tokar & Johnson 1999). ANN models have been used to predict floods and reservoir inflows (Lohani et al. 2012). Huang et al. (2014) used support vector machine (SVM) to simulate the river's monthly flow at Huaxi station, and the findings demonstrated that the model has a high level of accuracy in predicting the river's monthly flow. SVM models and ANN were used by Ghorbani et al. (2016) to estimate the daily flow of the Cypress River in Texas. Their findings suggested that the SVM model performs better than the ANN in terms of river flow forecasting and precision.

In the last decade, there have been few studies comparing SWAT with data-driven models. Pradhan et al. (2020) compared the accuracy of three ANN models to the SWAT model in predicting streamflow and found that ANN models results are more precise. Koycegiz & Buyukyildiz (2019) compared SWAT, support vector machines (SVM), and ANN in the headwaters of the Carsamba River, which are located in the Konya Closed Basin of Turkey where SWAT was outscored in streamflow simulation by the data-driven models (ANN and SVM). Jajarmizadeh et al. (2015) used SWAT and SVM models to assess monthly stream flow in southern Iran. The SVM results were superior to SWAT model. Daily flow in Portugal's Pracena basin was predicted using ANN and SWAT models by Demirel et al. (2009). According to the findings, the ANN model was more accurate in forecasting peak flow than the SWAT model.

In the current research work, monthly streamflow data is simulated using the ANN and SWAT models for the Astore Basin located at Indus River. Intercomparison of empirical and hydrological model has not been reported in the region. The gap was filled by the current study to assess the performance of ANN and SWAT model in simulating the streamflow. As Pakistan is a data-scarce country, this study will assist the water experts and policy-makers in gaining reliable alternative to data-intensive hydrological models in the stated region.

Study area

The UIB is located in the extreme North of Pakistan which lies between 33.67° and 37.20° N, and 70.50° and 77.50° E coordinates, which is mainly covered by snow and glaciers. The elevation of UIB ranges between 8,500 and 200 m above mean sea level. UIB's border is shared by China, India, and Afghanistan. Additionally, Pakistan and Afghanistan both share the Indus River Basin, where Pakistan cover the larger portion 32.48–37.07°N and 67.33–81.83°E (Haleem et al. 2022). UIB contributes to over half of Pakistan's surface water resources and is crucial to the country's long-term economic growth.

This study focuses on the upper region of the Indus River in Pakistan, specifically Astore (Gilgit-Baltistan), as depicted in Figure 1. The elevation of the stated basin ranges from 547 to 8,060 m with 45,400 km2 area. The Astore Basin, which accounts for 3% of all UIB glacial covering and 14% of all glacier and snow coverage, is located in the western Himalaya peaks (Hasson et al. 2014). There is only one climate station (Astore) that monitors the climate and precipitation in the Astore Basin, where the outflow of the basin is monitored at Doyian.
Figure 1

Study area map of the current research work.

Figure 1

Study area map of the current research work.

Close modal

Data collection

Precipitation and temperature (min and max) historical record from 1985 to 2010 were collected from Pakistan's Meteorological Department (PMD). Hydrological information, including monthly and mean flow, was collected from the Water and Power Development Authority (WAPDA). The 26-year span (1985–2010) was split into two phases: the calibration phase (1985–2005) and the validation phase (2006–2010). All monthly streamflow data were verified to be complete and free of any significant gaps. PMD and WAPDA follow the International Meteorological Organization's Guide on Hydrological Practices for monitoring hydrometeorological parameters (Rahman et al. 2022).

Digital Elevation Model (DEM), soil and land-use data were downloaded from the National Aeronautics and Space Administration (NASA) website (https://www.earthdata.nasa.gov/), Food and Agriculture Organization (FAO) (https://www.fao.org/) and Moderate Resolution Imaging Spectroradiometer (MODIS) (https://earthexplorer.usgs.gov/), respectively. Figure 2 demonstrates the details of soil and land-use classes and their percentage compositions for the specified watershed.
Figure 2

Details on soil and land-use classes, along with their percentage compositions.

Figure 2

Details on soil and land-use classes, along with their percentage compositions.

Close modal

Methods

SWAT model

SWAT is a process-based semi-distributed watershed hydrological model that was introduced by the United States Department of Agriculture – Agriculture Research Services (USDA-ARS) in 1990. It was designed to assess the impacts of meteorological and terrestrial determinants on streamflow (Wisal et al. 2020). In order to give precise spatial details and increase the model's accuracy, SWAT divides huge basin into smaller sub-basins. The SWAT model was developed to predict and simulate that how land use and agricultural practices will affect water resources in terms of water quantity and quality at the basin scale (Rahman et al. 2022). Furthermore, scenario-based simulations using the SWAT model were mainly used to explore the hydrological response to land-use dynamics and climate change (Yin et al. 2017).

Input data for the SWAT model

SWAT model analysis is mainly based on meteorological (precipitation and air temperature) and terrestrial (DEM, land use, and soil) parameters. The basin's boundaries and topographic data were extracted from the DEM. The SWAT model simulation is based on the water balance equation which is as follows:

SW0 and SWt denote the initial and final levels of soil moisture. Pday denotes daily precipitation, RSurf denotes the surface runoff, ETday denotes daily evapotranspiration, Wweep denotes water that has infiltrated into the ground, and Rgw denotes groundwater return flow. All of the variables listed above have ‘mm’ as their units.

Calibration and validation of the SWAT model

The Doyian monitoring station at the Astore Basin was chosen for model calibration to bring in front the most dominant parameters of the streamflow using SWAT-CUP which was automatically calibrated through Sequential Uncertainty Fitting program algorithm (SUFI-2) (Abbaspour et al. 2015). The SUFI-2 algorithm employs 11 distinct objective functions, including the R2, Modified Coefficient of Determination (b R2), NSE, Modified Nash–Sutcliffe Efficiency (MNS), Ratio of the Standard Deviation of Observations to the Root-Mean-Square-Error (RSR), Ranked Sum of Squares (SSQR), Kling–Gupta Efficiency (KGE), PBIAS, Multiplicative Form of the Square Error (MSE), Summation Form of the Square Error, and the χ2. These objective functions collectively provide a rigorous framework for model assessment and parameter calibration, enabling researchers to make informed decisions and refine models to accurately represent complex real-world processes. While a variety of objective functions are available for optimizing parameter ranges, our study focused on the assessment of a single objective function, the NSE. This decision was driven by NSE's inherent ability to effectively capture temporal dynamics, a critical aspect in hydrologic calibration. Moreover, NSE holds a prominent and well-established position as one of the most widely adopted statistical measures in the field (Gassman et al. 2007). By focusing on NSE, we aimed to ensure a comprehensive and precise evaluation of temporal calibration within the context of our study. Monthly streamflow data from 1985 to 2005 were used to calibrate the SWAT-CUP model. The main purpose of calibration is to determine goodness-of-fit index (GFI) between the measured and modeled data using R2, NSE, PBIAS, and RMSE statistical performance indicators. For R2 and NSE, the most favorable value is 1, whereas it is 0 for PBIAS and RMSE. After satisfactory calibration process, the model was tested using historical record ranging from 2006 to 2010.

Artificial Neural Networks

The ANN is a mathematical technique which is based on human nervous system. ANNs are non-linear parameterized models used in categorization and empirical regression modeling. In comparison to typical linear statistical models, they are more flexible and can find more deep relations in data. Neurons are the building blocks of ANNs which are arranged in layers and linked together by a number of different connections. The basic building block of a neural network is its layer-based arrangement of neurons which serve as information processing units. Neurons in one layer are linked with neurons of another layer where the strength of the linkage is assessed by the weight of the linkage between them. The ANN is divided into feed-forward, recurrent, and self-organizing networks based on the direction of flow of information and processing. The ANN is divided into single-layer and multilayer networks according to the number of neurons layers. Literature shows that multilayer feed-forward networks is widely used in hydrology (Shiau & Hsu 2016). Wang et al. (2006) used a multilayer perceptron ANN for hydrological modeling. The current study is conducted using feed-forward network with the back propagation approach. The ANN is composed of an input layer, one or more hidden layers, and output layer (Wang et al. 2015). The schematic diagram demonstrating interconnection between various layers of ANNs is shown by Figure 3.
Figure 3

Schematic diagram demonstrating interconnection between ANN various layers.

Figure 3

Schematic diagram demonstrating interconnection between ANN various layers.

Close modal
Input data are fed via input neurons layer to the network. Hidden layer neurons receive signals from the previous layer neurons and process the data. Based on the input data, output is produced in the output layer. The following equation describes the operation of a neuron:
where bj is a bias linked to neuron j, xi is an input of the vector of inputs i = 1, 2, n), f is an activation function, and wi is the weight attached to the connection link through which the input xi gets to current neuron j from a neuron in the preceding layer. y is the output of a neuron j.

ANN modeling technique

Typically, dataset is split into segments while modeling through ANN technique. The datasets used in this study were split into two phases: the training phase (80% of the data) and the testing phase (20% of the data). So, the 26-year (1985–2010) historical record was split into training (1985–2005) and testing (2006–2010) periods. In order to train the network, we applied a backpropagation approach which compares modeled values to observed data and then computes the network error. The output error was repeatedly recycled backwards through the network until the network's parameters were adjusted to desirable values (Javan et al. 2015). The training phase comes to an end when the computed error reduces to its minimum level in the validation datasets (Gholami et al. 2015). There are numerous backpropagation approaches for network training. It is well-founded knowledge that the Levenberg–Marquardt (LM) algorithm is superior to other algorithms in terms of higher performance (smaller estimated error) and faster convergence (when calculating epoch size) (Nayebi et al. 2006). Literature shows that streamflow is widely forecasted using the LM algorithm (Yaseen et al. 2015).

Compromise Programming

Compromise Programming (CP) is a method for combining several statistical measures (Iqbal et al. 2021). CP was used in this study to evaluate and rank different ANN architectures. The evaluation was based on four fundamental performance indicators: R2, NSE, PBIAS, and RMSE, which collectively gauge the ANN's effectiveness. By using the values from these measures, the researchers calculated a specialized distance measure called (LP) metric within the framework of CP (Shiru & Chung 2021). The LP metric is expressed as

The parameter m is equal to 1, and Wn represents the actual value of a statistical performance measure, whereas indicates the perfect value of the performance measure, obtained when model simulations fully match with observed data. The Lp metric is always positive. Lower Lp values are preferred since they imply higher model performance.

Calibration and validation of the SWAT model

SWAT is a hydrological physically based semi-distributed model which was initially calibrated which will reduce uncertainties in the modeled values. The 26-year historical record which ranges from 1985 to 2010 was split into calibration (1985–2005) and validation (2006–2010) periods at the Doyian monitoring station of the Astore Basin. The NSE was used as objective function during the calibration process. After model calibration, sensitivity analysis was carried out to bring in front the parameters which mainly impacts streamflow. The reader is referred to Abbaspour et al. (2015) for a detailed explanation of model calibration procedures. Table 1 illustrates the 28 most significant parameters chosen for model calibration.

Table 1

Sensitive parameters along with their fitted values and initial ranges

Parameter nameDescriptionFitted valveRanges
CN2.mgt SCS runoff curve number 78.83 (35, 98) 
CNOP.mgt SCS runoff curve number for moisture condition 29.05 (0, 100) 
ALPHA_BF.gw Baseflow alpha factor (days) 0.47 (0, 1) 
GW_DELAY.gw Groundwater delay (days) 401.21 (0, 500) 
GWQMN.gw Threshold depth of water in the shallow aquifer required for return flow to occur (mm) 105.21 (0, 5,000) 
SHALLST.gw Initial depth of water in the shallow aquifer (mm) 4,840.87 (0, 50,000) 
DEEPST.gw Initial depth of water in the deep aquifer (mm) 3,635.45 (0, 50,000) 
RCHRG_DP.gw Deep aquifer percolation fraction 0.38 (0, 1) 
SURLAG.hru Surface runoff lag time 20.22 (0.05, 24) 
SLSUBBSN.hru Average slope length 60.96 (10, 150) 
SLSOIL.hru Slope length for lateral subsurface flow 178.21 (0, 150) 
EPCO.hru Plant uptake compensation factor 0.56 (0, 1) 
ESCO.hru Soil evaporation compensation factor 0.98 (0, 1) 
LAT_TTIME.hru Lateral flow travel time 13.69 (0, 180) 
OV_N.hru Manning's n value for overland flow 9.44 (0.01, 30) 
SOL_K(..).sol Saturated hydraulic conductivity 908.04 (0, 2,000) 
SFTMP.bsn Snowfall temperature 10.74 (−20, 20) 
SMTMP.bsn Snow melt base temperature 15.70 (−20, 20) 
SMFMX.bsn Maximum melt rate for snow during year (occurs on summer solstice) 8.52 (0, 20) 
SMFMN.bsn Minimum melt rate for snow during the year (occurs on winter solstice) 5.08 (0, 20) 
SNOCOVMX.bsn Minimum snow water content that corresponds to 100% snow cover 367.1 (0, 500) 
SNO50COV.bsn Snow water equivalent that corresponds to 50% snow cover 0.40 (0, 1) 
ADJ_PKR.bsn Peak rate adjustment factor for sediment routing in the subbasin (tributary channels) 1.91 (0.5, 2) 
CH_K2.rte Effective hydraulic conductivity in main channel alluvium 52.56 (−0.01, 500) 
ALPHA_BNK.rte Baseflow alpha factor for bank storage 0.26 (0, 1) 
CH_N2.rte Manning's n value for the main channel 0.18 (−0.01, 0.3) 
TMPINC(..).sub Temperature adjustment 45.08 (0, 100) 
SNO_SUB.sub Initial snow water content 59.89 (0, 150) 
Parameter nameDescriptionFitted valveRanges
CN2.mgt SCS runoff curve number 78.83 (35, 98) 
CNOP.mgt SCS runoff curve number for moisture condition 29.05 (0, 100) 
ALPHA_BF.gw Baseflow alpha factor (days) 0.47 (0, 1) 
GW_DELAY.gw Groundwater delay (days) 401.21 (0, 500) 
GWQMN.gw Threshold depth of water in the shallow aquifer required for return flow to occur (mm) 105.21 (0, 5,000) 
SHALLST.gw Initial depth of water in the shallow aquifer (mm) 4,840.87 (0, 50,000) 
DEEPST.gw Initial depth of water in the deep aquifer (mm) 3,635.45 (0, 50,000) 
RCHRG_DP.gw Deep aquifer percolation fraction 0.38 (0, 1) 
SURLAG.hru Surface runoff lag time 20.22 (0.05, 24) 
SLSUBBSN.hru Average slope length 60.96 (10, 150) 
SLSOIL.hru Slope length for lateral subsurface flow 178.21 (0, 150) 
EPCO.hru Plant uptake compensation factor 0.56 (0, 1) 
ESCO.hru Soil evaporation compensation factor 0.98 (0, 1) 
LAT_TTIME.hru Lateral flow travel time 13.69 (0, 180) 
OV_N.hru Manning's n value for overland flow 9.44 (0.01, 30) 
SOL_K(..).sol Saturated hydraulic conductivity 908.04 (0, 2,000) 
SFTMP.bsn Snowfall temperature 10.74 (−20, 20) 
SMTMP.bsn Snow melt base temperature 15.70 (−20, 20) 
SMFMX.bsn Maximum melt rate for snow during year (occurs on summer solstice) 8.52 (0, 20) 
SMFMN.bsn Minimum melt rate for snow during the year (occurs on winter solstice) 5.08 (0, 20) 
SNOCOVMX.bsn Minimum snow water content that corresponds to 100% snow cover 367.1 (0, 500) 
SNO50COV.bsn Snow water equivalent that corresponds to 50% snow cover 0.40 (0, 1) 
ADJ_PKR.bsn Peak rate adjustment factor for sediment routing in the subbasin (tributary channels) 1.91 (0.5, 2) 
CH_K2.rte Effective hydraulic conductivity in main channel alluvium 52.56 (−0.01, 500) 
ALPHA_BNK.rte Baseflow alpha factor for bank storage 0.26 (0, 1) 
CH_N2.rte Manning's n value for the main channel 0.18 (−0.01, 0.3) 
TMPINC(..).sub Temperature adjustment 45.08 (0, 100) 
SNO_SUB.sub Initial snow water content 59.89 (0, 150) 

As demonstrated by Table 2, four statistical performance indicators like R2, NSE, PBIAS, and RMSE were used to evaluate that how well the model performed in forecasting the watershed conditions. In this study, the performance of the SWAT model was assessed using four statistical indicators, i.e., R2, NSE, PBIAS, and RMSE. The R2, NSE, PBIAS, RMSE value for both calibration and validation periods were (0.80 and 0.71), (0.73 and 0.66), (15.7 and 17.3), and (79.81 and 106.26), respectively.

Table 2

Statistical indicators for calibration and validation periods

Statistical indicators
Calibration (1985–2005)Validation (2006–2010)
R2 NSE PBIAS RMSE R2 NSE PBIAS RMSE 
0.80 0.73 15.7 79.81 0.71 0.66 17.3 106.26 
Statistical indicators
Calibration (1985–2005)Validation (2006–2010)
R2 NSE PBIAS RMSE R2 NSE PBIAS RMSE 
0.80 0.73 15.7 79.81 0.71 0.66 17.3 106.26 

As obvious from Figure 4, the SWAT model successfully tracked the observed streamflow for the relevant time periods. However, some peaks were under-estimated during calibration but less so during the validation period. Moreover, the SWAT model under-estimated the peak flow and over-estimated the low-flow conditions. In addition, the SWAT model precisely estimated flow for general conditions. In conclusion, the SWAT model performed well in the understudy of the Astore Basin.
Figure 4

Fluctuations in monthly streamflow during the calibration and validation periods.

Figure 4

Fluctuations in monthly streamflow during the calibration and validation periods.

Close modal

The ANN model's training and testing

Data-driven models are widely used for hydrological modeling as it produces precise results. Trial and error technique was used to choose the final ANN structure. The process of nodes addition was continued until further addition of nodes has no effect on the ANN performance. Moreover, it is also assured that the selected network structure has lower MSE and minimum level of complexity. The 26-year historical record which ranges from 1985 to 2010 was split into calibration (1985–2005) and validation (2006–2010) periods at the Doyian monitoring station of the Astore Basin. 80% observed data werer used for model calibration and 20% for model validation. ANN architecture was selected based on model performance indicators, i.e., R2, NSE, PBIAS, RMSE. These indicators are commonly used in various fields to assess the performance of models and compare different configurations. The best ANN architecture, along with how well it performs for each scenario that has been trained and tested for the study basin, is demonstrated in Table 3. The ANN architecture (2,27,1) was selected owing to highest R2 and NSE values, and lowest RMSE value for both calibration and validation periods at the Astore Basin. NSE and PBIAS for the aforementioned architecture fall within very good and good range, respectively. The aforementioned statistical performance indicators are widely used to assess the performance of hydrological and data-driven models for futuristic streamflow projections (Ali & Shahbaz 2020; Haruna & Abba 2021; Rahman et al. 2022). The simulated streamflow of ANN model is very close to the observed streamflow for both general and extreme flow conditions during calibration and validation periods as demonstrated by Figure 5. SWAT model underestimated the peak flow. This demonstrates the usefulness of the ANN model and its effectiveness in modeling streamflow.
Table 3

Statistical performance indicators of various ANN architectures

Statistical indicators
InputOutputTraining (1985–2005)
Testing (2006–2010)
Precipitation and temperatureStreamflowArchitectureR2NSEPBIASRMSER2NSEPBIASRMSE
  2,1,1 0.83 0.74 −0.039 84.14 0.78 0.93 −0.002 90.32 
  2,4,1 0.81 0.73 −0.109 85.72 0.87 0.93 −0.076 94.76 
  2,5,1 0.85 0.76 −0.035 81.45 0.76 0.93 0.016 92.34 
  2,7,1 0.84 0.77 −0.030 80.32 0.79 0.93 0.021 91.52 
  2,10,1 0.86 0.73 −0.037 85.82 0.68 0.94 −0.417 82.94 
  2,11,1 0.87 0.78 0.003 78.62 0.72 0.93 −0.110 88.95 
  2,13,1 0.85 0.77 0.048 79.01 0.84 0.94 0.047 83.75 
  2,19,1 0.89 0.77 0.021 79.55 0.65 0.96 0.075 71.60 
  2,27,1 0.88 0.82 0.008 70.34 0.86 0.95 0.100 75.92 
  2,28,1 0.88 0.77 0.001 79.46 0.72 0.94 −0.077 82.84 
Ideal valves 0.89 0.82 0.001 70.34 0.87 0.96 0.002 71.60 
Statistical indicators
InputOutputTraining (1985–2005)
Testing (2006–2010)
Precipitation and temperatureStreamflowArchitectureR2NSEPBIASRMSER2NSEPBIASRMSE
  2,1,1 0.83 0.74 −0.039 84.14 0.78 0.93 −0.002 90.32 
  2,4,1 0.81 0.73 −0.109 85.72 0.87 0.93 −0.076 94.76 
  2,5,1 0.85 0.76 −0.035 81.45 0.76 0.93 0.016 92.34 
  2,7,1 0.84 0.77 −0.030 80.32 0.79 0.93 0.021 91.52 
  2,10,1 0.86 0.73 −0.037 85.82 0.68 0.94 −0.417 82.94 
  2,11,1 0.87 0.78 0.003 78.62 0.72 0.93 −0.110 88.95 
  2,13,1 0.85 0.77 0.048 79.01 0.84 0.94 0.047 83.75 
  2,19,1 0.89 0.77 0.021 79.55 0.65 0.96 0.075 71.60 
  2,27,1 0.88 0.82 0.008 70.34 0.86 0.95 0.100 75.92 
  2,28,1 0.88 0.77 0.001 79.46 0.72 0.94 −0.077 82.84 
Ideal valves 0.89 0.82 0.001 70.34 0.87 0.96 0.002 71.60 
Figure 5

Fluctuations in monthly streamflow during the training and testing periods.

Figure 5

Fluctuations in monthly streamflow during the training and testing periods.

Close modal

CP-based ANN architecture ranking

Performance metrics have been calculated for all ANN architecture, and the resulting rankings obtained through the CP approach are shown in Table 4. According to their respective rankings, the top three architectures are (2,27,1), (2,19,1), and (2,28,1). Using the CP method, the architecture with the lowest ranking was (2,4,1). These findings highlight how well the CP technique evaluates and ranks ANN architectures, providing important information for our research's decision-making.

Table 4

Performance metrics and ranking of ANN architectures using the CP method

ArchitectureDifference in metrics and ideal values
SumRank
Training (1985–2005)
Testing (2006–2010)
R2NSEPBIASRMSER2NSEPBIASRMSE
2,1,1 0.06 0.08 0.038 13.80 0.09 0.03 0.000 18.72 32.81 
2,4,1 0.08 0.09 0.108 15.39 0.03 0.074 23.16 38.93 10 
2,5,1 0.04 0.06 0.034 11.12 0.11 0.03 0.014 20.74 32.14 
2,7,1 0.05 0.05 0.029 9.98 0.08 0.03 0.019 19.91 30.15 
2,10,1 0.03 0.09 0.036 15.48 0.19 0.01 0.415 11.34 27.59 
2,11,1 0.02 0.04 0.002 8.29 0.15 0.02 0.108 17.35 25.99 
2,13,1 0.04 0.05 0.047 8.68 0.03 0.02 0.045 12.14 21.04 
2,19,1 0.05 0.020 9.21 0.22 0.00 0.073 0.00 9.57 
2,27,1 0.01 0.00 0.007 0.00 0.01 0.01 0.098 4.32 4.45 1 
2,28,1 0.01 0.05 0.000 9.12 0.15 0.01 0.075 11.24 20.65 
ArchitectureDifference in metrics and ideal values
SumRank
Training (1985–2005)
Testing (2006–2010)
R2NSEPBIASRMSER2NSEPBIASRMSE
2,1,1 0.06 0.08 0.038 13.80 0.09 0.03 0.000 18.72 32.81 
2,4,1 0.08 0.09 0.108 15.39 0.03 0.074 23.16 38.93 10 
2,5,1 0.04 0.06 0.034 11.12 0.11 0.03 0.014 20.74 32.14 
2,7,1 0.05 0.05 0.029 9.98 0.08 0.03 0.019 19.91 30.15 
2,10,1 0.03 0.09 0.036 15.48 0.19 0.01 0.415 11.34 27.59 
2,11,1 0.02 0.04 0.002 8.29 0.15 0.02 0.108 17.35 25.99 
2,13,1 0.04 0.05 0.047 8.68 0.03 0.02 0.045 12.14 21.04 
2,19,1 0.05 0.020 9.21 0.22 0.00 0.073 0.00 9.57 
2,27,1 0.01 0.00 0.007 0.00 0.01 0.01 0.098 4.32 4.45 1 
2,28,1 0.01 0.05 0.000 9.12 0.15 0.01 0.075 11.24 20.65 

SWAT and ANN comparison

Data-driven techniques are extensively used for hydrological modeling. In this study, SWAT and ANN models were used to simulate streamflow at the Astore Basin. The best ANN architecture (2,27,1)-simulated streamflow was compared with the SWAT model. The SWAT and ANN models results were compared through statistical performance indicators. Table 5 and Figure 6 clearly demonstrated that the ANN model outscored the SWAT model for streamflow modeling in the Astore Basin in both calibration and validation periods. The R2 and NSE for SWAT and ANN models were 0.80 and 0.73, and 0.88 and 0.82, respectively, for the calibration period. Similarly, ANN outperformed the SWAT model in the validation period with R2 and NSE of 0.86 and 0.95, compared to 0.71 and 0.66 for the SWAT model. The R2 values of the ANN model were superior to the SWAT model during both periods. The PBIAS values for all scenarios were less than 25%, which categorized them as very good.
Table 5

Statistical indicators for the performance evaluation of SWAT and ANN models

Statistical indicators
Calibration (1985–2005)
Validation (2006–2010)
ModelR2NSEPBIASRMSER2NSEPBIASRMSE
SWAT 0.80 0.73 15.7 79.81 0.71 0.66 17.3 106.26 
ANN 0.88 0.82 0.008 70.34 0.86 0.95 0.100 75.92 
Statistical indicators
Calibration (1985–2005)
Validation (2006–2010)
ModelR2NSEPBIASRMSER2NSEPBIASRMSE
SWAT 0.80 0.73 15.7 79.81 0.71 0.66 17.3 106.26 
ANN 0.88 0.82 0.008 70.34 0.86 0.95 0.100 75.92 
Figure 6

Intercomparison of SWAT and ANN models for model calibration and validation.

Figure 6

Intercomparison of SWAT and ANN models for model calibration and validation.

Close modal
Both models performed well in tracking the observed streamflow based on GFI. Figure 6 clearly demonstrated that in this research work, the ANN model performs superior to the SWAT model in estimating high-flow events. In addition, the observed and modeled data distribution is demonstrated by the box plots as shown in Figure 7 for both calibration and testing periods. As per the data demonstrated by the box plots, the SWAT model under-estimated the high flow while over-estimated the low flow. Comparatively, the ANN model performed well in estimating the general and extreme flow conditions in the Astore Basin.
Figure 7

Distribution of observed and modeled streamflow data for the calibration and validation periods.

Figure 7

Distribution of observed and modeled streamflow data for the calibration and validation periods.

Close modal

Accurate hydrological process simulation plays vital role in sustainable water resources planning and management. Data-driven and hydrological models are usually used for hydrological processes simulation. In this research work, ANN technique overweighs the SWAT model in predicting the streamflow in general and extreme conditions. The results of the current study regarding underestimation of the peak flow prediction by the SWAT model is supported by the literature (Demirel et al. 2009; Makwana et al. 2017). Streamflow generation processes vary significantly during low-, medium-, and high-flow periods. Low-flow events are primarily caused by base flow, whereas high-flow events are caused by intense storm rainfall (Wu et al. 2009). SWAT models are good at estimating low flows but struggle to simulate very high streamflow accurately. In contrast, a single ANN can achieve better results for very high values but may have limited accuracy in predicting the lowest values, as demonstrated by Kim et al. (2015). As a result, these models are well-suited for simulating basin streamflow. An ANN model is recommended to simulate high-flow events in studies involving extreme hydrologic events such as floods. The SWAT model, on the other hand, would be preferable for hydrological management studies where low-flow events are of greater interest.

The SWAT model weak performance may be due to the following reasons: (1) The identification of the most sensitive factors may affect the SWAT model performance (Cibin et al. 2010), (2) The identified critical snow-specific characteristics may also cause uncertainty in the model output, (3) Each sub-basin's flow accumulate within one another before reaching the Indus River, making it hard to precisely track the hydropeak of the streamflow. The formulation of SWAT is considered responsible for the observed peak-flow inefficiency, as indicated in various studies (Demirel et al. 2009; Jimeno-Sáez et al. 2018). Therefore, in terms of simulating streamflow, the SWAT model's performance is notably weaker in comparison to the ANN model.

The results of this study suggest that using ANN models can be beneficial in reducing errors when estimating high streamflow values, although there is still a tendency to underestimate them. This problem arises because high-flow values are scarce in training datasets, while medium and low values predominate, as shown in Figure 6. Similar challenges have been reported in the research of Talebizadeh et al. (2010) and Jimeno-Sáez et al. (2018). There are few drawbacks of ANN technique which are discussed below: ANN technique has no relationship with the physical characteristics of the watershed. ANN approach is based on the lumped method which ignores the heterogeneity of sub-catchment parameters. Moreover, empirical models which include ANN technique cannot take into account watershed dynamics like soil moisture, runoff generation, pollutants export, etc. The SWAT model is a process-based model which is data-intensive requiring many input parameters for its analysis like DEM, land use/cover, soil type, hydrometeorological data, etc. Unfortunately, the aforementioned parameters are not easily available everywhere due to poor station distributions, socioeconomic and political difficulties, and limited information exchange between governments in transboundary basins. Machine learning models, like ANN, are helpful in such conditions because they do not require a lot of input parameters. This study demonstrates that ANN outperforms the SWAT model despite requiring fewer resources and input parameters. This promising efficiency opens up possibilities for future developments, such as using the ANN approach to simulate water quality processes, as seen in some related studies (Kuo et al. 2007; Huo et al. 2013).

The current research work was carried out to simulate monthly streamflow data using ANN and SWAT models at the Astore Basin, Pakistan. The results of both models were compared via statistical performance indicators to assess its modeling capabilities. The best ANN architecture (2,27,1) simulated streamflow was compared with the SWAT model. For the SWAT model, the R2 values were 0.80 and 0.71, NSE values were 0.73 and 0.66, PBIAS values were 15.7 and 17.3, and RMSE values were 79.81 and 106.26 for calibration and validation, respectively. On the other hand, for the ANN model with architecture (2,27,1), the R2 values were 0.88 and 0.86, NSE values were 0.82 and 0.95, PBIAS values were 0.008 and 0.10, and RMSE values were 70.34 and 75.92 for calibration and validation, respectively. The results suggested that the ANN model is superior to the SWAT model for streamflow prediction. The box plot analysis further demonstrated that the SWAT model under-estimated the high flow while over-estimated the low flow. Comparatively, the ANN model performed well in estimating the general and extreme flow conditions. The findings of the study can be utilized in formulating improved water policies which will help the water managers in suggesting flood adaptation strategies, water allocation for various sectors and climate resilient water infrastructure design in the region. The impacts of additional meteorological variables are recommended to enhance the accuracy of ANN model. Despite the excellent performance of the ANN model, it is essential to acknowledge potential limitations or uncertainties associated with the modeling approaches. Therefore, further studies could explore the potential of other machine learning-based models and compare their results with the SWAT model in the Astore Basin. Such a comparative analysis would provide a comprehensive understanding of different modeling techniques and support the adoption of advanced and accurate methods in water resource management in the region. Overall, the findings emphasize the superiority of the ANN model and its potential to contribute to better water resource management decisions in the Astore Basin.

The Indus River plays a vital role in Pakistan's economic growth and food production as it contributes approximately 25% to the country's gross domestic product and provides 90% water for food production. As per the World Bank's report (2020–2021), 32% projected decrease in water by 2025 will cause approximately 70 million tons of food shortage in the country. River water allocation has always been a big problem among provinces. The projected decline in water will worsen the situation. In this regard, the findings of this research hold significant implications for policy and water resource management. The superior performance of the ANN model compared to the SWAT model in simulating monthly streamflow data during both calibration and validation periods highlights the potential of machine learning-based models in formulating cohesive inter-provincial National Water Policy. Utilizing ANN models for streamflow forecasting can lead to improved water allocation among provinces, more effective flood management strategies, and better planning for water-related infrastructure development. Additionally, the suggestion to investigate the effects of additional meteorological variables opens up opportunities for further enhancing the accuracy and comprehensiveness of the ANN model's predictions.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Abbaspour
K. C.
,
Rouholahnejad
E.
,
Vaghefi
S.
,
Srinivasan
R.
,
Yang
H.
&
Kløve
B.
2015
A continental-scale hydrology and water quality model for Europe: calibration and uncertainty of a high-resolution large-scale SWAT model
.
Journal of Hydrology
524
,
733
752
.
Cibin
R.
,
Sudheer
K.
&
Chaubey
I.
2010
Sensitivity and identifiability of stream flow generation parameters of the SWAT model
.
Hydrological Processes: An International Journal
24
(
9
),
1133
1148
.
Demirel
M. C.
,
Venancio
A.
&
Kahya
E.
2009
Flow forecast by SWAT model and ANN in Pracana basin, Portugal
.
Advances in Engineering Software
40
(
7
),
467
473
.
Gassman
P. W.
,
Reyes
M. R.
,
Green
C. H.
&
Arnold
J. G.
2007
The soil and water assessment tool: historical development, applications, and future research directions
.
Transactions of the ASABE
50
(
4
),
1211
1250
.
Gholami
V.
,
Chau
K. W.
,
Fadaee
F.
,
Torkaman
J.
&
Ghaffari
A.
2015
Modeling of groundwater level fluctuations using dendrochronology in alluvial aquifers
.
Journal of Hydrology
529
,
1060
1069
.
Ghorbani
M. A.
,
Khatibi
R.
,
Goel
A.
,
FazeliFard
M. H.
&
Azani
A.
2016
Modeling river discharge time series using support vector machine and artificial neural networks
.
Environmental Earth Sciences
75
,
1
13
.
Grusson
Y.
,
Anctil
F.
,
Sauvage
S.
&
Sánchez Pérez
J. M.
2017
Testing the SWAT model with gridded weather data of different spatial resolutions
.
Water
9
(
1
),
54
.
Haleem
K.
,
Khan
A. U.
,
Ahmad
S.
,
Khan
M.
,
Khan
F. A.
,
Khan
W.
&
Khan
J.
2022
Hydrological impacts of climate and land-use change on flow regime variations in upper Indus Basin
.
Journal of Water Climate Change
13
(
2
),
758
770
.
Hasson
S.
,
Lucarini
V.
,
Khan
M. R.
,
Petitta
M.
,
Bolch
T.
&
Gioli
G.
2014
Early 21st century snow cover state over the western river basins of the Indus River system
.
Hydrology Earth System Sciences
18
(
10
),
4077
4100
.
Henriksen
H. J.
,
Troldborg
L.
,
Nyegaard
P.
,
Sonnenborg
T. O.
,
Refsgaard
J. C.
&
Madsen
B.
2003
Methodology for construction, calibration and validation of a national hydrological model for Denmark
.
Journal of Hydrology
280
(
1–4
),
52
71
.
Huang
S.
,
Chang
J.
,
Huang
Q.
&
Chen
Y.
2014
Monthly streamflow prediction using modified EMD-based support vector machine
.
Journal of Hydrology
511
,
764
775
.
Huo
S.
,
He
Z.
,
Su
J.
,
Xi
B.
&
Zhu
C.
2013
Using artificial neural network models for eutrophication prediction
.
Procedia Environmental Sciences
18
,
310
316
.
Iqbal
Z.
,
Shahid
S.
,
Ahmed
K.
,
Ismail
T.
,
Ziarh
G. F.
,
Chung
E.-S.
&
Wang
X.
2021
Evaluation of CMIP6 GCM rainfall in mainland Southeast Asia
.
Atmospheric Research
254
,
105525
.
Jajarmizadeh
M.
,
Kakaei Lafdani
E.
,
Harun
S.
&
Ahmadi
A.
2015
Application of SVM and SWAT models for monthly streamflow prediction, a case study in South of Iran
.
Journal of Civil Engineering
19
,
345
357
.
Javan
K.
,
Lialestani
M. R. F. H.
&
Nejadhossein
M.
2015
A comparison of ANN and HSPF models for runoff simulation in Gharehsoo River watershed
.
Iran. Modeling Earth Systems Environment
1
,
1
13
.
Jimeno-Sáez
P.
,
Senent-Aparicio
J.
,
Pérez-Sánchez
J.
&
Pulido-Velazquez
D.
2018
A comparison of SWAT and ANN models for daily runoff simulation in different climatic zones of peninsular Spain
.
Water
10
(
2
),
192
.
Khan
M.
,
Khan
A. U.
,
Khan
J.
,
Khan
S.
,
Haleem
K.
&
Khan
F. A.
2023
Streamflow forecasting for the Hunza river basin using ANN, RNN, and ANFIS models
.
Water Practice Technology
18
(
5
),
981
993
.
Kim
M.
,
Baek
S.
,
Ligaray
M.
,
Pyo
J.
,
Park
M.
&
Cho
K. H.
2015
Comparative studies of different imputation methods for recovering streamflow observation
.
Water
7
(
12
),
6847
6860
.
Kuo
J.-T.
,
Hsieh
M.-H.
,
Lung
W.-S.
&
She
N.
2007
Using artificial neural network for reservoir eutrophication prediction
.
Ecological Modelling
200
(
1–2
),
171
177
.
Nayebi
M.
,
Khalili
D.
,
Amin
S.
&
Zand-Parsa
S.
2006
Daily stream flow prediction capability of artificial neural networks as influenced by minimum air temperature data
.
Biosystems Engineering
95
(
4
),
557
567
.
Pang
J.
,
Zhang
H.
,
Xu
Q.
,
Wang
Y.
,
Wang
Y.
,
Zhang
O.
&
Hao
J.
2020
Hydrological evaluation of open-access precipitation data using SWAT at multiple temporal and spatial scales
.
Hydrology Earth System Sciences
24
(
7
),
3603
3626
.
Rahman
K. U.
,
Pham
Q. B.
,
Jadoon
K. Z.
,
Shahid
M.
,
Kushwaha
D. P.
,
Duan
Z.
,
Mohammadi
B.
,
Khedher
K. M.
&
Anh
D. T.
2022
Comparison of machine learning and process-based SWAT model in simulating streamflow in the Upper Indus Basin
.
Applied Water Science
12
(
8
),
178
.
Rajurkar
M.
,
Kothyari
U.
&
Chaube
U.
2004
Modeling of the daily rainfall-runoff relationship with artificial neural network
.
Journal of Hydrology
285
(
1–4
),
96
113
.
Setegn
S. G.
,
Srinivasan
R.
,
Melesse
A. M.
&
Dargahi
B.
2010
SWAT model application and prediction uncertainty analysis in the Lake Tana Basin, Ethiopia
.
Hydrological Processes: An International Journal
24
(
3
),
357
367
.
Shahid
M.
,
Rahman
K. U.
,
Haider
S.
,
Gabriel
H. F.
,
Khan
A. J.
,
Pham
Q. B.
,
Pande
C. B.
,
Linh
N. T. T.
&
Anh
D. T.
2021
Quantitative assessment of regional land use and climate change impact on runoff across Gilgit watershed
.
Environmental Earth Sciences
80
,
1
18
.
Shiru
M. S.
&
Chung
E.-S.
2021
Performance evaluation of CMIP6 global climate models for selecting models for climate projection over Nigeria
.
Theoretical Applied Climatology
146
(
1–2
),
599
615
.
Solomatine
D.
,
See
L. M.
&
Abrahart
R.
2008
Data-driven modelling: concepts, approaches and experiences
.
Practical hydroinformatics
68,
17
30
.
Talebizadeh
M.
,
Morid
S.
,
Ayyoubzadeh
S. A.
&
Ghasemzadeh
M.
2010
Uncertainty analysis in sediment load modeling using ANN and SWAT model
.
Water Resources Management
24
,
1747
1761
.
Tokar
A. S.
&
Johnson
P. A.
1999
Rainfall-runoff modeling using artificial neural networks
.
Journal of Hydrologic Engineering
4
(
3
),
232
239
.
Wang
W.
,
Van Gelder
P. H.
,
Vrijling
J.
&
Ma
J.
2006
Forecasting daily streamflow using hybrid ANN models
.
Journal of Hydrology
324
(
1–4
),
383
399
.
Wang
Y.
,
Guo
S.
,
Xiong
L.
,
Liu
P.
&
Liu
D.
2015
Daily runoff forecasting model based on ANN and data preprocessing techniques
.
Water
7
(
8
),
4144
4160
.
Wu, C., Chau, K. W. & Li, Y. S. 2009 Predicting monthly streamflow using data‐driven models coupled with data‐preprocessing techniques. Water Resources Research 45 (8), 111–116
.
Yaseen
Z. M.
,
El-Shafie
A.
,
Jaafar
O.
,
Afan
H. A.
&
Sayl
K. N.
2015
Artificial intelligence based models for stream-flow forecasting: 2000–2015
.
Journal of Hydrology
530
,
829
844
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).