Rainfall–runoff (R–R) analysis is essential for sustainable water resource management. In the present study focusing on the Peddavagu River Basin, various modelling approaches were explored, including the widely used Soil and Water Assessment Tool (SWAT) model, as well as seven artificial intelligence (AI) models. The AI models consisted of seven data-driven models, namely support vector regression, artificial neural network, multiple linear regression, Extreme Gradient Boosting (XGBoost) regression, k-nearest neighbour regression, and random forest regression, along with one deep learning model called long short-term memory (LSTM). To evaluate the performance of these models, a calibration period from 1990 to 2005 and a validation period from 2006 to 2010 were considered. The evaluation metrics used were R2 (coefficient of determination) and NSE (Nash–Sutcliffe Efficiency). The study's findings revealed that all eight models yielded generally acceptable results for modelling the R–R process in the Peddavagu River Basin. Specifically, the LSTM demonstrated very good performance in simulating R–R during both the calibration period (R2 is 0.88 and NSE is 0.88) and the validation period (R2 is 0.88 and NSE is 0.85). In conclusion, the study highlighted the growing trend of adopting AI techniques, particularly the LSTM model, for R–R analysis.

  • The study used SWAT and seven AI models for the Peddavagu River Basin.

  • LSTM performed well in simulating R–R during calibration (R2 is 0.88 and NSE is 0.88) and validation (R2 is 0.88 and NSE is 0.85).

  • These models are valuable for sustainable water management in the Peddavagu River Basin.

CWC

Central Water Commission

SWAT

Soil and Water Assessment Tool

CN2

Initial SCS CN II value

GW_DELAY

groundwater delay (days)

IMD

Indian Meteorological Department

ALPHA_BF

baseflow alpha factor (days)

SW

sub-watershed

SURLAG

surface runoff lag time in the HRU (days)

R2

coefficient of determination

GWQMN

threshold depth of water in the shallow aquifer required for return flow to occur (mm)

NSE

Nash–Sutcliffe Efficiency

ALPHA_BNK

baseflow alpha factor for bank storage (days)

GW_REVAP

groundwater ‘revap’ coefficient

PBIAS

percent of bias

SOL_AWC

available water capacity of the soil layer

R–R

rainfall–runoff

OV_N

Manning's ‘n’ value for overland flow

AI

artificial intelligence

ESCO

soil evaporation compensation factor

LSTM

long short-term memory

SOL_K

saturated hydraulic conductivity (mm/h)

FAO

Food and Agriculture Organization

CH_N2

Manning's n value for main channel

CH_K2

effective hydraulic conductivity (mm/h)

RCHRG_DP

deep aquifer percolation fraction

SOL_BD

moist bulk density (Mg/m3 or g/cm3)

CANMX

maximum canopy storage (mm)

REVAPMN

threshold depth of water in the shallow aquifer for ‘revap’ to occur

EPCO

plant uptake compensation factor

The management and planning of water resources, including irrigation water management, river basin engineering, reservoir operation, flood control, and navigation, rely heavily on rainfall–runoff (R–R) modelling (Santos & da Silva 2014; Noori & Kalin 2016; Shekar & Mathew 2022c). Additionally, it is crucial for the mitigation, early detection, and incident mitigation of natural catastrophes like flooding and drought (Shamseldin 2010; Nourani et al. 2013; Suwarno et al. 2020; Kumar et al. 2021; Gupta et al. 2023). The most popular methods for predicting discharge and estimating water balance are R–R models (Beven 2012). However, modelling the R–R process is a challenging hydrologic problem since the process exhibits randomness and complicated spatial and temporal dynamics (Singh & Sankarasubramanian 2014).

There have been numerous challenges over the past few decades to pinpoint and fully comprehend the R–R process (Omar et al. 2020, 2021a, 2021b, 2022a, 2022b; Gaur et al. 2021; Shekhar et al. 2021; Singh et al. 2023; Srivastava et al. 2023). Physically based, conceptual models and mathematically based, data-driven models are two primary groups of modelling methodologies (Fathian et al. 2018). Physically based models require a large amount of hydroclimatic data and input parameters to simulate complex hydrologic processes such as R–R. However, the usage of physically based models is frequently constrained by the lack of access to such physical data (Liu & Todini 2002; Lu et al. 2013). The connection between meteorological data and runoff is, however, captured by mathematically based, data-driven models without explicit understanding of the physical behaviour of the watershed programme (Modarres & Ouarda 2013; Kan et al. 2015).

There are numerous models in the hydrology literature; however, investigations on river discharge prediction have indicated that R–R models that directly acquire the physical phenomena of the streamflow approach are more effective (Malago et al. 2016; Pandey et al. 2016). Soil and Water Assessment Tool (SWAT), one of many watershed hydrology models, is used extensively around the world for applications, policy creation, environmental conditions, analysis for water restoration at several geographic scales, and decision-making (Guse et al. 2016; Himanshu et al. 2017; Dhami et al. 2018; Frizzle et al. 2021; Getachew et al. 2021; Dash et al. 2023). The model is also semi-distributed because it operates at the hydrologic response unit (HRU) level. The smallest spatial unit in the model is called the HRU, and the typical definition of the HRU combines together all identical soils, land use/land cover (LULC), and slopes within a sub-watershed (Neitsch et al. 2011; Nerantzaki et al. 2015; Schmalz et al. 2015; Swain et al. 2018; Aadhar et al. 2019; Veettil & Mishra 2020; Gupta et al. 2022; Mathew et al. 2022; Shekar & Mathew 2022a, 2022b; Mathew & Shekar 2023).

The SWAT model has been effectively employed in a number of applications, including the assessment of the consequences of hydropower projects, surface water, impacts on groundwater, climate change, snowmelt, the cycling of non-point source pollutants, etc. (Tokar & Markus 2000; Sinnathamby et al. 2017; Gupta et al. 2020a, 2020b; Nazari-Sharabian et al. 2020; Gupta et al. 2022; Khajuria et al. 2022; Nyakundi et al. 2022; Rautela et al. 2022, 2023; Umugwaneza et al. 2022; Gaur et al. 2023). Furthermore, this model needs a lot of temporal and spatial data, as well as features that can often be difficult to predict (Makwana & Tiwari 2014). The accuracy of the input data and model parameters determine how well the model performs. Additionally, the lengthy and difficult calibration and validation processes are caused by the numerous parameters, a wide variety of values, and the intricate connections between them (Rezaeianzadeh et al. 2013; Omani et al. 2017).

On the other hand, with the development of artificial intelligence (AI) during the past few decades, academics have become increasingly interested in the estimation of hydrological variables (He et al. 2014; Chutiman et al. 2022). AI techniques are becoming more popular among engineers and are increasingly important in the modelling of water resources and R–R (Jeong & Kim 2005). AI techniques can effectively handle large amounts of noisy, non-linear, and dynamic data, particularly when the basic physical relationships are not properly known (Ateeq-ur-Rauf et al. 2018; Elkiran et al. 2019). As a result, they are good choices for time series modelling issues with a data-driven approach (Lallahem & Maina 2003). Black box simulations that are precise about the non-linear and non-stationary behaviour of the R–R process include support vector regression, adaptive neuro-fuzzy systems, artificial neural networks (ANNs), and others. These models have already been successfully applied in numerous papers (Dawson & Wilby 1998; Sajikumara & Thandaveswara 1999; Tokar & Johnson 1999; Antar et al. 2006; Gazzaz et al. 2012).

Data-driven models have been widely employed in the fields of sustainable water management and hydrology since they are recognised as being capable of simulating extremely non-linear and complicated hydrological processes (Shoaib et al. 2014). Low complexity and computing costs, high adaptability and transferability are all distinct benefits of neural network-based models (Wu et al. 2009). The long short-term memory (LSTM)-based techniques in the hydrological domains perform quite well, notably for simulations of rainfall and runoff (Yin et al. 2021; Schmidhuber 2015; Shen 2018; Xiang et al. 2020; Bai et al. 2021). LSTM, an improved recurrent neural network, can manage the challenge of long-distance dependency that recurrent neutral networks (RNNs) are unable to solve due to the vanishing and exploding gradients (Kratzert et al. 2018). This time series forecasting model is currently among the more widely used ones (Marcais & de-Dreuzy 2017; Karim et al. 2018; Yuan et al. 2018; Zhang et al. 2018; Bai et al. 2019; Fan et al. 2020).

The study addresses several significant gaps in existing research. Firstly, there were limited comparative studies between the SWAT model and AI techniques for simulating R–R. Additionally, there have been very limited efforts to employ advanced deep learning methods, such as LSTM, for hydrological issues. Moreover, no previous study has compared R–R modelling using the SWAT model, AI techniques, which include seven data-driven models and an LSTM deep learning model. Furthermore, the Peddavagu watershed has not previously been subjected to R–R modelling using SWAT and seven AI models. Therefore, the primary objective of this study was to compare the performance of three distinct modelling approaches: the SWAT model, seven data-driven models (support vector regression model, ANN, multilinear regression model, k-nearest neighbour (KNN) regression model, XGBoost regression model, random forest (RF) regression model), and the deep learning model (LSTM) for monthly streamflow modelling in the Peddavagu River Basin. By conducting this comparative analysis and introducing the application of LSTM in R–R modelling, the study contributes to addressing these research gaps, providing valuable insights into the performance of different modelling approaches and their suitability for the specific context of the Peddavagu River Basin.

According to the Socioeconomic Data and Applications Centre (SEDAC), the Peddavagu River Basin has a population of about 3,30,000 people and is primarily located in the northern parts of Telangana State, inside the Deccan Plateau region. The majority of the population is engaged in agricultural activities, and agriculture dominates the land use in the basin, primarily focused on crops such as paddy, cotton, and pulses. With an annual average precipitation of around 1,150 mm, as indicated by the Indian Meteorological Department, the hydrology of the basin is significantly influenced by the monsoon rains occurring from June to September. The river receives its water from various seasonal streams and tributaries. As shown in Figure 1, the Peddavagu watershed area is divided between Telangana and Maharashtra states in India. It covers an area of 3,150 km2. According to the SRTM-DEM, the Peddavagu watershed is between 160 and 655 m above sea level. The watershed of Peddavagu is located between latitudes 18°45′0′′ and 19°45′0′′ north and longitudes 79°45′0′′ and 78°30′0′′ east. The outlet locations are 19°19′50′′ north and 79°30′14′′ east (site location: Bhatpalli).
Figure 1

Map of the Peddavagu watershed's location.

Figure 1

Map of the Peddavagu watershed's location.

Close modal

The study design involved comparing the performance of the SWAT model with seven AI techniques, including ANNs, KNN regression, linear regression, XGBoost regression, RF regression, support vector regression, and the LSTM deep learning model. The study was carried out by implementing the different models for monthly streamflow modelling in the Peddavagu River Basin. The calibration period encompassed the years 1990–2005, while the validation period covered the years 2006–2010. The models were calibrated and validated using observed streamflow data. The data analysis involved assessing the performance of the models based on evaluation metrics, namely R2 and NSE. These metrics were used to measure the correlation between simulated and observed streamflow values and the agreement between simulated and observed streamflow patterns, respectively. The results of the different models were compared to determine their effectiveness in R–R analysis.

Data

Hydrometeorological data: SWAT requires rainfall, maximum temperature, discharge, wind speed, minimum temperature, solar, soil, relative humidity, and LULC for modelling various physical processes (Tripathi et al. 2004; Shekar & Mathew 2023a, 2023b). For SWAT modelling applications in India where point rainfall measurements are not available, various studies suggest the India Meteorological Department (IMD), which provides gridded data with a geographic resolution of 0.25° × 0.25° (Kolluru et al. 2020; Setti et al. 2020; Tan et al. 2021). In this present study area, daily rainfall data with a 0.25° × 0.25° grid and daily maximum temperature data and minimum temperature data with a 1° × 1° grid were provided from 1987 to 2010 and collected from IMD (https://mausam.imd.gov.in/). Wind speed, relative humidity, and solar data for the Peddavagu river watershed have been downloaded from NASA Power (https://power.larc.nasa.gov/data-access-viewer/). Using observed discharge data from the Central Water Commission (CWC), Government of India, the SWAT model was validated and calibrated. The CWC of India's R–R simulation (http://www.cwc.gov.in/) used discharge data from 1990 to 2010. The DEM is taken from the NASA website to define the watershed basin (https://earthexplorer.usgs.gov/). The map was created from the Landsat 5 satellite images using supervised classification techniques (Roy et al. 2016) (Figure 2). The Food and Agriculture Organization (FAO) made a soil map and database available in 1974 under the name Digital Soil Map of the World (DSMW). According to Figure 2, there are two distinct textural classes in the research region.
  • (a)

    SWAT

Figure 2

(a) LULC; (b) slope; (c) soil classification; and (d) 13 sub-watersheds.

Figure 2

(a) LULC; (b) slope; (c) soil classification; and (d) 13 sub-watersheds.

Close modal

A physically based semi-distributed parameter model known as SWAT was developed to forecast sediment movement, nutrient transport, erosion from agricultural sources, and runoff in watersheds under various management options (Arnold & Allen 1996; Neitsch et al. 2004). In this research, the hydrologic model for the Peddavagu River Basin was set up using ARCSWAT, an interface in ARCGIS 10.7. Geo-environmental data with a high degree of spatial variability, such as soil data, LULC, and DEM, were used. There are 83 HRUs altogether among the 13 sub-watersheds that make up the Peddavagu River Basin. Figure 2 depicts the delineated watersheds and sub-watersheds that were derived using the SWAT model. The first 3 years of the simulation were chosen as the warm-up period, and the SWAT model was run from 1987 to 2010 throughout that time. The model was run with the maximum temperature, the solar radiation, the minimum temperature data, the relative humidity, the rainfall, and the wind speed datasets as weather inputs, along with DEM, LULC, and a soil map as shown in Figure 2. In this study, the model incorporates various techniques, including the Penman–Monteith approach for estimating potential evapotranspiration and the variable storage method for channel flow routing. These methods are employed to enhance the accuracy and reliability of the model's simulations. Using the curve number method developed by the Soil Conservation Service (SCS), the SWAT calculates the surface runoff volume from each HRU. The model was then run to simulate the surface runoff after this stage.

The sequential uncertainty fitting (SUFI-2) optimisation technique in SWAT-CUP was used to calibrate and validate the SWAT model for the watershed. Finding the most sensitive variables in the Peddavagu River Basin is the first stage in calibrating and validating the SWAT model (Abbaspour et al. 2015). In the current investigation, each model parameter's sensitivity was evaluated using a one-at-a-time sensitivity analysis in the SWAT-CUP programme. According to the sensitivity analysis, SOL_BD, ESCO, ALPHA_BNK, SOL_AWC, REVAPMN, OV_N, GW_DELAY, GW_REVAP, SOL_K, EPCO, RCHRG_DP, CH_N2, ALPHA_BF, CANMX, SURLAG, GWQMN, CH_K2, and CN2 were the most sensitive parameters in the study region. The SWAT model of the Peddavagu River Basin was validated and calibrated using these parameters. From 1990 to 2005, there was a calibration period, and from 2006 to 2010, there was a validation period. During model calibration, NSE was taken into account as the objective function. R2 and Nash–Sutcliffe Efficiency (NSE) were tested during streamflow (m3/s) calibration by SWAT-CUP to confirm a maximum concordance between observed and anticipated water budget components. The entire methodology for SWAT is shown in Figure 3.
  • (b)

    Data-driven and LSTM models

Figure 3

Flowchart for R–R modelling using the SWAT model.

Figure 3

Flowchart for R–R modelling using the SWAT model.

Close modal
The ANN, the support vector regression model, the multilinear regression model, the RF regression model, the XGBoost regression model, the K-neighbor regression model, and the LSTM model are the models under consideration in the study. Data collection has been done in the Peddavagu watershed for a period ranging between 1990 and 2010. The data pre-processing follows immediately after data collection, which includes initially filling up null values with zero, then eliminating erroneous entries, then normalising the data so that feature values fall between zero and one (Equation (1)), and then splitting the data into training and testing and then standardising the entire training data, or calibration, and testing data, or validation. Correlation analysis is done to select the most prominent features influencing the output variable, and those unrelated features can be eliminated in this step. The data are then fitted into the various regression models, and predictions are made as a result. In the case of deep learning, the number of layers, number of epochs, learning rate, and batch size are selected by hit and trial. The predicted results are then evaluated by means of some performance evaluators like the R2 score and the NSE (Figure 4).
(1)
where xi′ is the normalised value of any variable x for an ith sample, max(x) is the maximum values of x and min(x) is the minimum values of x, respectively.
Figure 4

Methodology of artificial intelligence models.

Figure 4

Methodology of artificial intelligence models.

Close modal

Model performance evaluation

The evaluation of performance is a crucial step during the operation of any research project. It is essential to assess the effectiveness of each deployed model or procedure using one or more metrics to ensure reliable results from SWAT and data-driven models. There are various efficacy indicators available for evaluating model performance, with NSE and R2 being the most commonly used ones (Moriasi et al. 2007).

Nash–Sutcliffe Efficiency

NSE, a normalised statistic, quantifies the disparity between the residual variance and the variance of the observed data (Nash & Sutcliffe 1970). NSE is computed as shown in Equation (2): The number between zero and one is the NSE. When NSE = 1, as shown in Table 1 (Moriasi et al. 2007), it denotes a perfect match between observed and anticipated values.
(2)
where is the mean of actual discharges, M is the number of observations, is the mean of simulated discharges, is the actual discharge at ‘T’ time and is the simulated discharge.
Table 1

Shows the recommended statistics' overall performance ratings for a monthly time step

Performance ratingPBIAS (%)NSE
Unsatisfactory PBIAS > ±25 NSE < 0.50 
Satisfactory ±15 < PBIAS < ±25 0.50 < NSE < 0.65 
Good ±10 < PBIAS < ±15 0.65 < NSE < 0.75 
Very good PBIAS < ±10 0.75 < NSE < 1.00 
Performance ratingPBIAS (%)NSE
Unsatisfactory PBIAS > ±25 NSE < 0.50 
Satisfactory ±15 < PBIAS < ±25 0.50 < NSE < 0.65 
Good ±10 < PBIAS < ±15 0.65 < NSE < 0.75 
Very good PBIAS < ±10 0.75 < NSE < 1.00 

Coefficient of determination

The R2 statistic is used to measure the proportion of variance in the observed data that can be explained by the model. Ranging from zero to one, higher values of R2, typically above 0.5, indicate that the model can explain a larger portion of the error variance, indicating a better fit between the model and the observed data (Santhi et al. 2001; Van-Liew et al. 2007). R2 is calculated with Equation (3):
(3)
where M is the number of observations , y is the standard deviation of X and Y, respectively; and j are the simulated and observed values, respectively; and m are the mean of simulated and observed values, respectively.
  • (a)

    R–R modelling using SWAT

Although the SWAT model produces a variety of outcomes at the outlet of each sub-watershed, the focus of this study is on the streamflow at the main outlet of the total watershed because observable data for streamflow into the Peddavagu is available (site Bhatpalli). The Peddavagu watershed is divided into 13 sub-watersheds, and the main outlet is located at sub-watershed 12 (Figure 2). The total contributing area of the Peddavagu watershed is 3,150 km2. The period from 1987 to 2010 is divided into three parts, with the years 1987 through 1989 serving as the warm-up period, 1990 through 2005 as the calibration period, and 2006 through 2010 as the validation period.

The model is then calibrated using the observed discharge for the years 1990–2010. The calibration was carried out at monthly intervals. The observed daily data were scaled up to the monthly level and provided the objective function of maximising NSE to the SWAT-CUP model. In SWAT-CUP, the SUFI2 tool is the most often used algorithm, and it is also used in this work. As can be seen in Table 2, a total of 18 parameters were employed for calibration and validation. The chosen parameters are relevant to the hydrological processes involved in R–R modelling. Each parameter plays a crucial role in influencing various aspects of the water cycle, such as surface runoff, channel flow, groundwater contribution, infiltration, evaporation, and storage dynamics. By calibrating and validating the model using this specific set of parameters, the study aims to accurately represent these hydrological processes and improve the precision of the simulated streamflow in the Peddavagu River Basin. The careful selection of these parameters ensures that the model captures the important dynamics and provides a comprehensive understanding of the R–R behaviour in the study area. After many simulations, the NSE parameter, among other objective functions, was used to validate and calibrate the SWAT model's performance. R2 was 0.63 and 0.75 during the calibration and validation periods, respectively (Figure 5). The NSE was 0.71 during the validation period and 0.62 during the calibration period, respectively. During the calibration period, the SWAT model produced an R-factor of 0.67 and a P-factor of 0.72; during the validation period, it produced an R-factor of 0.69 and a P-factor of 0.9, respectively (Thavhana et al. 2018). Percent of bias (PBIAS) represents the average tendency for the simulated data to differ from the real data in either a bigger or smaller manner (Gupta et al. 1999). Positive values suggest that the model has an underestimation bias, whereas negative values suggest that the model has an overestimation bias (Gupta et al. 1999). According to Table 1, the calibration period and validation period were both less overestimated, as indicated by the PBIAS values of −0.1 and −11.1, respectively, which show that both values fit with very good and good performance, respectively. The simulation of the monthly runoff and the SWAT model's performance are shown in Figure 6.
Table 2

Calibrated parameters and their ranges used in SWAT-CUP

Sl. No.Parameter NameFitted ValueMin valueMax value
R__CN2.mgt −0.1 −0.25 −0.05 
R__SOL_BD(..).sol 0.259 −0.5 0.6 
A__GW_DELAY.gw 69 100 
V__ESCO.hru 0.4753 0.01 
V__GW_REVAP.gw 0.17 
V__CH_K2.rte 36.25 25 100 
V__CH_N2.rte 0.0685 0.05 0.1 
V__CANMX.hru 61 50 150 
R__OV_N.hru 0.79 
10 A__GWQMN.gw 1,820 2,000 
11 R__SOL_K(..).sol 0.272 −0.8 0.8 
12 R__SURLAG.bsn 46.5 150 
13 V__ALPHA_BNK.rte 0.085 0.5 
14 A__RCHRG_DP.gw 0.27 
15 R__SOL_AWC(..).sol −0.062 −0.1 0.1 
16 V__REVAPMN.gw 325 500 
17 V__EPCO.hru 0.69 
18 V__ALPHA_BF.gw 0.925 0.7 
Sl. No.Parameter NameFitted ValueMin valueMax value
R__CN2.mgt −0.1 −0.25 −0.05 
R__SOL_BD(..).sol 0.259 −0.5 0.6 
A__GW_DELAY.gw 69 100 
V__ESCO.hru 0.4753 0.01 
V__GW_REVAP.gw 0.17 
V__CH_K2.rte 36.25 25 100 
V__CH_N2.rte 0.0685 0.05 0.1 
V__CANMX.hru 61 50 150 
R__OV_N.hru 0.79 
10 A__GWQMN.gw 1,820 2,000 
11 R__SOL_K(..).sol 0.272 −0.8 0.8 
12 R__SURLAG.bsn 46.5 150 
13 V__ALPHA_BNK.rte 0.085 0.5 
14 A__RCHRG_DP.gw 0.27 
15 R__SOL_AWC(..).sol −0.062 −0.1 0.1 
16 V__REVAPMN.gw 325 500 
17 V__EPCO.hru 0.69 
18 V__ALPHA_BF.gw 0.925 0.7 
Figure 5

R2 for SWAT model calibration and validation periods.

Figure 5

R2 for SWAT model calibration and validation periods.

Close modal
Figure 6

SWAT model comparison of observed and simulated flows.

Figure 6

SWAT model comparison of observed and simulated flows.

Close modal

Once the fitted parameters have been obtained, they are then rewritten in ArcSWAT for the final output values. The Peddavagu watershed's water balance is presented using the SWAT model over a 21-year period (1990–2010). The overall curve number for the Peddavagu watershed is 68.72. From 1990 to 2010, there was 1,160.7 mm of precipitation per year on average, which had a 224.13-mm runoff at the main outlet. There are 571.6 mm of evaporation and transpiration in this investigation. The depths of shallow aquifer percolation, recharge to the deep aquifer, and lateral flow are 316.74, 101.36, and 47.29 mm, respectively.

  • (b)

    Data-driven models

The same calibration and validation datasets are used to construct data-driven models in addition to the physically based SWAT model. Similar to the SWAT model, 16 years of data (1990–2005) are used for training (calibration), and an additional 5 years of data (2006–2010) are used for testing (validation).

R–R modelling using ANN

Studies on hydrology typically employ ANN as a forecasting technique. Engineers frequently utilise feed-forward, back-propagation (BP) network models in ANN. It has been proven that any engineering problem may be forecasted and simulated using the three-layer BP network model (Hornik 1988; ASCE 2000). The study incorporates monthly rainfall, the maximum temperature, the wind, the relative humidity, the minimum temperature, and the solar as input variables. The study's output is monthly observed discharge. The number of neurons and the type of structure were found by trial and error, which led the network to explore a two-layer structure with 25 neurons inserted in each layer. Training and testing are performed using monthly data from the Peddavagu watershed from 1990 to 2005 and 2006 to 2010.

At the monthly scale, the ANN model exhibited excellent performance during the training period, with an R2 of 0.86 and an NSE of 0.85. The R2 and NSE performance indices at the monthly scale for the ANN model, on the other hand, were found to be 0.75 and 0.75, respectively, during the testing period. The comparison of R2 values in Figure 7 indicates that the ANN model exhibited superior performance during the calibration, with an acceptable value of 0.86. The ANN model demonstrated strong performance with an NSE value of 0.75 during both the calibration and validation periods. Overall, it was found that the ANN model's ability to simulate runoff in the Peddavagu watershed based on both R2 and NSE was satisfactory. The hydrograph between simulated and observed runoff for the training years 1990–2005 and the testing years 2006–2010 are shown, respectively, in Figure 8.
Figure 7

R2 for calibration (training) and validation (testing) for the ANN model.

Figure 7

R2 for calibration (training) and validation (testing) for the ANN model.

Close modal
Figure 8

ANN model comparison of the observed and simulated flows.

Figure 8

ANN model comparison of the observed and simulated flows.

Close modal

R–R modelling using support vector regression model

The SVM principle, which is used for difficulty classification and non-linear regression, served as the foundation for SVR (Nourani et al. 2020). Contrary to many other black box forecasting techniques, SVR minimises the error between the actual and anticipated parameters rather than reducing operational hazards as a goal function. SVR is a category of AI model that is built via supervised learning. For SVM modelling, the choice of the kernel function is essential since the performance of the SVM model depends on the selection of the kernel parameters (Xu et al. 2012). In the current study, six independent variables are included: rainfall, the maximum temperature, the wind, the relative humidity, the minimum temperature, and the solar. The prediction of the discharge observed is analysed based on these six factors. Training and testing are performed using monthly data from the Peddavagu watershed from 1990 to 2005 and 2006 to 2010. In this case, the regressor is trained using a polynomial kernel of degree 3. To figure out the accuracy of the model prediction, R2, also known as the coefficient of determination, is determined.

At the monthly scale, the SVR model exhibited excellent performance during the training period, with an R2 of 0.77 and an NSE of 0.74. The R2 and NSE performance indices at the monthly scale for the SVR model, on the other hand, were found to be 0.73 and 0.71, respectively, during the testing period. The comparison of R2 values in Figure 9 indicates that the SVR model exhibited superior performance during the calibration, with an acceptable value of 0.77. The SVR model showed good performance, with an NSE value of 0.74 during the calibration period and 0.71 during the validation period. Overall, it was found that the SVR model's ability to simulate runoff in the Peddavagu watershed based on both R2 and NSE was satisfactory. The hydrograph between simulated and observed runoff for the training years 1990–2005 and the testing years 2006–2010 are shown, respectively, in Figure 10.
Figure 9

R2 for calibration (training) and validation (testing) for the SVR model.

Figure 9

R2 for calibration (training) and validation (testing) for the SVR model.

Close modal
Figure 10

SVR model comparison of observed and simulated flows.

Figure 10

SVR model comparison of observed and simulated flows.

Close modal

R–R modelling using KNN model regression

KNN is an efficient process that predicts the numerical target by utilising a similarity metric. By assuming that similar objects exist nearby, the target is predicted using local interpolation of the targets connected to the training set's nearest neighbours. Rainfall, the maximum temperature, the wind, the relative humidity, the minimum temperature, and the solar radiation are the six independent variables in the present study. These six parameters are used to analyse the prediction of the observed discharge. Training and testing are performed using monthly data from the Peddavagu watershed from 1990 to 2005 and 2006 to 2010. Standardisation and normalisation of the data are carried out as a pre-processing step before fitting the regression model. The optimum number of neighbours has been found to be four through parameter adjustment. To figure out the accuracy of the model prediction, R2, also known as the coefficient of determination, is determined.

At the monthly scale, the KNN model exhibited excellent performance during the training period, with an R2 of 0.74 and an NSE of 0.73. The R2 and NSE performance indices at the monthly scale for the KNN model, on the other hand, were found to be 0.62 and 0.61, respectively, during the testing period. The comparison of R2 values in Figure 11 indicates that the KNN model exhibited superior performance during the calibration, with an acceptable value of 0.74. The KNN model demonstrated good performance with an NSE value of 0.73 during the calibration period and a satisfactory NSE value of 0.61 during the validation period. Overall, it was found that the KNN model's ability to simulate runoff in the Peddavagu watershed based on both R2 and NSE was satisfactory. The hydrograph between simulated and observed runoff for the training years 1990–2005 and the testing years 2006–2010 are shown, respectively, in Figure 12.
Figure 11

R2 for calibration (training) and validation (testing) for the KNN model.

Figure 11

R2 for calibration (training) and validation (testing) for the KNN model.

Close modal
Figure 12

KNN model comparison of the observed and simulated flows.

Figure 12

KNN model comparison of the observed and simulated flows.

Close modal

R–R modelling using multiple linear regression model

A statistical method for predicting continuous or real variables is linear regression. It illustrates a linear relationship between the variables and can thus show how the value of the dependent variable changes as the independent variable changes. In the case of multiple linear regression (MLR), the relationship between an independent variable and a number of dependent variables is predicted. Rainfall, the maximum temperature, the wind, the relative humidity, the minimum temperature, and the solar radiation are the six independent variables in the present study. These six parameters are used to analyse the prediction of the observed discharge. Training and testing are performed using monthly data from the Peddavagu watershed from 1990 to 2005 and 2006 to 2010. Before fitting the regression model, data standardisation and normalisation are performed as a pre-processing step. R2 is calculated to find the efficacy of the model's prediction.

At the monthly scale, the MLR model exhibited excellent performance during the training period, with an R2 of 0.59 and an NSE of 0.59. The R2 and NSE performance indices at the monthly scale for the MLR model, on the other hand, were found to be 0.62 and 0.59, respectively, during the testing period. The comparison of R2 values in Figure 13 indicates that the MLR model exhibited superior performance during the calibration, with an acceptable value of 0.62. The MLR model achieved a satisfactory NSE value of 0.59 during both the calibration and validation periods, indicating reasonable performance. Overall, it was found that the MLR model's ability to simulate runoff in the Peddavagu watershed based on both R2 and NSE was satisfactory. The hydrograph between simulated and observed runoff for the training years 1990–2005 and the testing years 2006–2010 are shown, respectively, in Figure 14.
Figure 13

R2 for calibration (training) and validation (testing) for the MLR model.

Figure 13

R2 for calibration (training) and validation (testing) for the MLR model.

Close modal
Figure 14

MLR model comparison of the observed and simulated flows.

Figure 14

MLR model comparison of the observed and simulated flows.

Close modal

R–R modelling using RF model regression

RF is a combined learning strategy for classification and regression (Liaw & Wiener 2002). The ensemble learning and decision tree frameworks are combined by the RF algorithm to produce a number of randomly selected decision trees from the data. The system then averages the data to produce a new result, which usually produces accurate classifications and forecasts. The six independent variables in the current study are maximum temperature, rainfall, relative humidity, wind, minimum temperature, and solar radiation. The analysis of the predicted discharge is done using these six parameters. Peddavagu watershed monthly data from 1990 to 2005 and 2006 to 2010 are used for training and testing. Standardisation and normalisation of the data are carried out as a pre-processing step before fitting the regression model. To determine the accuracy of the model prediction,  R2, also known as the coefficient of determination, is evaluated.

At the monthly scale, the RF model exhibited excellent performance during the training period, with an R2 of 0.95 and an NSE of 0.94. The R2 and NSE performance indices at the monthly scale for the RF model, on the other hand, were found to be 0.66 and 0.65, respectively, during the testing period. The comparison of R2 values in Figure 15 indicates that the RF model exhibited superior performance during the calibration, with an acceptable value of 0.95. The RF model exhibited very good performance during the training period with an NSE value of 0.94, and good performance during the testing period with an NSE value of 0.65. Overall, it was found that the RF model's ability to simulate runoff in the Peddavagu watershed based on both R2 and NSE was satisfactory. The hydrograph between simulated and observed runoff for the training years 1990–2005 and the testing years 2006–2010 are shown in Figure 16.
Figure 15

R2 for calibration (training) and validation (testing) for the RF model.

Figure 15

R2 for calibration (training) and validation (testing) for the RF model.

Close modal
Figure 16

RF model comparison of observed and simulated flows.

Figure 16

RF model comparison of observed and simulated flows.

Close modal

R–R modelling using XGBoost regression model

Chen & Guestrin (2016) proposed the C + +-based language XGBoost (Nobre & Neves 2019). The model has had a lot of success since it first appeared and frequently places among the top models in several data mining competitions. XGBoost is a useful gradient boosting technique for regression predictive modelling. Using decision tree models that are introduced to them one at a time and fitted to correct the prediction errors provided by prior models, the gradient boosting family of ensemble machine learning approaches produces ensembles. Rainfall, the maximum temperature, the relative humidity, the minimum temperature, the wind, and the solar are the six independent variables in the present study. These six parameters are used to analyse the prediction of the observed discharge. Training and testing are performed using monthly data from the Peddavagu watershed from 1990 to 2005 and 2006 to 2010. The number of gradient-boosted trees or estimators used is 1,000. The learning rate chosen is 0.1, and the objective is set to minimise absolute error. R2 is calculated to find the efficacy of the model's prediction.

At the monthly scale, the XGBoost model exhibited excellent performance during the training period, with an R2 of 0.95 and an NSE of 0.95. The R2 and NSE performance indices at the monthly scale for the XGBoost model, on the other hand, were found to be 0.68 and 0.68, respectively, during the testing period. The comparison of R2 values in Figure 17 indicates that the XGBoost model exhibited superior performance during the calibration, with an acceptable value of 0.95. The XGBoost model exhibited good performance with an NSE value of 0.68 during both the training period (calibration) and the testing period (validation). Overall, it was found that the XGBoost model's ability to simulate runoff in the Peddavagu watershed based on both R2 and NSE was satisfactory. The hydrograph between simulated and observed runoff for the training years 1990–2005 and the testing years 2006–2010 are shown, respectively, in Figure 18.
  • (c)

    Deep learning model

Figure 17

R2 for calibration (training) and validation (testing) for the XGBoost model.

Figure 17

R2 for calibration (training) and validation (testing) for the XGBoost model.

Close modal
Figure 18

XGBoost model comparison of observed and simulated flows.

Figure 18

XGBoost model comparison of observed and simulated flows.

Close modal

R–R modelling using the LSTM model

The inability of basic ANN to learn long-term dependencies is overcome by LSTM networks, a specific sort of neural network that can tackle problems involving historical data and sequential information (Xiang et al. 2020). The existing RNN cannot be utilised for learning since the error gradient disappears and it is impossible to store data in a layer close to the input layer when the RNN is learning. A model to address these problems is LSTM. LSTM is employed in a number of fields, such as speech recognition, language modelling, and translation, by integrating with other neural networks. It can also be used to ascertain how long-term time depends. In the present study, six independent variables are considered: rainfall, maximum temperature, wind, relative humidity, minimum temperature, and solar. The prediction of the discharge observed is analysed based on these six factors. Training and testing are performed using monthly data from the Peddavagu watershed from 1990 to 2005 and 2006 to 2010. 5 LSTM layers, each having two units, have been fitted with the training data with the objective of minimising mean squared error. The optimiser used is Adam, and the batch size is chosen as 8. The training has been accomplished in 400 epochs. R2 is calculated to find the efficacy of the model's prediction.

At the monthly scale, the LSTM model exhibited excellent performance during the training period, with an R2 of 0.88 and an NSE of 0.88. The R2 and NSE performance indices at the monthly scale for the LSTM model, on the other hand, were found to be 0.88 and 0.85, respectively, during the testing period. The comparison of R2 values in Figure 19 indicates that the XGBoost model exhibited superior performance during the calibration, with an acceptable value of 0.88. The LSTM model demonstrated very good performance with an NSE value of 0.88 during the training period (calibration) and an NSE value of 0.85 during the testing period (validation). Overall, it was found that the LSTM model's ability to simulate runoff in the Peddavagu watershed based on both R2 and NSE was satisfactory. The hydrograph between simulated and observed runoff for the training years 1990–2005 and the testing years 2006–2010 are shown, respectively, in Figure 20.
Figure 19

R2 for calibration (training) and validation (testing) for the LSTM model.

Figure 19

R2 for calibration (training) and validation (testing) for the LSTM model.

Close modal
Figure 20

LSTM model comparison of the observed and simulated flows.

Figure 20

LSTM model comparison of the observed and simulated flows.

Close modal

Comparison of SWAT, data-driven, and LSTM models

The comparison of the SWAT model with seven data-driven models and one deep learning model, including ANN, multilinear regression, support vector regression, XGBoost regression, RF regression, KNN, and LSTM, is described below. R2 values for the training or calibration dataset and the testing or validation dataset were used to compare the fitness of the models. The outcomes from several models are displayed in Table 3. The multilinear regression model produced the lowest R2 of 0.59 during the training period (calibration), whereas the RF and XGBoost models both had the highest R2 of 0.95. For the testing period (validation), the LSTM model generated the highest R2 of 0.88, while the RF and KNN models achieved the lowest R2 of 0.62. Regarding R2 values for both training (calibration) and testing (validation), the SWAT and seven artificial intelligences are appropriate (Santhi et al. 2001; Van-Liew et al. 2007).

Table 3

Results obtained from various models

ModelR2
NSE
Training/calibration dataTesting/validation dataTraining/calibration dataTesting/validation data
SWAT 0.63 0.75 0.62 0.71 
ANN 0.86 0.75 0.85 0.75 
SVR 0.77 0.73 0.74 0.71 
KNN 0.74 0.62 0.73 0.61 
MLR 0.59 0.62 0.59 0.59 
RF 0.95 0.66 0.94 0.65 
XGBoost 0.95 0.68 0.95 0.68 
LSTM 0.88 0.88 0.88 0.85 
ModelR2
NSE
Training/calibration dataTesting/validation dataTraining/calibration dataTesting/validation data
SWAT 0.63 0.75 0.62 0.71 
ANN 0.86 0.75 0.85 0.75 
SVR 0.77 0.73 0.74 0.71 
KNN 0.74 0.62 0.73 0.61 
MLR 0.59 0.62 0.59 0.59 
RF 0.95 0.66 0.94 0.65 
XGBoost 0.95 0.68 0.95 0.68 
LSTM 0.88 0.88 0.88 0.85 

The multilinear regression model achieved the lowest NSE of 0.59 for the training period (calibration), whereas the XGBoost model produced the highest NSE of 0.95 for the training period. For the testing period (validation), the multilinear regression model achieved the lowest NSE of 0.59, while the LSTM model achieved the highest NSE of 0.85. The findings of the model's LSTM and ANN are very good with respect to NSE values, and it can be said that it was very efficient at simulating the monthly flow in the Peddavagu watershed, according to Moriasi et al.'s 2007 study on NSE.

The main findings in the SWAT model are that it is unable to match the peak observed discharge in the current research region, although small observed peaks are matched with simulated discharge in the SWAT model. R2 was 0.63 and 0.75 during the calibration and validation periods, respectively. In the calibration and validation periods, the NSE was 0.62 and 0.71, respectively. SWAT performed well in the validation period compared with calibration. When compared with other literature (Thavhana et al. 2018; Zakizadeh et al. 2020), the present study area SWAT model performed well. In comparison to the multilinear regression model, the SWAT model has performed well during the calibration and validation periods. The SWAT model's comparatively poor performance when compared to AI models (with the exception of MLP) might be the result of the most sensitive parameters' identification (Cibin et al. 2010). When compared to SWAT and other models in the current study during the calibration and validation periods, the LSTM model performed very well in terms of finding peaks of observed discharge and lows of observed discharge. As a result, compared to the LSTM model, the SWAT model performs rather poorly when simulating streamflow. The LSTM model appears to achieve the best overall result when compared to all other models when it comes to having the NSE value be very good for both the training period (calibration) and testing period (validation) to simulate R–R in the Peddavagu watershed. As a result, the Peddavagu watershed can efficiently simulate R–R using the LSTM model.

The analysis of R–R is a crucial and essential step in managing and planning for water resources. Traditionally, hydrologic models have been utilised for R–R analysis, considering the complex interactions within the water cycle. In recent years, the field of hydrology has increasingly integrated AI techniques, which have shown promising results, sometimes even outperforming conventional hydrological models in simulating runoff. The SWAT model and AI models were employed in this study's runoff analysis of a novel concept in the targeted region. In order to compare the study's findings and simulate the runoff around the river basin, it was carried out at the Peddavagu River Basin in India. Generally, simulations were run between 1990 and 2010. For the calibration period, which runs from 1990 to 2005, the validation period is from 2006 to 2010. In the current study, it was found that the R2 and NSE correlations for simulating R–R for all eight models were satisfactory. It can be said that the model's LSTM and ANN outcomes are excellent in terms of NSE values and that it was highly efficient in simulating the monthly flow. Overall, when compared to all other models, the deep learning model, the LSTM model, appears to have the best performance for simulating R–R in the Peddavagu watershed throughout both the training phase (calibration) (R2 is 0.88 and NSE is 0.88) and the testing period (validation) (R2 is 0.88 and NSE is 0.85). The calibrated and validated models from this study can be valuable tools for decision-makers to plan for sustainable water management in the Peddavagu watershed.

The authors would like to thank the editor and reviewers for their valuable comments and suggestions, which helped improve the quality of this paper. The authors would like to thank the Central Water Commission of India for providing stage discharge data (http://www.cwc.gov.in/). The authors would like to thank the US Geological Survey (USGS) for making the satellite data available (https://earthexplorer.usgs.gov/). Thank you to the Indian Meteorological Department for providing rainfall and temperature data on their website at https://mausam.imd.gov.in/. We would also especially like to thank NASA Power for providing the wind speed, relative humidity, and solar data on their website at https://power.larc.nasa.gov/data-access-viewer/.

There was no funding for this project.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Aadhar
S.
,
Swain
S.
&
Rath
D. R.
2019
Application and performance assessment of SWAT hydrological model over Kharun river basin, Chhattisgarh, India
. In:
World Environmental and Water Resources Congress 2019: Watershed Management, Irrigation and Drainage, and Water Resources Planning and Management
- Selected Papers from the World Environmental and Water Resources Congress.
American Society of Civil Engineers
,
Reston, VA
, pp.
272
280
.
Abbaspour
K. C.
,
Rouholahnejad
E.
,
Vaghefi
S.
,
Srinivasan
R.
,
Yang
H.
&
Kløve
B.
2015
A continental-scale hydrology and water quality model for Europe: calibration and uncertainty of a high-resolution large-scale SWAT model
.
Journal of Hydrology
524
,
733
752
.
https:// doi.org/10.1016/J.JHYDROL.2015.03.027
.
Antar
M. A.
,
Elassiouti
I.
&
Alam
M. N.
2006
Rainfall–runoff modeling using artificial neural networks technique: a Blue Nile catchment case study
.
Hydrological Processes
20
(
5
),
1201
1216
.
Arnold
J. G.
&
Allen
P. M.
1996
Estimating hydrologic budgets for three Illinois watershedsm
.
Journal of Hydrology
176
(
1–4
),
57
77
.
Ateeq-ur-Rauf
,
Ghumman
A. R.
,
Ahmad
S.
&
Hashmi
H. N.
2018
Performance assessment of artificial neural networks and support vector regression models for stream flow predictions
.
Environmental Monitoring and Assessment
190
(
12
),
704
.
https://doi.org/10.1007/s10661-018-7012-9
.
Bai
Y.
,
Bezak
N.
,
Sapac
K.
,
Klun
M.
&
Zhang
J.
2019
Short-term streamflow forecasting using the feature-enhanced regression model
.
Water Resources Management
33
(
14
),
4783
4797
.
Bai
Y.
,
Bezak
N.
&
Zeng
B.
2021
Daily runoff forecasting using a cascade long short-Term memory model that considers different variables
.
Water Resources Management
35
,
1167
1181
.
https://doi.org/10.1007/s11269-020-02759-2
.
Beven
K. J.
2012
Rainfall–Runoff Modelling: the Primer
, 2nd.
Wiley-Blackwell
,
Chichester
,
West Sussex
.
Chen
T.
&
Guestrin
C.
2016
Xgboost: a scalable tree boosting system
. In:
Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining
, Proceedings of the 22nd ACM, San Francisco, 13–17 August 2016.
ACM
, New York, pp.
785e794
.
Chutiman
N.
,
Chiangpradit
M.
,
Kong-ied
B.
,
Busababodhin
P.
,
Chaiyasaen
C.
&
Guayjarernpanish
G.
2022
Daily maximum rainfall forecast affected by tropical cyclones using grey theory
.
Civil Engineering Journal
8
(
8
).
doi:10.28991/CEJ-2022-08-08-02
.
Cibin
R.
,
Sudheer
K.
&
Chaubey
I.
2010
Sensitivity and identifiability of stream flow generation parameters of the SWAT model
.
Hydrological Processes International Journal
24
(
9
),
1133
1148
.
Dash
S. S.
,
Sahoo
B.
&
Raghuwanshi
N. S.
2023
SWAT model calibration approaches in an integrated paddy-dominated catchment-command
.
Agricultural Water Management
.
https://doi.org/10.1016/j.agwat.2023.108138
.
Dawson
C. W.
&
Wilby
R.
1998
An artificial neural network approach to rainfall– runoff modeling
.
Journal of Hydrology
43
,
47
66
.
Dhami
B.
,
Himanshu
S. K.
,
Pandey
A.
&
Gautam
A. K.
2018
Evaluation of the SWAT model for water balance study of a mountainous snowfed river basin of Nepal
.
Environmental Earth Sciences
77
(
1
),
1
20
.
Elkiran
G.
,
Nourani
V.
&
Abba
S. I.
2019
Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach
.
Journal of Hydrology
577
,
123962
.
https://doi.org/ 10.1016/j.jhydrol.2019.123962
.
Fathian
F.
,
Fakheri-Fard
A.
,
Modarres
R.
&
Van-Gelder
P. H. A. J. M.
2018
Regional scale rainfall–runoff modeling using VARX–MGARCH approach
.
Stochastic Environmental Research and Risk Assessment
32
(
4
),
999
1016
.
Frizzle
C.
,
Fournier
R. A.
,
Trudel
M.
&
Luther
J. E.
2021
Using the soil and water assessment tool to develop a LiDAR-based index of the erosion regulation ecosystem service
.
Journal of Hydrology
595
,
1
19
.
https://doi.org/10.1016/j.jhydrol.2021.126009
.
Gaur
S.
,
Johannet
A.
,
Graillot
D.
,
Omar
P. J.
,
2021
Modeling of groundwater level using artificial neural network algorithm and WA-SVR model BT
. In:
Groundwater Resources Development and Planning in the Semi-Arid Region
(
Pande
C. B.
&
Moharir
K. N.
, eds).
Springer International Publishing
,
Cham
, pp.
129
150
.
https://doi.org/10.1007/978-3-030-68124-1_7
.
Gaur
S.
,
Omar
P. J.
,
Eslamian
S.
,
2023
Chapter 1 - Advantage of grid-free analytic element method for identification of locations and pumping rates of wells
. In:
Handbook of Hydroinformatics
(
Eslamian
S.
&
Eslamian
F.
, eds).
Elsevier
, pp.
1
10
.
https://doi.org/10.1016/B978-0-12-821962-1.00003-9
.
Gazzaz
N. M.
,
Aris
A. Z.
,
Juahir
H.
,
Ramli
N. F.
&
Yusoff
M. K.
2012
Artificial neural network modeling of the water quality index for Kinta River (Malaysia) using water quality variables as predictors
.
Marine Pollution Bulletin
64
(
11
),
2409
2420
.
Getachew
B.
,
Manjunatha
B. R.
&
Bhat
H. G.
2021
Modeling projected impacts of climate and land use/land cover changes on hydrological responses in the Lake Tana Basin, upper Blue Nile River Basin, Ethiopia
.
Journal of Hydrology
595
,
1
20
.
https://doi.org/10.1016/j.jhydrol.2021.125974
.
Gupta
H. V.
,
Sorooshian
S.
&
Yapo
P. O.
1999
Status of automatic calibration for hydrologic models: comparison with multilevel expert calibration
.
Journal of Hydrologic Engineering
4
(
2
),
135
143
.
Gupta
A.
,
Himanshu
S. K.
,
Gupta
S.
&
Singh
R.
2020a
Evaluation of the SWAT model for analysing the water balance components for the upper Sabarmati Basin
. In:
Advances in Water Resources Engineering and Management
(AlKhaddar, R., Singh, R., Dutta, S. & Kumari, M., eds.).
Springer
,
Singapore
, pp.
141
151
.
Gupta
N.
,
Mathew
A.
&
Khandelwal
S.
2020b
Spatio-temporal impact assessment of land use/land cover (LU-LC) change on land surface temperatures over Jaipur city in India
.
International Journal of Urban Sustainable Development
12 (3),
283
299
.
Gupta
N.
,
Mahato
P. K.
,
Patel
J.
,
Omar
P. J.
,
Tripathi
R. P.
,
2022
Chapter 30 - Understanding trend and its variability of rainfall and temperature over Patna (Bihar)
. In:
Current Directions in Water Scarcity Research
(
Zakwan
M.
,
Wahid
A.
,
Niazkar
M.
&
Chatterjee
U.
, eds).
Elsevier
. Vol.
7
, pp.
533
543
.
ISSN 2542-7946, ISBN 9780323919104, https://doi.org/10.1016/B978-0-323-91910-4.00030-3
.
Gupta
N.
,
Patel
J.
,
Gond
S.
,
Tripathi
R. P.
,
Omar
P. J.
,
Dikshit
P. K. S.
,
2023
Projecting future maximum temperature changes in River Ganges Basin Using Observations and Statistical Downscaling Model (SDSM)
. In:
River Dynamics and Flood Hazards. Disaster Resilience and Green Growth
(
Pandey
M.
,
Azamathulla
H.
&
Pu
J. H.
, eds).
Springer
,
Singapore
.
https://doi.org/10.1007/978-981-19-7100-6_31
.
Guse
B.
,
Pfannerstill
M.
,
Strauch
M.
,
Reusser
D. E.
,
Lüdtke
S.
,
Volk
M.
,
Gupta
H.
&
Fohrer
N.
2016
On characterizing the temporal dominance patterns of model parameters and processes
.
Hydrological Processes
30
(
13
),
2255
2270
.
https://doi.org/10.1002/HYP.10764
.
Himanshu
S. K.
,
Pandey
A.
&
Shrestha
P.
2017
Application of SWAT in an Indian river basin for modeling runoff, sediment and water balance
.
Environmental Earth Sciences
76
(
1
),
3
.
Kan
G.
,
Yao
C.
,
Li
Q.
,
Li
Z.
,
Yu
Z.
,
Liu
Z.
,
Ding
L.
,
He
X.
&
Liang
K.
2015
Improving event-based rainfall-runoff simulation using an ensemble artificial neural network based hybrid data-driven model
.
Stochastic Environmental Research and Risk Assessment
29
(
5
),
1345
1370
.
https://doi.org/10.1007/s00477-015-1040-6
.
Karim
F.
,
Majumdar
S.
,
Darabi
H.
&
Chen
S.
2018
LSTM fully convolutional networks for time series classification
.
IEEE Access
6
,
1662
1669
.
Kolluru
V.
,
Kolluru
S.
&
Konkathi
P.
2020
Evaluation and integration of reanalysis rainfall products under contrasting climatic conditions in India
.
Atmospheric Research
246
.
https://doi.org/10.1016/J.ATMOSRES.2020.105121
.
Kratzert
F.
,
Klotz
D.
,
Brenner
C.
,
Schulz
K.
&
Herrnegger
M.
2018
Rainfall–runoff modelling using long short-term memory (LSTM) networks
.
Hydrology and Earth System Sciences
22
(
11
),
6005
6022
.
Kumar
V.
,
Chaplot
B.
,
Omar
P. J.
,
Mishra
S.
&
Md. Azamathulla
H.
2021
Experimental study on infiltration pattern: opportunities for sustainable management in the northern region of India
.
Water Science and Technology
84
(
10–11
),
2675
2685
.
Liaw
A.
&
Wiener
M.
2002
Classification and regression by randomForest
.
R. News
2
(
3
),
18e22
.
Liu
Z.
&
Todini
E.
2002
Towards a comprehensive physically-based rainfall-runoff model
.
Hydrology and Earth System Sciences
6
(
5
),
859
881
.
Lu
H.
,
Hou
T.
,
Horton
R.
,
Zhu
Y.
,
Chen
X.
,
Jia
Y.
,
Wang
W.
&
Fu
X.
2013
The streamflow estimation using the Xinanjiang rainfall runoff model and dual state parameter estimation method
.
Journal of Hydrology
480
,
102
114
.
https://doi.org/10.1016/j.jhydrol.2012.12.011
.
Malago
A.
,
Efstathiou
D.
,
Bouraoui
F.
,
Nikolaidis
N. P.
,
Franchini
M.
,
Bidoglio
G.
&
Kritsotakis
M.
2016
Regional scale hydrologic modeling of a karst-dominant geomorphology: the case study of the Island of Crete
.
Journal of Hydrology
540
,
64
81
.
https://doi.org/10.1016/J.JHYDROL.2016.05.061
.
Marcais
J.
&
de-Dreuzy
J. R.
2017
Prospective interest of deep learning for hydrological inference
.
Groundwater
55
,
688
692
.
Mathew
A.
&
Shekar
P. R.
2023
Flood prioritization of basins based on geomorphometric properties using morphometric analysis and principal component analysis: a case study of the Maner River Basin
. In:
River Dynamics and Flood Hazards
.
Springer
,
Singapore
, pp.
273
288
.
https://doi.org/10.1007/978-981-19-7100-6_18
.
Modarres
R.
&
Ouarda
T. B. M. J.
2013
Modeling rainfall–runoff relationship using multivariate GARCH model
.
Journal of Hydrology
499
,
1
18
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van-Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
2007
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
.
American Society of Agricultural and Biological Engineers
50
(
3
),
885
900
.
ISSN 0001 − 2351
.
Nash, J. E. & Sutcliffe, J. V. 1970 River flow forecasting through conceptual models; Part I – a discussion of principles. Journal of Hydrology 10, 282–290.
Nazari-Sharabian
M.
,
Taheriyoun
M.
&
Karakouzian
M.
2020
Sensitivity analysis of the DEM resolution and effective parameters of runoff yield in the SWAT model: a case study
.
Journal of Water Supply: Research and Technology-Aqua
69
(
1
),
39
54
.
https://doi.org/10.2166/aqua.2019.044
.
Neitsch
S. L.
,
Arnold
J. G.
,
Kiniry
J. R.
,
Williams
J. R.
&
King
K. W.
2004
Soil and Water Assessment Tool (Version 2000)—Theoretical Documentation. GSWRL 02-01, BRC 02-05, TR-191
.
Texas Water Resources Institute
,
College Station, Texas
.
Neitsch
S. L.
,
Arnold
J. G.
,
Kiniry
J. R.
&
Williams
J. R.
2011
Soil and Water Assessment Tool Theoretical Documentation Version 2009
.
Texas Water Resources Institute technical report No. 406, Texas A & M University System
, Texas.
Nerantzaki
S. D.
,
Giannakis
G. V.
,
Efstathiou
D.
,
Nikolaidis
N. P.
,
Sibetheros
I. A.
,
Karatzas
G. P.
&
Zacharias
I.
2015
Modeling suspended sediment transport and assessing the impacts of climate change in a karstic Mediterranean watershed
.
Science of the Total Environment
538
,
288
297
.
https://doi.org/10.1016/J.SCITOTENV.2015.07.092
.
Nourani
V.
,
Baghanam
A. D.
,
Adamowski
J.
&
Gebremichael
M.
2013
Using self-organizing maps and wavelet transforms for space–time pre-processing of satellite precipitation and runoff data in neural network based rainfall–runoff modeling
.
Journal of Hydrology
476
.
https://doi.org/10.1016/j.jhydrol.2012.10.054
.
Nourani
V.
,
Gökçekuş
H.
&
Umar
I. K.
2020
Artificial intelligence-based ensemble model for prediction of vehicular traffic noise
.
Environmental Research
180
,
108852
.
https://doi.org/10.1016/j.envres.2019.108852
.
Nyakundi
R.
,
Nyadawa
M.
&
Mwangi
J.
2022
Effect of recharge and abstraction on groundwater levels
.
Civil Engineering Journal
8
(
5
).
doi:10.28991/CEJ-2022-08-05-05
.
Omani
N.
,
Srinivasan
R.
,
Smith
P. K.
&
Karthikeyan
R.
2017
Glacier mass balance simulation using SWAT distributed snow algorithm
.
Hydrological Sciences Journal
62
(
4
),
546
560
.
Omar
P. J.
,
Dwivedi
S. B.
,
Dikshit
P. K. S.
,
2020
Sustainable development and management of groundwater in Varanasi, India BT
. In:
Advances in Water Resources Engineering and Management
(
AlKhaddar
R.
,
Singh
R. K.
,
Dutta
S.
&
Kumari
M.
, eds).
Springer Singapore
,
Singapore
, pp.
201
209
.
Omar
P. J.
,
Gaur
S.
&
Dikshit
P. K. S.
2021a
Conceptualization and development of multilayered groundwater model in transient condition
.
Applied Water Science
11
,
162
.
https://doi.org/10.1007/s13201-021-01485-3
.
Omar
P. J.
,
Gaur
S.
,
Dwivedi
S. B.
,
Dikshit
P. K. S.
,
2021b
Development and application of the integrated GIS-MODFLOW Model BT
. In:
Fate and Transport of Subsurface Pollutants
(
Gupta
P. K.
&
Bharagava
R. N.
, eds).
Springer Singapore
,
Singapore
, pp.
305
314
.
https://doi.org/10.1007/978-981-15-6564-9_16
.
Omar
P. J.
,
Rai
S. P.
&
Tiwari
H.
2022a
Study of morphological changes and socio-economic impact assessment: a case study of koshi river
.
Arabian Journal of Geosciences
15
,
1426
.
https://doi.org/10.1007/s12517-022-10723-0
.
Omar
P. J.
,
Shivhare
N.
,
Dwivedi
S. B.
&
Dikshit
P. K. S.
2022b
Identification of soil erosionprone zone utilizing geo-informatics techniques and WSPM model
.
Sustainable Water Resources Management
8
,
66
.
https://doi.org/10.1007/s40899-022-00654-9
.
Pandey
A.
,
Himanshu
S. K.
,
Mishra
S. K.
&
Singh
V. P.
2016
Physically based soil erosion and sediment yield models revisited
.
Catena
147
,
595
620
.
Rautela
K. S.
,
Kumar
M.
,
Sofi
M. S.
,
Kuniyal
J. C.
&
Bhat
S. U.
2022
Modelling of streamflow and water balance in the Kuttiyadi River Basin using SWAT and remote sensing/GIS tools
.
International Journal of Environmental Research
16
(
4
),
1
4
.
Rautela
K. S.
,
Kumar
D.
,
Gandhi
B. G. R.
,
Kumar
A.
&
Dubey
A. K.
2023
Long-term hydrological simulation for the estimation of snowmelt contribution of Alaknanda River Basin, Uttarakhand using SWAT
.
Journal of Water Supply: Research and Technology-Aqua
2023
,
jws2023176
.
https://doi.org/10.2166/aqua.2023.176
.
Rezaeianzadeh
M.
,
Stein
A.
,
Tabari
H.
,
Abghari
H.
,
Jalalkamali
N.
,
Hosseinipour
E. Z.
&
Singh
V. P.
2013
Assessment of a conceptual hydrological model and artificial neural networks for daily outflows forecasting
.
International Journal of Environmental Science and Technology
10
(
6
),
1181
1192
.
Roy
P. S.
,
Meiyappan
P.
,
Joshi
P. K.
,
Kale
M. P.
,
Srivastav
V. K.
,
Srivasatava
S. K.
,
Behera
M. D.
,
Roy
A.
,
Sharma
Y.
,
Ramachandran
R. M.
,
Bhavani
P.
,
Jain
A. K.
&
Krishnamurthy
Y. V. N.
2016
Decadal Land Use and Land Cover Classifications Across India, 1985, 1995, 2005
.
ORNL DAAC
,
Oak Ridge, Tennessee
,
USA
.
https://doi.org/10.3334/ORNLDAAC/1336
.
Sajikumara
N.
&
Thandaveswara
B. S.
1999
A non-linear rainfall–runoff model using an artificial neural network
.
Journal of Hydrology
216
,
32
55
.
Santhi
C.
,
Arnold
J. G.
,
Williams
J. R.
,
Dugas
W. A.
,
Srinivasan
R.
&
Hauck
L. M.
2001
Validation of the SWAT model on a large river basin with point and nonpoint sources
.
Journal of the American Water Resources Association
37
(
5
),
1169
1188
.
Santos
C. A. G.
&
da-Silva
G. B. L.
2014
Daily streamflow forecasting using a wavelet transform and artificial neural network hybrid models
.
Hydrological Sciences Journal
59
(
2
),
312
324
.
https://doi.org/10.1080/02626667.2013.800944
.
Schmalz
B.
,
Kuemmerlen
M.
,
Kiesel
J.
,
Cai
Q.
,
Jähnig
S. C.
&
Fohrer
N.
2015
Impacts of land use changes on hydrological components and macroinvertebrate distributions in the Poyang lake area
.
Ecohydrology
8
(
6
),
1119
1136
.
https://doi.org/10.1002/ECO.1569
.
Schmidhuber
J.
2015
Deep learning in neural networks: an overview
.
Neural Networks
61
,
85
117
.
2015
.
Setti
S.
,
Maheswaran
R.
,
Sridhar
V.
,
Barik
K. K.
,
Merz
B.
&
Agarwal
A.
2020
Inter-comparison of gauge-based gridded data, reanalysis and satellite precipitation product with an emphasis on hydrological modeling
.
Atmosphere
11
(
11
),
1
26
.
https://doi.org/10.3390/ATMOS11111252
.
Shamseldin
A. Y.
2010
Artificial neural network model for river flow forecasting in a developing country
.
22
35
.
https://doi.org/10.2166/hydro.2010.027
.
Shekar
P. R.
&
Mathew
A.
2022a
Morphometric analysis for prioritizing sub-watersheds of Murredu River basin, Telangana State, India, using a geographical information system
.
Journal of Engineering and Applied Science
69
,
44
.
https://doi.org/10.1186/s44147-022-00094-4
.
Shekar
P. R.
&
Mathew
A.
2023a
Detection of land use/land cover changes in a watershed: a case study of the Murredu watershed in Telangana state, India
.
Watershed Ecology and the Environment
5
,
46
55
.
https://doi.org/10.1016/j.wsee.2022.12.003
.
Shekar
P. R.
&
Mathew
A.
2023b
Erosion susceptibility mapping based on hypsometric analysis using remote sensing and geographical information system techniques
. In:
River Dynamics and Flood Hazards
.
Springer
,
Singapore
, pp.
365
382
.
https://doi.org/10.1007/978-981-19-7100-6_26
.
Shekhar
S.
,
Chauhan
M. S.
,
Omar
P. J.
,
Jha
M.
,
2021
River discharge study in River Ganga, Varanasi using conventional and modern techniques BT
. In:
The Ganga River Basin: A Hydrometeorological Approach
(
Chauhan
M. S.
&
Ojha
C. S. P.
, eds).
Springer International Publishing
,
Cham
, pp.
101
113
.
https://doi.org/10.1007/978-3-030-60869-9_7
.
Shen
C.
2018
A transdisciplinary review of deep learning research and its relevance for water resources scientists
.
Water Resources Research
54
.
https://doi.org/10.1029/2018WR022643
.
Shoaib
M.
,
Shamseldin
A. Y.
&
Melville
B. W.
2014
Comparative study of different wavelet based neural network models for rainfall-runoff modeling
.
Journal of Hydrology
515
,
47
58
.
https://doi.org/10.1016/j.jhydrol.2014.04.055
.
Singh
H.
&
Sankarasubramanian
A.
2014
Systematic uncertainty reduction strategies for developing streamflow forecasts utilizing multiple climate models and hydrologic models
.
Water Resources Research
50
(
2
),
1288
1307
.
https://doi.org/10.1002/2013WR013855
.
Singh
A. N.
,
Mudgal
A.
,
Tripathi
R. P.
&
Omar
P. J.
2023
Assessment of wastewater treatment potential of sand beds of River Ganga at Varanasi, India
.
AQUA-Water Infrastructure, Ecosystems and Society
72
(
5
),
690
700
.
Sinnathamby
S.
,
Douglas-Mankin
K. R.
&
Craige
C.
2017
Field-scale calibration of crop-yield parameters in the Soil and Water Assessment Tool (SWAT)
.
Agricultural Water Management
180
,
61
69
.
Srivastava
S.
,
Omar
P. J.
,
Shekhar
S.
&
Gupta
S.
2023
Study of acidic air pollutant (SO2 and NO2) tolerance of microalgae with sodium bicarbonate as growth stimulant
.
AQUA-Water Infrastructure, Ecosystems and Society
72
(
5
),
739
749
.
Suwarno
I.
,
Ma'arif
A.
,
Raharja
N. M.
,
Nurjanah
A.
,
Ikhsan
J.
&
Mutiarin
D.
2020
IoT-based lava flood early warning system with rainfall intensity monitoring and disaster communication technology
.
Emerging Science Journal
4
(
2021
).
Special Issue ‘IoT, IoV, and Blockchain’. doi:10.28991/esj-2021-SP1-011
.
Swain
S.
,
Verma
M. K.
&
Verma
M. K.
2018
Streamflow estimation using SWAT model over Seonath river basin, Chhattisgarh, India
. In:
Hydrologic Modeling
(Singh, V., Yadav, S. & Yadava, R., eds.).
Springer
,
Singapore
, pp.
659
665
.
Tan
M. L.
,
Gassman
P. W.
,
Liang
J.
&
Haywood
J. M.
2021
A review of alternative climate products for SWAT modelling: sources, assessment and future directions
.
Science of the Total Environment
795
,
1
16
.
https://doi.org/10.1016/J.SCITOTENV.2021.148915
.
Thavhana
M. P.
,
Savage
M. J.
&
Moeletsi
M. E.
2018
SWAT model uncertainty analysis, calibration and validation for runoff simulation in the Luvuvhu River catchment, South Africa
.
Physics and Chemistry of the Earth
105
,
115
124
.
10.1016/j.pce.2018.03.012
.
Tokar
A. S.
&
Johnson
P. A.
1999
Rainfall–runoff modeling using artificial neural networks
.
Journal of Hydrologic Engineering
4
,
232
239
.
Tokar
A. S.
&
Markus
M.
2000
Precipitation-runoff modeling using artificial neural networks and conceptual models
.
ASCE Journal of Hydrologic Engineering
2
,
156
161
.
Tripathi
M. P.
,
Panda
R. K.
,
Raghuwanshi
N. S.
&
Singh
R.
2004
Hydrological modelling of a small watershed using generated rainfall in the soil and water assessment tool model
.
Hydrological Processes
18
(
10
),
1811
1821
.
Umugwaneza
A.
,
Chen
X.
,
Liu
T.
,
Mind'je
R.
,
Uwineza
A.
,
Kayumba
P. M.
,
Uwamahoro
S.
,
Umuhoza
J.
,
Gasirabo
A.
&
Maniraho
A. P.
2022
Integrating a GIS-based approach and a SWAT model to identify potential suitable sites for rainwater harvesting in Rwanda
.
Journal of Water Supply: Research and Technology-Aqua
71
(
3
),
415
432
.
https://doi.org/10.2166/aqua.2022.111
.
Van-Liew
M. W.
,
Veith
T. L.
,
Bosch
D. D.
&
Arnold
J. G.
2007
Suitability of SWAT for the conservation effects assessment project: a comparison on USDA-ARS experimental watersheds
.
Journal of Hydrologic Engineering
12
(
2
),
173
189
.
Veettil
V. A.
&
Mishra
A. K.
2020
Multiscale hydrological drought analysis: role of climate, catchment and morphological variables and associated thresholds
.
Journal of Hydrology
582
,
1
17
.
https://doi.org/10.1016/j.jhydrol.2019.124533
.
Wu
C. L.
,
Chau
K. W.
&
Li
Y. S.
2009
Predicting monthly streamflow using data-driven models coupled with data-preprocessing techniques
.
Water Resources Research
45
(
8
),
1
23
.
https://doi.org/10.1029/2007WR006737
.
Xiang
Z.
,
Yan
J.
&
Demir
I.
2020
A rainfall-runoff model with LSTM-based sequence-to-sequence learning
.
Water Resources Research
56
,
e2019WR025326
.
Yin
H.
,
Zhang
X.
,
Wang
F.
,
Zhang
Y.
,
Xia
R.
&
Jin
J.
2021
Rainfall-runoff modeling using LSTM-based multi-state-vector sequence-to-sequence model
.
Journal of Hydrology
598
,
126378
.
Yuan
X.
,
Chen
C.
,
Lei
X.
,
Yuan
Y.
&
Adnan
R. M.
2018
Monthly runoff forecasting based on LSTM-ALO model
.
Stochastic Environmental Research and Risk Assessment
32
(
8
),
2199
2212
.
Zakizadeh
H.
,
Ahmadi
H.
,
Zehtabian
G.
,
Moeini
A.
&
Moghaddamnia
A.
2020
A novel study of SWAT and ANN models for runoff simulation with application on dataset of metrological stations
.
Physics and Chemistry of the Earth
120
,
102899
.
Zhang
J.
,
Zhu
Y.
,
Zhang
X.
,
Ye
M.
&
Yang
J.
2018
Developing a Long Short-Term Memory (LSTM) based model for predicting water table depth in agricultural areas
.
Journal of Hydrology
,
918
929
.
https://doi.org/10.1016/j.jhydrol.2018.04.065
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).